Lessons Learnt on Reproducibility in Machine Learning Based Android Malware Detection

DAOUDI, Nadia; ALLIX, Kevin; BISSYANDE, Tegawendé François D Assise; KLEIN, Jacques

doi:10.1007/s10664-021-09955-7

Download

Article (Scientific journals)

Lessons Learnt on Reproducibility in Machine Learning Based Android Malware Detection

DAOUDI, Nadia; ALLIX, Kevin; BISSYANDE, Tegawendé François D Assise et al.

2021 • In Empirical Software Engineering, 26

Peer Reviewed verified by ORBi

Permalink
https://hdl.handle.net/10993/47296

DOI
10.1007/s10664-021-09955-7

Files (1)Send to Details Statistics Bibliography Similar publications

Files

Full Text

Daoudi2021_Article_LessonsLearntOnReproducibility.pdf

Publisher postprint (2.47 MB)

Download

All documents in ORBilu are protected by a user license.

Send to

RIS BibTex APA Chicago Permalink X Linkedin

Details

Keywords :

Android malware detection; Reproducibility; Replicability; Machine learning

Abstract :

[en] A well-known curse of computer security research is that it often produces systems that, while technically sound, fail operationally. To overcome this curse, the community generally seeks to assess proposed systems under a variety of settings in order to make explicit every potential bias. In this respect, recently, research achievements on machine learning based malware detection are being considered for thorough evaluation by the community. Such an effort of comprehensive evaluation supposes first and foremost the possibility to perform an independent reproduction study in order to sharpen evaluations presented by approaches’ authors. The question Can published approaches actually be reproduced? thus becomes paramount despite the little interest such mundane and practical aspects seem to attract in the malware detection field. In this paper, we attempt a complete reproduction of five Android Malware Detectors from the literature and discuss to what extent they are “reproducible”. Notably, we provide insights on the implications around the guesswork that may be required to finalise a working implementation. Finally, we discuss how barriers to reproduction could be lifted, and how the malware detection field would benefit from stronger reproducibility standards—like many various fields already have.

Disciplines :

Computer science

Author, co-author :

DAOUDI, Nadia ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > TruX

ALLIX, Kevin ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > TruX

BISSYANDE, Tegawendé François D Assise ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > TruX

KLEIN, Jacques ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > TruX

External co-authors :

Language :

English

Title :

Lessons Learnt on Reproducibility in Machine Learning Based Android Malware Detection

Publication date :

2021

Journal title :

Empirical Software Engineering

ISSN :

1382-3256

eISSN :

1573-7616

Publisher :

Springer, United States

Volume :

Peer reviewed :

Peer Reviewed verified by ORBi

Focus Area :

Security, Reliability and Trust

Additional URL :

https://doi.org/10.1007/s10664-021-09955-7

FnR Project :

FNR11693861 - Characterization Of Malicious Code In Mobile Apps: Towards Accurate And Explainable Malware Detection, 2017 (01/06/2018-31/12/2021) - Jacques Klein

Funders :

FNR - Fonds National de la Recherche
University of Luxembourg - UL
SPARTA
Luxembourg Ministry of Foreign and European Affairs

Available on ORBilu :

since 27 May 2021

Statistics

Number of views

694 (58 by Unilu)

Number of downloads

360 (25 by Unilu)

More statistics

Scopus citations^®

Scopus citations^®
without self-citations

OpenCitations

OpenAlex citations

WoS citations^™