Skip to main navigation Skip to search Skip to main content

Abstract

Several supervised machine learning models have been proposed and used to detect Android ransomware. These models were trained using different datasets from different sources. However, the age of the ransomware datasets was not considered when training and testing these models. Therefore, the detection accuracy for those models is inaccurate since they learned using features from specific ransomware, old or new ransomware, and they did not learn using diverse ransomware features from different ages. This paper sheds light on the importance of considering the age of ransomware datasets and its effects on the detection accuracy of supervised machine learning models. This proves that supervised machine learning models trained using new ransomware dataset are inefficient in detecting old types of ransomware and vice versa. Moreover, this paper collected a large and diverse dataset of ransomware applications that comprises new and old ransomware developed during the period 2008–2020. Furthermore, the paper proposes a supervised machine learning model that is trained and tested using the diverse dataset. The experiments show that the proposed model is efficient in detecting Android ransomware regardless of its age by achieving an accuracy of approximately 97.48%. Moreover, the results shows that the proposed model outperforms the state-of-the-art approaches considered in this work.

Original languageEnglish
Article number193
JournalInformation (Switzerland)
Volume14
Issue number3
DOIs
StatePublished - Mar 2023

Keywords

  • Android malware
  • information security
  • ransomware
  • supervised machine learning

Fingerprint

Dive into the research topics of 'The Effect of the Ransomware Dataset Age on the Detection Accuracy of Machine Learning Models'. Together they form a unique fingerprint.

Cite this