A review of the advances in cyber security benchmark datasets for evaluating data-driven based intrusion detection systems

Bibliographic Details
Main Authors: Ibrahim, Adamu Abubakar; Haruna, Chiroma; Abdullahi Muaz, Sanah; Baballe Ila, Libabatu
Format: Article
Language: English
Published: Elsevier Ltd. 2015
Online Access: http://irep.iium.edu.my/44715/
http://irep.iium.edu.my/44715/1/Elsevierr.pdf
http://irep.iium.edu.my/44715/4/44715_A%20review%20of%20the%20advances%20in%20cyber%20security%20benchmark_Scopus.pdf
Description
Summary: Cybercrime has led to the loss of billions of dollars, the malfunctioning of computer systems, the destruction of critical information, the compromising of network integrity and confidentiality, etc. In view of these crimes committed on a daily basis, the security of computer systems has become imperative to minimize and possibly avoid the impact of cybercrime. In this paper, we review recent advances in the use of cyber security benchmark datasets for the evaluation of machine learning and data mining-based intrusion detection systems. It was found that the state-of-the-art cyber security benchmark datasets KDD and UNM are no longer reliable, because they cannot meet the expectations of current advances in computer technology. As a result, a new ADFA Linux (ADFA-LD) cyber security benchmark dataset for the evaluation of machine learning and data mining-based intrusion detection systems was proposed in 2013 to meet the current significant advances in computer technology. ADFA-LD requires improvement in terms of full descriptions of its attributes. This review can be used by the research community as a basis for abandoning the previous state-of-the-art cyber security benchmark datasets and starting to use the newly introduced benchmark dataset for effective and robust evaluation of machine learning and data mining-based intrusion detection systems.