Feature Selection Methods Simultaneously Improve the Detection Accuracy and Model Building Time of Machine Learning Classifiers
The detection accuracy and model building time of machine learning (ML) classifiers are vital aspects for an intrusion detection system (IDS) to predict attacks in real life. Recently, researchers have introduced feature selection methods to increase the detection accuracy and minimize the model bui...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-08-01
|
Series: | Symmetry |
Subjects: | |
Online Access: | https://www.mdpi.com/2073-8994/12/9/1424 |
id |
doaj-15d9da6d94274c45986c2380ffe107dd |
---|---|
record_format |
Article |
spelling |
doaj-15d9da6d94274c45986c2380ffe107dd2020-11-25T03:40:08ZengMDPI AGSymmetry2073-89942020-08-01121424142410.3390/sym12091424Feature Selection Methods Simultaneously Improve the Detection Accuracy and Model Building Time of Machine Learning ClassifiersSaleh Alabdulwahab0BongKyo Moon1Department of Computer Science & Engineering, Dongguk University, 30 Pildong-ro 1-gil Jung-gu, Seoul 04620, KoreaDepartment of Computer Science & Engineering, Dongguk University, 30 Pildong-ro 1-gil Jung-gu, Seoul 04620, KoreaThe detection accuracy and model building time of machine learning (ML) classifiers are vital aspects for an intrusion detection system (IDS) to predict attacks in real life. Recently, researchers have introduced feature selection methods to increase the detection accuracy and minimize the model building time of a limited number of ML classifiers. Therefore, identifying more ML classifiers with very high detection accuracy and the lowest possible model building time is necessary. In this study, the authors tested six supervised classifiers on a full NSL-KDD training dataset (a benchmark record for Internet traffic) using 10-fold cross-validation in the Weka tool with and without feature selection/reduction methods. The authors aimed to identify more options to outperform and secure classifiers with the highest detection accuracy and lowest model building time. The results show that the feature selection/reduction methods, including the wrapper method in combination with the discretize filter, the filter method in combination with the discretize filter, and the discretize filter, can significantly decrease model building time without compromising detection accuracy. The suggested ML algorithms and feature selection/reduction methods are automated pattern recognition approaches to detect network attacks, which are within the scope of the Symmetry journal.https://www.mdpi.com/2073-8994/12/9/1424IDSML classifiersinformation securitynetworkWekaNSL-KDD |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Saleh Alabdulwahab BongKyo Moon |
spellingShingle |
Saleh Alabdulwahab BongKyo Moon Feature Selection Methods Simultaneously Improve the Detection Accuracy and Model Building Time of Machine Learning Classifiers Symmetry IDS ML classifiers information security network Weka NSL-KDD |
author_facet |
Saleh Alabdulwahab BongKyo Moon |
author_sort |
Saleh Alabdulwahab |
title |
Feature Selection Methods Simultaneously Improve the Detection Accuracy and Model Building Time of Machine Learning Classifiers |
title_short |
Feature Selection Methods Simultaneously Improve the Detection Accuracy and Model Building Time of Machine Learning Classifiers |
title_full |
Feature Selection Methods Simultaneously Improve the Detection Accuracy and Model Building Time of Machine Learning Classifiers |
title_fullStr |
Feature Selection Methods Simultaneously Improve the Detection Accuracy and Model Building Time of Machine Learning Classifiers |
title_full_unstemmed |
Feature Selection Methods Simultaneously Improve the Detection Accuracy and Model Building Time of Machine Learning Classifiers |
title_sort |
feature selection methods simultaneously improve the detection accuracy and model building time of machine learning classifiers |
publisher |
MDPI AG |
series |
Symmetry |
issn |
2073-8994 |
publishDate |
2020-08-01 |
description |
The detection accuracy and model building time of machine learning (ML) classifiers are vital aspects for an intrusion detection system (IDS) to predict attacks in real life. Recently, researchers have introduced feature selection methods to increase the detection accuracy and minimize the model building time of a limited number of ML classifiers. Therefore, identifying more ML classifiers with very high detection accuracy and the lowest possible model building time is necessary. In this study, the authors tested six supervised classifiers on a full NSL-KDD training dataset (a benchmark record for Internet traffic) using 10-fold cross-validation in the Weka tool with and without feature selection/reduction methods. The authors aimed to identify more options to outperform and secure classifiers with the highest detection accuracy and lowest model building time. The results show that the feature selection/reduction methods, including the wrapper method in combination with the discretize filter, the filter method in combination with the discretize filter, and the discretize filter, can significantly decrease model building time without compromising detection accuracy. The suggested ML algorithms and feature selection/reduction methods are automated pattern recognition approaches to detect network attacks, which are within the scope of the Symmetry journal. |
topic |
IDS ML classifiers information security network Weka NSL-KDD |
url |
https://www.mdpi.com/2073-8994/12/9/1424 |
work_keys_str_mv |
AT salehalabdulwahab featureselectionmethodssimultaneouslyimprovethedetectionaccuracyandmodelbuildingtimeofmachinelearningclassifiers AT bongkyomoon featureselectionmethodssimultaneouslyimprovethedetectionaccuracyandmodelbuildingtimeofmachinelearningclassifiers |
_version_ |
1724536155105394688 |