Feature Selection Methods Simultaneously Improve the Detection Accuracy and Model Building Time of Machine Learning Classifiers

The detection accuracy and model building time of machine learning (ML) classifiers are vital aspects for an intrusion detection system (IDS) to predict attacks in real life. Recently, researchers have introduced feature selection methods to increase the detection accuracy and minimize the model bui...

Full description

Bibliographic Details
Main Authors: Saleh Alabdulwahab, BongKyo Moon
Format: Article
Language:English
Published: MDPI AG 2020-08-01
Series:Symmetry
Subjects:
IDS
Online Access:https://www.mdpi.com/2073-8994/12/9/1424
id doaj-15d9da6d94274c45986c2380ffe107dd
record_format Article
spelling doaj-15d9da6d94274c45986c2380ffe107dd2020-11-25T03:40:08ZengMDPI AGSymmetry2073-89942020-08-01121424142410.3390/sym12091424Feature Selection Methods Simultaneously Improve the Detection Accuracy and Model Building Time of Machine Learning ClassifiersSaleh Alabdulwahab0BongKyo Moon1Department of Computer Science & Engineering, Dongguk University, 30 Pildong-ro 1-gil Jung-gu, Seoul 04620, KoreaDepartment of Computer Science & Engineering, Dongguk University, 30 Pildong-ro 1-gil Jung-gu, Seoul 04620, KoreaThe detection accuracy and model building time of machine learning (ML) classifiers are vital aspects for an intrusion detection system (IDS) to predict attacks in real life. Recently, researchers have introduced feature selection methods to increase the detection accuracy and minimize the model building time of a limited number of ML classifiers. Therefore, identifying more ML classifiers with very high detection accuracy and the lowest possible model building time is necessary. In this study, the authors tested six supervised classifiers on a full NSL-KDD training dataset (a benchmark record for Internet traffic) using 10-fold cross-validation in the Weka tool with and without feature selection/reduction methods. The authors aimed to identify more options to outperform and secure classifiers with the highest detection accuracy and lowest model building time. The results show that the feature selection/reduction methods, including the wrapper method in combination with the discretize filter, the filter method in combination with the discretize filter, and the discretize filter, can significantly decrease model building time without compromising detection accuracy. The suggested ML algorithms and feature selection/reduction methods are automated pattern recognition approaches to detect network attacks, which are within the scope of the Symmetry journal.https://www.mdpi.com/2073-8994/12/9/1424IDSML classifiersinformation securitynetworkWekaNSL-KDD
collection DOAJ
language English
format Article
sources DOAJ
author Saleh Alabdulwahab
BongKyo Moon
spellingShingle Saleh Alabdulwahab
BongKyo Moon
Feature Selection Methods Simultaneously Improve the Detection Accuracy and Model Building Time of Machine Learning Classifiers
Symmetry
IDS
ML classifiers
information security
network
Weka
NSL-KDD
author_facet Saleh Alabdulwahab
BongKyo Moon
author_sort Saleh Alabdulwahab
title Feature Selection Methods Simultaneously Improve the Detection Accuracy and Model Building Time of Machine Learning Classifiers
title_short Feature Selection Methods Simultaneously Improve the Detection Accuracy and Model Building Time of Machine Learning Classifiers
title_full Feature Selection Methods Simultaneously Improve the Detection Accuracy and Model Building Time of Machine Learning Classifiers
title_fullStr Feature Selection Methods Simultaneously Improve the Detection Accuracy and Model Building Time of Machine Learning Classifiers
title_full_unstemmed Feature Selection Methods Simultaneously Improve the Detection Accuracy and Model Building Time of Machine Learning Classifiers
title_sort feature selection methods simultaneously improve the detection accuracy and model building time of machine learning classifiers
publisher MDPI AG
series Symmetry
issn 2073-8994
publishDate 2020-08-01
description The detection accuracy and model building time of machine learning (ML) classifiers are vital aspects for an intrusion detection system (IDS) to predict attacks in real life. Recently, researchers have introduced feature selection methods to increase the detection accuracy and minimize the model building time of a limited number of ML classifiers. Therefore, identifying more ML classifiers with very high detection accuracy and the lowest possible model building time is necessary. In this study, the authors tested six supervised classifiers on a full NSL-KDD training dataset (a benchmark record for Internet traffic) using 10-fold cross-validation in the Weka tool with and without feature selection/reduction methods. The authors aimed to identify more options to outperform and secure classifiers with the highest detection accuracy and lowest model building time. The results show that the feature selection/reduction methods, including the wrapper method in combination with the discretize filter, the filter method in combination with the discretize filter, and the discretize filter, can significantly decrease model building time without compromising detection accuracy. The suggested ML algorithms and feature selection/reduction methods are automated pattern recognition approaches to detect network attacks, which are within the scope of the Symmetry journal.
topic IDS
ML classifiers
information security
network
Weka
NSL-KDD
url https://www.mdpi.com/2073-8994/12/9/1424
work_keys_str_mv AT salehalabdulwahab featureselectionmethodssimultaneouslyimprovethedetectionaccuracyandmodelbuildingtimeofmachinelearningclassifiers
AT bongkyomoon featureselectionmethodssimultaneouslyimprovethedetectionaccuracyandmodelbuildingtimeofmachinelearningclassifiers
_version_ 1724536155105394688