Building an Effective Intrusion Detection System by Using Hybrid Data Optimization Based on Machine Learning Algorithms

Intrusion detection system (IDS) can effectively identify anomaly behaviors in the network; however, it still has low detection rate and high false alarm rate especially for anomalies with fewer records. In this paper, we propose an effective IDS by using hybrid data optimization which consists of t...

Full description

Bibliographic Details
Main Authors: Jiadong Ren, Jiawei Guo, Wang Qian, Huang Yuan, Xiaobing Hao, Hu Jingjing
Format: Article
Language:English
Published: Hindawi-Wiley 2019-01-01
Series:Security and Communication Networks
Online Access:http://dx.doi.org/10.1155/2019/7130868
id doaj-74c31c26b7784dea9a18da3a007cffc2
record_format Article
spelling doaj-74c31c26b7784dea9a18da3a007cffc22020-11-25T01:18:06ZengHindawi-WileySecurity and Communication Networks1939-01141939-01222019-01-01201910.1155/2019/71308687130868Building an Effective Intrusion Detection System by Using Hybrid Data Optimization Based on Machine Learning AlgorithmsJiadong Ren0Jiawei Guo1Wang Qian2Huang Yuan3Xiaobing Hao4Hu Jingjing5Computer Virtual Technology and System Integration Laboratory of Hebei Province, College of Information Science and Engineering, Yanshan University, Qinhuangdao, Hebei, 066000, ChinaComputer Virtual Technology and System Integration Laboratory of Hebei Province, College of Information Science and Engineering, Yanshan University, Qinhuangdao, Hebei, 066000, ChinaComputer Virtual Technology and System Integration Laboratory of Hebei Province, College of Information Science and Engineering, Yanshan University, Qinhuangdao, Hebei, 066000, ChinaHebei University of Engineering, School of Information & Electrical Engineering, Hebei Handan, 056038, ChinaComputer Virtual Technology and System Integration Laboratory of Hebei Province, College of Information Science and Engineering, Yanshan University, Qinhuangdao, Hebei, 066000, ChinaBeijing Key Laboratory of Software Security Engineering Technique, Beijing Institute of Technology, 5 South Zhongguancun Street, Haidian District, Beijing, 100081, ChinaIntrusion detection system (IDS) can effectively identify anomaly behaviors in the network; however, it still has low detection rate and high false alarm rate especially for anomalies with fewer records. In this paper, we propose an effective IDS by using hybrid data optimization which consists of two parts: data sampling and feature selection, called DO_IDS. In data sampling, the Isolation Forest (iForest) is used to eliminate outliers, genetic algorithm (GA) to optimize the sampling ratio, and the Random Forest (RF) classifier as the evaluation criteria to obtain the optimal training dataset. In feature selection, GA and RF are used again to obtain the optimal feature subset. Finally, an intrusion detection system based on RF is built using the optimal training dataset obtained by data sampling and the features selected by feature selection. The experiment will be carried out on the UNSW-NB15 dataset. Compared with other algorithms, the model has obvious advantages in detecting rare anomaly behaviors.http://dx.doi.org/10.1155/2019/7130868
collection DOAJ
language English
format Article
sources DOAJ
author Jiadong Ren
Jiawei Guo
Wang Qian
Huang Yuan
Xiaobing Hao
Hu Jingjing
spellingShingle Jiadong Ren
Jiawei Guo
Wang Qian
Huang Yuan
Xiaobing Hao
Hu Jingjing
Building an Effective Intrusion Detection System by Using Hybrid Data Optimization Based on Machine Learning Algorithms
Security and Communication Networks
author_facet Jiadong Ren
Jiawei Guo
Wang Qian
Huang Yuan
Xiaobing Hao
Hu Jingjing
author_sort Jiadong Ren
title Building an Effective Intrusion Detection System by Using Hybrid Data Optimization Based on Machine Learning Algorithms
title_short Building an Effective Intrusion Detection System by Using Hybrid Data Optimization Based on Machine Learning Algorithms
title_full Building an Effective Intrusion Detection System by Using Hybrid Data Optimization Based on Machine Learning Algorithms
title_fullStr Building an Effective Intrusion Detection System by Using Hybrid Data Optimization Based on Machine Learning Algorithms
title_full_unstemmed Building an Effective Intrusion Detection System by Using Hybrid Data Optimization Based on Machine Learning Algorithms
title_sort building an effective intrusion detection system by using hybrid data optimization based on machine learning algorithms
publisher Hindawi-Wiley
series Security and Communication Networks
issn 1939-0114
1939-0122
publishDate 2019-01-01
description Intrusion detection system (IDS) can effectively identify anomaly behaviors in the network; however, it still has low detection rate and high false alarm rate especially for anomalies with fewer records. In this paper, we propose an effective IDS by using hybrid data optimization which consists of two parts: data sampling and feature selection, called DO_IDS. In data sampling, the Isolation Forest (iForest) is used to eliminate outliers, genetic algorithm (GA) to optimize the sampling ratio, and the Random Forest (RF) classifier as the evaluation criteria to obtain the optimal training dataset. In feature selection, GA and RF are used again to obtain the optimal feature subset. Finally, an intrusion detection system based on RF is built using the optimal training dataset obtained by data sampling and the features selected by feature selection. The experiment will be carried out on the UNSW-NB15 dataset. Compared with other algorithms, the model has obvious advantages in detecting rare anomaly behaviors.
url http://dx.doi.org/10.1155/2019/7130868
work_keys_str_mv AT jiadongren buildinganeffectiveintrusiondetectionsystembyusinghybriddataoptimizationbasedonmachinelearningalgorithms
AT jiaweiguo buildinganeffectiveintrusiondetectionsystembyusinghybriddataoptimizationbasedonmachinelearningalgorithms
AT wangqian buildinganeffectiveintrusiondetectionsystembyusinghybriddataoptimizationbasedonmachinelearningalgorithms
AT huangyuan buildinganeffectiveintrusiondetectionsystembyusinghybriddataoptimizationbasedonmachinelearningalgorithms
AT xiaobinghao buildinganeffectiveintrusiondetectionsystembyusinghybriddataoptimizationbasedonmachinelearningalgorithms
AT hujingjing buildinganeffectiveintrusiondetectionsystembyusinghybriddataoptimizationbasedonmachinelearningalgorithms
_version_ 1725143671639113728