T‐SNERF: A novel high accuracy machine learning approach for Intrusion Detection Systems

Abstract In the last few decades, Intrusion Detection System (IDS), in particular, machine learning‐based anomaly detection, has gained importance over Signature Detection Systems (SDSs) in the novel attacks detection. Herein, a novel approach called T‐Distributed Stochastic Neighbour Embedding and...

Full description

Bibliographic Details
Main Authors: Mohamed Hammad, Nabil Hewahi, Wael Elmedany
Format: Article
Language:English
Published: Wiley 2021-03-01
Series:IET Information Security
Online Access:https://doi.org/10.1049/ise2.12020
id doaj-208eed3588bc486e8348621cd3cd1c11
record_format Article
spelling doaj-208eed3588bc486e8348621cd3cd1c112021-07-14T13:25:01ZengWileyIET Information Security1751-87091751-87172021-03-0115217819010.1049/ise2.12020T‐SNERF: A novel high accuracy machine learning approach for Intrusion Detection SystemsMohamed Hammad0Nabil Hewahi1Wael Elmedany2College of Information Technology University of Bahrain Sakhir Manama BahrainCollege of Information Technology University of Bahrain Sakhir Manama BahrainCollege of Information Technology University of Bahrain Sakhir Manama BahrainAbstract In the last few decades, Intrusion Detection System (IDS), in particular, machine learning‐based anomaly detection, has gained importance over Signature Detection Systems (SDSs) in the novel attacks detection. Herein, a novel approach called T‐Distributed Stochastic Neighbour Embedding and Random Forest Algorithm (T‐SNERF) is presented for the classification of cyber‐attacks. The approach consists of three different steps. First, the examination of feature correlations is provided. Second, the T‐Distributed Stochastic Neighbour Embedding (T‐SNE) data dimensional reduction technique is used. Third, Random Forest (RF) technique is utilised to evaluate the complications in the accuracy and False‐Positive Rate (FPR). The proposed approach has been tested on various well‐known datasets, namely, UNSW‐NB 15, CICIDS‐2017, and phishing datasets. The proposed novel approach achieved significant results compared with existing approaches, achieving 100% accuracy, and 0% FPR for the UNSW‐NB15 dataset, and achieving high accuracy rates, up to 99.7878%, and 99.7044%, for CICIDS‐2017 and Phishing datasets respectively.https://doi.org/10.1049/ise2.12020
collection DOAJ
language English
format Article
sources DOAJ
author Mohamed Hammad
Nabil Hewahi
Wael Elmedany
spellingShingle Mohamed Hammad
Nabil Hewahi
Wael Elmedany
T‐SNERF: A novel high accuracy machine learning approach for Intrusion Detection Systems
IET Information Security
author_facet Mohamed Hammad
Nabil Hewahi
Wael Elmedany
author_sort Mohamed Hammad
title T‐SNERF: A novel high accuracy machine learning approach for Intrusion Detection Systems
title_short T‐SNERF: A novel high accuracy machine learning approach for Intrusion Detection Systems
title_full T‐SNERF: A novel high accuracy machine learning approach for Intrusion Detection Systems
title_fullStr T‐SNERF: A novel high accuracy machine learning approach for Intrusion Detection Systems
title_full_unstemmed T‐SNERF: A novel high accuracy machine learning approach for Intrusion Detection Systems
title_sort t‐snerf: a novel high accuracy machine learning approach for intrusion detection systems
publisher Wiley
series IET Information Security
issn 1751-8709
1751-8717
publishDate 2021-03-01
description Abstract In the last few decades, Intrusion Detection System (IDS), in particular, machine learning‐based anomaly detection, has gained importance over Signature Detection Systems (SDSs) in the novel attacks detection. Herein, a novel approach called T‐Distributed Stochastic Neighbour Embedding and Random Forest Algorithm (T‐SNERF) is presented for the classification of cyber‐attacks. The approach consists of three different steps. First, the examination of feature correlations is provided. Second, the T‐Distributed Stochastic Neighbour Embedding (T‐SNE) data dimensional reduction technique is used. Third, Random Forest (RF) technique is utilised to evaluate the complications in the accuracy and False‐Positive Rate (FPR). The proposed approach has been tested on various well‐known datasets, namely, UNSW‐NB 15, CICIDS‐2017, and phishing datasets. The proposed novel approach achieved significant results compared with existing approaches, achieving 100% accuracy, and 0% FPR for the UNSW‐NB15 dataset, and achieving high accuracy rates, up to 99.7878%, and 99.7044%, for CICIDS‐2017 and Phishing datasets respectively.
url https://doi.org/10.1049/ise2.12020
work_keys_str_mv AT mohamedhammad tsnerfanovelhighaccuracymachinelearningapproachforintrusiondetectionsystems
AT nabilhewahi tsnerfanovelhighaccuracymachinelearningapproachforintrusiondetectionsystems
AT waelelmedany tsnerfanovelhighaccuracymachinelearningapproachforintrusiondetectionsystems
_version_ 1721302775344660480