Artificial Intelligence Technology for Malware Family Detection

碩士 === 國立臺灣科技大學 === 電機工程系 === 107 === The rapid development of Internet of Things (IoT) devices and communication technologies have greatly expanded the application of the internet. In response to people’s pursuit of high quality of life, the number of IoT devices and related services have increased...

Full description

Bibliographic Details
Main Authors: Wen-Han Kuo, 郭文翰
Other Authors: Jiann-Liang Chen
Format: Others
Language:en_US
Published: 2019
Online Access:http://ndltd.ncl.edu.tw/handle/3q8ee3
id ndltd-TW-107NTUS5442088
record_format oai_dc
spelling ndltd-TW-107NTUS54420882019-10-24T05:20:25Z http://ndltd.ncl.edu.tw/handle/3q8ee3 Artificial Intelligence Technology for Malware Family Detection 人工智慧技術於惡意軟體家族偵測之研究 Wen-Han Kuo 郭文翰 碩士 國立臺灣科技大學 電機工程系 107 The rapid development of Internet of Things (IoT) devices and communication technologies have greatly expanded the application of the internet. In response to people’s pursuit of high quality of life, the number of IoT devices and related services have increased annually. However, the importance of information security has been overlooked by majority of people, promoting hackers and those with ulterior motives to use malware to attack security holes in Internet applications. With the number of attack incidents increasing, detection system of malware has become imperative. This study proposed an integrative system framework that combines machine learning, deep learning, data balancing, and feature evaluation mechanism to detect malware, and a family-based approach was used to present classification results. The proposed framework can serve as a reference for antivirus companies and related service providers to develop adequate strategies for defending against malware attacks. This study acquired data from the CTU-13 open dataset, which was compiled through capturing the traffic from the network of a university. The dataset includes normal, malware, and background traffic. In order to reduce the noise in the dataset and improve the overall model efficiency, this study performed data analysis using feature evaluation methods including ANOVA, Chi-Square and AutoEncoder. Features that reduce the model accuracy were removed to reduce the model computation time and improve model stability. Because imbalanced data existed among various classes of malware and benign software in the original dataset, a data balancing mechanism was introduced to resolve this problem. The SMOTEENN algorithm was used to generate data for minority classes, thereby alleviating model deviations and enhancing the overall model credibility. This study also considered that malware receives updates and grows in number over time. Therefore, the neural networks architecture adopted in this study employ an activation function mechanism to detect malware. When an unknown malware program be found that does not belong to any family derived from the previous neural networks architecture, this mechanism incorporates the program in the model training for the subsequent model update. Analysis on the efficiency of the proposed framework revealed that the detection models with XGBoost and Back Propagation reached an accuracy rate of 99.98% and 98.88%, respectively. Jiann-Liang Chen 陳俊良 2019 學位論文 ; thesis 93 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺灣科技大學 === 電機工程系 === 107 === The rapid development of Internet of Things (IoT) devices and communication technologies have greatly expanded the application of the internet. In response to people’s pursuit of high quality of life, the number of IoT devices and related services have increased annually. However, the importance of information security has been overlooked by majority of people, promoting hackers and those with ulterior motives to use malware to attack security holes in Internet applications. With the number of attack incidents increasing, detection system of malware has become imperative. This study proposed an integrative system framework that combines machine learning, deep learning, data balancing, and feature evaluation mechanism to detect malware, and a family-based approach was used to present classification results. The proposed framework can serve as a reference for antivirus companies and related service providers to develop adequate strategies for defending against malware attacks. This study acquired data from the CTU-13 open dataset, which was compiled through capturing the traffic from the network of a university. The dataset includes normal, malware, and background traffic. In order to reduce the noise in the dataset and improve the overall model efficiency, this study performed data analysis using feature evaluation methods including ANOVA, Chi-Square and AutoEncoder. Features that reduce the model accuracy were removed to reduce the model computation time and improve model stability. Because imbalanced data existed among various classes of malware and benign software in the original dataset, a data balancing mechanism was introduced to resolve this problem. The SMOTEENN algorithm was used to generate data for minority classes, thereby alleviating model deviations and enhancing the overall model credibility. This study also considered that malware receives updates and grows in number over time. Therefore, the neural networks architecture adopted in this study employ an activation function mechanism to detect malware. When an unknown malware program be found that does not belong to any family derived from the previous neural networks architecture, this mechanism incorporates the program in the model training for the subsequent model update. Analysis on the efficiency of the proposed framework revealed that the detection models with XGBoost and Back Propagation reached an accuracy rate of 99.98% and 98.88%, respectively.
author2 Jiann-Liang Chen
author_facet Jiann-Liang Chen
Wen-Han Kuo
郭文翰
author Wen-Han Kuo
郭文翰
spellingShingle Wen-Han Kuo
郭文翰
Artificial Intelligence Technology for Malware Family Detection
author_sort Wen-Han Kuo
title Artificial Intelligence Technology for Malware Family Detection
title_short Artificial Intelligence Technology for Malware Family Detection
title_full Artificial Intelligence Technology for Malware Family Detection
title_fullStr Artificial Intelligence Technology for Malware Family Detection
title_full_unstemmed Artificial Intelligence Technology for Malware Family Detection
title_sort artificial intelligence technology for malware family detection
publishDate 2019
url http://ndltd.ncl.edu.tw/handle/3q8ee3
work_keys_str_mv AT wenhankuo artificialintelligencetechnologyformalwarefamilydetection
AT guōwénhàn artificialintelligencetechnologyformalwarefamilydetection
AT wenhankuo réngōngzhìhuìjìshùyúèyìruǎntǐjiāzúzhēncèzhīyánjiū
AT guōwénhàn réngōngzhìhuìjìshùyúèyìruǎntǐjiāzúzhēncèzhīyánjiū
_version_ 1719277117749854208