Artificial Intelligence Technology for Malware Family Detection
碩士 === 國立臺灣科技大學 === 電機工程系 === 107 === The rapid development of Internet of Things (IoT) devices and communication technologies have greatly expanded the application of the internet. In response to people’s pursuit of high quality of life, the number of IoT devices and related services have increased...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2019
|
Online Access: | http://ndltd.ncl.edu.tw/handle/3q8ee3 |
id |
ndltd-TW-107NTUS5442088 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-107NTUS54420882019-10-24T05:20:25Z http://ndltd.ncl.edu.tw/handle/3q8ee3 Artificial Intelligence Technology for Malware Family Detection 人工智慧技術於惡意軟體家族偵測之研究 Wen-Han Kuo 郭文翰 碩士 國立臺灣科技大學 電機工程系 107 The rapid development of Internet of Things (IoT) devices and communication technologies have greatly expanded the application of the internet. In response to people’s pursuit of high quality of life, the number of IoT devices and related services have increased annually. However, the importance of information security has been overlooked by majority of people, promoting hackers and those with ulterior motives to use malware to attack security holes in Internet applications. With the number of attack incidents increasing, detection system of malware has become imperative. This study proposed an integrative system framework that combines machine learning, deep learning, data balancing, and feature evaluation mechanism to detect malware, and a family-based approach was used to present classification results. The proposed framework can serve as a reference for antivirus companies and related service providers to develop adequate strategies for defending against malware attacks. This study acquired data from the CTU-13 open dataset, which was compiled through capturing the traffic from the network of a university. The dataset includes normal, malware, and background traffic. In order to reduce the noise in the dataset and improve the overall model efficiency, this study performed data analysis using feature evaluation methods including ANOVA, Chi-Square and AutoEncoder. Features that reduce the model accuracy were removed to reduce the model computation time and improve model stability. Because imbalanced data existed among various classes of malware and benign software in the original dataset, a data balancing mechanism was introduced to resolve this problem. The SMOTEENN algorithm was used to generate data for minority classes, thereby alleviating model deviations and enhancing the overall model credibility. This study also considered that malware receives updates and grows in number over time. Therefore, the neural networks architecture adopted in this study employ an activation function mechanism to detect malware. When an unknown malware program be found that does not belong to any family derived from the previous neural networks architecture, this mechanism incorporates the program in the model training for the subsequent model update. Analysis on the efficiency of the proposed framework revealed that the detection models with XGBoost and Back Propagation reached an accuracy rate of 99.98% and 98.88%, respectively. Jiann-Liang Chen 陳俊良 2019 學位論文 ; thesis 93 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺灣科技大學 === 電機工程系 === 107 === The rapid development of Internet of Things (IoT) devices and communication technologies have greatly expanded the application of the internet. In response to people’s pursuit of high quality of life, the number of IoT devices and related services have increased annually. However, the importance of information security has been overlooked by majority of people, promoting hackers and those with ulterior motives to use malware to attack security holes in Internet applications. With the number of attack incidents increasing, detection system of malware has become imperative.
This study proposed an integrative system framework that combines machine learning, deep learning, data balancing, and feature evaluation mechanism to detect malware, and a family-based approach was used to present classification results. The proposed framework can serve as a reference for antivirus companies and related service providers to develop adequate strategies for defending against malware attacks. This study acquired data from the CTU-13 open dataset, which was compiled through capturing the traffic from the network of a university. The dataset includes normal, malware, and background traffic. In order to reduce the noise in the dataset and improve the overall model efficiency, this study performed data analysis using feature evaluation methods including ANOVA, Chi-Square and AutoEncoder. Features that reduce the model accuracy were removed to reduce the model computation time and improve model stability. Because imbalanced data existed among various classes of malware and benign software in the original dataset, a data balancing mechanism was introduced to resolve this problem. The SMOTEENN algorithm was used to generate data for minority classes, thereby alleviating model deviations and enhancing the overall model credibility.
This study also considered that malware receives updates and grows in number over time. Therefore, the neural networks architecture adopted in this study employ an activation function mechanism to detect malware. When an unknown malware program be found that does not belong to any family derived from the previous neural networks architecture, this mechanism incorporates the program in the model training for the subsequent model update. Analysis on the efficiency of the proposed framework revealed that the detection models with XGBoost and Back Propagation reached an accuracy rate of 99.98% and 98.88%, respectively.
|
author2 |
Jiann-Liang Chen |
author_facet |
Jiann-Liang Chen Wen-Han Kuo 郭文翰 |
author |
Wen-Han Kuo 郭文翰 |
spellingShingle |
Wen-Han Kuo 郭文翰 Artificial Intelligence Technology for Malware Family Detection |
author_sort |
Wen-Han Kuo |
title |
Artificial Intelligence Technology for Malware Family Detection |
title_short |
Artificial Intelligence Technology for Malware Family Detection |
title_full |
Artificial Intelligence Technology for Malware Family Detection |
title_fullStr |
Artificial Intelligence Technology for Malware Family Detection |
title_full_unstemmed |
Artificial Intelligence Technology for Malware Family Detection |
title_sort |
artificial intelligence technology for malware family detection |
publishDate |
2019 |
url |
http://ndltd.ncl.edu.tw/handle/3q8ee3 |
work_keys_str_mv |
AT wenhankuo artificialintelligencetechnologyformalwarefamilydetection AT guōwénhàn artificialintelligencetechnologyformalwarefamilydetection AT wenhankuo réngōngzhìhuìjìshùyúèyìruǎntǐjiāzúzhēncèzhīyánjiū AT guōwénhàn réngōngzhìhuìjìshùyúèyìruǎntǐjiāzúzhēncèzhīyánjiū |
_version_ |
1719277117749854208 |