Code Classification Based on Structure Similarity

碩士 === 國立中山大學 === 資訊管理學系研究所 === 100 === Automatically classifying malware variants source code is the most important research issue in the field of digital forensics. By means of malware classification, we can get complete behavior of malware which can simplify the forensics task. In previous resear...

Full description

Bibliographic Details
Main Authors: Chia-hui Yang, 楊佳蕙
Other Authors: Chia-mei Chen
Format: Others
Language:zh-TW
Published: 2012
Online Access:http://ndltd.ncl.edu.tw/handle/08816341754421214260
id ndltd-TW-100NSYS5396082
record_format oai_dc
spelling ndltd-TW-100NSYS53960822015-10-13T21:22:20Z http://ndltd.ncl.edu.tw/handle/08816341754421214260 Code Classification Based on Structure Similarity 基於結構相似度之原始碼分類研究 Chia-hui Yang 楊佳蕙 碩士 國立中山大學 資訊管理學系研究所 100 Automatically classifying malware variants source code is the most important research issue in the field of digital forensics. By means of malware classification, we can get complete behavior of malware which can simplify the forensics task. In previous researches, researchers use malware binary to perform dynamic analysis or static analysis after reverse engineering. In the other hand, malware developers even use anti-VM and obfuscation techniques try to cheating malware classifiers. With honeypots are increasingly used, researchers could get more and more malware source code. Analyzing these source codes could be the best way for malware classification. In this paper, a novel classification approach is proposed which based on logic and directory structure similarity of malwares. All collected source code will be classified correctly by hierarchical clustering algorithm. The proposed system not only helps us classify known malwares correctly but also find new type of malware. Furthermore, it avoids forensics staffs spending too much time to reanalyze known malware. And the system could also help realize attacker''s behavior and purpose. The experimental results demonstrate the system can classify the malware correctly and be applied to other source code classification aspect. Chia-mei Chen 陳嘉玫 2012 學位論文 ; thesis 57 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立中山大學 === 資訊管理學系研究所 === 100 === Automatically classifying malware variants source code is the most important research issue in the field of digital forensics. By means of malware classification, we can get complete behavior of malware which can simplify the forensics task. In previous researches, researchers use malware binary to perform dynamic analysis or static analysis after reverse engineering. In the other hand, malware developers even use anti-VM and obfuscation techniques try to cheating malware classifiers. With honeypots are increasingly used, researchers could get more and more malware source code. Analyzing these source codes could be the best way for malware classification. In this paper, a novel classification approach is proposed which based on logic and directory structure similarity of malwares. All collected source code will be classified correctly by hierarchical clustering algorithm. The proposed system not only helps us classify known malwares correctly but also find new type of malware. Furthermore, it avoids forensics staffs spending too much time to reanalyze known malware. And the system could also help realize attacker''s behavior and purpose. The experimental results demonstrate the system can classify the malware correctly and be applied to other source code classification aspect.
author2 Chia-mei Chen
author_facet Chia-mei Chen
Chia-hui Yang
楊佳蕙
author Chia-hui Yang
楊佳蕙
spellingShingle Chia-hui Yang
楊佳蕙
Code Classification Based on Structure Similarity
author_sort Chia-hui Yang
title Code Classification Based on Structure Similarity
title_short Code Classification Based on Structure Similarity
title_full Code Classification Based on Structure Similarity
title_fullStr Code Classification Based on Structure Similarity
title_full_unstemmed Code Classification Based on Structure Similarity
title_sort code classification based on structure similarity
publishDate 2012
url http://ndltd.ncl.edu.tw/handle/08816341754421214260
work_keys_str_mv AT chiahuiyang codeclassificationbasedonstructuresimilarity
AT yángjiāhuì codeclassificationbasedonstructuresimilarity
AT chiahuiyang jīyújiégòuxiāngshìdùzhīyuánshǐmǎfēnlèiyánjiū
AT yángjiāhuì jīyújiégòuxiāngshìdùzhīyuánshǐmǎfēnlèiyánjiū
_version_ 1718061484126240768