Code Classification Based on Structure Similarity

碩士 === 國立中山大學 === 資訊管理學系研究所 === 100 === Automatically classifying malware variants source code is the most important research issue in the field of digital forensics. By means of malware classification, we can get complete behavior of malware which can simplify the forensics task. In previous resear...

Full description

Bibliographic Details
Main Authors: Chia-hui Yang, 楊佳蕙
Other Authors: Chia-mei Chen
Format: Others
Language:zh-TW
Published: 2012
Online Access:http://ndltd.ncl.edu.tw/handle/08816341754421214260
Description
Summary:碩士 === 國立中山大學 === 資訊管理學系研究所 === 100 === Automatically classifying malware variants source code is the most important research issue in the field of digital forensics. By means of malware classification, we can get complete behavior of malware which can simplify the forensics task. In previous researches, researchers use malware binary to perform dynamic analysis or static analysis after reverse engineering. In the other hand, malware developers even use anti-VM and obfuscation techniques try to cheating malware classifiers. With honeypots are increasingly used, researchers could get more and more malware source code. Analyzing these source codes could be the best way for malware classification. In this paper, a novel classification approach is proposed which based on logic and directory structure similarity of malwares. All collected source code will be classified correctly by hierarchical clustering algorithm. The proposed system not only helps us classify known malwares correctly but also find new type of malware. Furthermore, it avoids forensics staffs spending too much time to reanalyze known malware. And the system could also help realize attacker''s behavior and purpose. The experimental results demonstrate the system can classify the malware correctly and be applied to other source code classification aspect.