USING TRANSFER LEARNING FOR MALWARE CLASSIFICATION

In this paper, we propose a malware classification framework using transfer learning based on existing Deep Learning models that have been pre-trained on massive image datasets. In recent years there has been a significant increase in the number and variety of malwares, which amplifies the need to i...

Full description

Bibliographic Details
Main Authors: B. Prima, M. Bouhorma
Format: Article
Language:English
Published: Copernicus Publications 2020-11-01
Series:The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Online Access:https://www.int-arch-photogramm-remote-sens-spatial-inf-sci.net/XLIV-4-W3-2020/343/2020/isprs-archives-XLIV-4-W3-2020-343-2020.pdf
id doaj-5f91b046a0ff497ab52de83b3eb092b2
record_format Article
spelling doaj-5f91b046a0ff497ab52de83b3eb092b22020-11-25T04:11:44ZengCopernicus PublicationsThe International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences1682-17502194-90342020-11-01XLIV-4-W3-202034334910.5194/isprs-archives-XLIV-4-W3-2020-343-2020USING TRANSFER LEARNING FOR MALWARE CLASSIFICATIONB. Prima0M. Bouhorma1Computer Science, Systems and Telecommunication Laboratory, Faculty of Sciences and Techniques, Abdelmalek Essaâdi University, Tangier 90000, MoroccoComputer Science, Systems and Telecommunication Laboratory, Faculty of Sciences and Techniques, Abdelmalek Essaâdi University, Tangier 90000, MoroccoIn this paper, we propose a malware classification framework using transfer learning based on existing Deep Learning models that have been pre-trained on massive image datasets. In recent years there has been a significant increase in the number and variety of malwares, which amplifies the need to improve automatic detection and classification of the malwares. Nowadays, neural network methodology has reached a level that may exceed the limits of previous machine learning methods, such as Hidden Markov Models and Support Vector Machines (SVM). As a result, convolutional neural networks (CNNs) have shown superior performance compared to traditional learning techniques, specifically in tasks such as image classification. Motivated by this success, we propose a CNN-based architecture for malware classification. The malicious binary files are represented as grayscale images and a deep neural network is trained by freezing the pre-trained VGG16 layers on the ImageNet dataset and adapting the last fully connected layer to the malware family classification. Our evaluation results show that our approach is able to achieve an average of 98% accuracy for the MALIMG dataset.https://www.int-arch-photogramm-remote-sens-spatial-inf-sci.net/XLIV-4-W3-2020/343/2020/isprs-archives-XLIV-4-W3-2020-343-2020.pdf
collection DOAJ
language English
format Article
sources DOAJ
author B. Prima
M. Bouhorma
spellingShingle B. Prima
M. Bouhorma
USING TRANSFER LEARNING FOR MALWARE CLASSIFICATION
The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
author_facet B. Prima
M. Bouhorma
author_sort B. Prima
title USING TRANSFER LEARNING FOR MALWARE CLASSIFICATION
title_short USING TRANSFER LEARNING FOR MALWARE CLASSIFICATION
title_full USING TRANSFER LEARNING FOR MALWARE CLASSIFICATION
title_fullStr USING TRANSFER LEARNING FOR MALWARE CLASSIFICATION
title_full_unstemmed USING TRANSFER LEARNING FOR MALWARE CLASSIFICATION
title_sort using transfer learning for malware classification
publisher Copernicus Publications
series The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
issn 1682-1750
2194-9034
publishDate 2020-11-01
description In this paper, we propose a malware classification framework using transfer learning based on existing Deep Learning models that have been pre-trained on massive image datasets. In recent years there has been a significant increase in the number and variety of malwares, which amplifies the need to improve automatic detection and classification of the malwares. Nowadays, neural network methodology has reached a level that may exceed the limits of previous machine learning methods, such as Hidden Markov Models and Support Vector Machines (SVM). As a result, convolutional neural networks (CNNs) have shown superior performance compared to traditional learning techniques, specifically in tasks such as image classification. Motivated by this success, we propose a CNN-based architecture for malware classification. The malicious binary files are represented as grayscale images and a deep neural network is trained by freezing the pre-trained VGG16 layers on the ImageNet dataset and adapting the last fully connected layer to the malware family classification. Our evaluation results show that our approach is able to achieve an average of 98% accuracy for the MALIMG dataset.
url https://www.int-arch-photogramm-remote-sens-spatial-inf-sci.net/XLIV-4-W3-2020/343/2020/isprs-archives-XLIV-4-W3-2020-343-2020.pdf
work_keys_str_mv AT bprima usingtransferlearningformalwareclassification
AT mbouhorma usingtransferlearningformalwareclassification
_version_ 1724417066694344704