AndroClass: An Effective Method to Classify Android Applications by Applying Deep Neural Networks to Comprehensive Features
Android application (app) stores contain a huge number of apps, which are manually classified based on the apps’ descriptions into various categories. However, the predefined categories or apps descriptions are usually not very accurate to reflect the real functionalities of apps, thereby leading to...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi-Wiley
2018-01-01
|
Series: | Wireless Communications and Mobile Computing |
Online Access: | http://dx.doi.org/10.1155/2018/1250359 |
id |
doaj-b28b884d6b1f46109b5c039bc0cffb8e |
---|---|
record_format |
Article |
spelling |
doaj-b28b884d6b1f46109b5c039bc0cffb8e2020-11-24T22:15:49ZengHindawi-WileyWireless Communications and Mobile Computing1530-86691530-86772018-01-01201810.1155/2018/12503591250359AndroClass: An Effective Method to Classify Android Applications by Applying Deep Neural Networks to Comprehensive FeaturesMasoud Reyhani Hamedani0Dongjin Shin1Myeonggeon Lee2Seong-Je Cho3Changha Hwang4Department of Software Science, Department of Computer Science and Engineering, Dankook University, Yongin, Republic of KoreaDepartment of Applied Statistics, Department of Data Science, Dankook University, Yongin, Republic of KoreaDepartment of Software Science, Department of Computer Science and Engineering, Dankook University, Yongin, Republic of KoreaDepartment of Software Science, Department of Computer Science and Engineering, Dankook University, Yongin, Republic of KoreaDepartment of Applied Statistics, Department of Data Science, Dankook University, Yongin, Republic of KoreaAndroid application (app) stores contain a huge number of apps, which are manually classified based on the apps’ descriptions into various categories. However, the predefined categories or apps descriptions are usually not very accurate to reflect the real functionalities of apps, thereby leading to misclassify the apps, which may cause serious security issues and unreliability problem in the app store. Therefore, the automatic app classification is an important demand to construct a secure, reliable, integrated, and easy to navigate app store. In this paper, we propose an effective method called AndroClass to automatically classify apps based on their real functionalities by using rich and comprehensive features representing the actual functionalities of the apps. AndroClass performs three steps of feature extraction, feature refinement, and classification. In the feature extraction step, we extract 14 various features for each app by utilizing a unified tool suite. In the feature refinement step, we apply Random Forest algorithm to refine the features. In the classification step, we combine refined features into a single one and AndroClass is equipped with K-Nearest Neighbor, Naive Bayes, Support Vector Machine, and Deep Neural Network to classify apps. On the contrary to the existing methods, all the utilized features in AndroClass are stable and clearly represent the actual functionalities of the app, AndroClass does not pose any issues to the user privacy, and our method can be applied to classify unreleased or newly released apps. The results of extensive experiments with two real-world datasets and a dataset constructed by human experts demonstrate the effectiveness of AndroClass where the classification accuracy of AndroClass with the latter dataset is 83.5%.http://dx.doi.org/10.1155/2018/1250359 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Masoud Reyhani Hamedani Dongjin Shin Myeonggeon Lee Seong-Je Cho Changha Hwang |
spellingShingle |
Masoud Reyhani Hamedani Dongjin Shin Myeonggeon Lee Seong-Je Cho Changha Hwang AndroClass: An Effective Method to Classify Android Applications by Applying Deep Neural Networks to Comprehensive Features Wireless Communications and Mobile Computing |
author_facet |
Masoud Reyhani Hamedani Dongjin Shin Myeonggeon Lee Seong-Je Cho Changha Hwang |
author_sort |
Masoud Reyhani Hamedani |
title |
AndroClass: An Effective Method to Classify Android Applications by Applying Deep Neural Networks to Comprehensive Features |
title_short |
AndroClass: An Effective Method to Classify Android Applications by Applying Deep Neural Networks to Comprehensive Features |
title_full |
AndroClass: An Effective Method to Classify Android Applications by Applying Deep Neural Networks to Comprehensive Features |
title_fullStr |
AndroClass: An Effective Method to Classify Android Applications by Applying Deep Neural Networks to Comprehensive Features |
title_full_unstemmed |
AndroClass: An Effective Method to Classify Android Applications by Applying Deep Neural Networks to Comprehensive Features |
title_sort |
androclass: an effective method to classify android applications by applying deep neural networks to comprehensive features |
publisher |
Hindawi-Wiley |
series |
Wireless Communications and Mobile Computing |
issn |
1530-8669 1530-8677 |
publishDate |
2018-01-01 |
description |
Android application (app) stores contain a huge number of apps, which are manually classified based on the apps’ descriptions into various categories. However, the predefined categories or apps descriptions are usually not very accurate to reflect the real functionalities of apps, thereby leading to misclassify the apps, which may cause serious security issues and unreliability problem in the app store. Therefore, the automatic app classification is an important demand to construct a secure, reliable, integrated, and easy to navigate app store. In this paper, we propose an effective method called AndroClass to automatically classify apps based on their real functionalities by using rich and comprehensive features representing the actual functionalities of the apps. AndroClass performs three steps of feature extraction, feature refinement, and classification. In the feature extraction step, we extract 14 various features for each app by utilizing a unified tool suite. In the feature refinement step, we apply Random Forest algorithm to refine the features. In the classification step, we combine refined features into a single one and AndroClass is equipped with K-Nearest Neighbor, Naive Bayes, Support Vector Machine, and Deep Neural Network to classify apps. On the contrary to the existing methods, all the utilized features in AndroClass are stable and clearly represent the actual functionalities of the app, AndroClass does not pose any issues to the user privacy, and our method can be applied to classify unreleased or newly released apps. The results of extensive experiments with two real-world datasets and a dataset constructed by human experts demonstrate the effectiveness of AndroClass where the classification accuracy of AndroClass with the latter dataset is 83.5%. |
url |
http://dx.doi.org/10.1155/2018/1250359 |
work_keys_str_mv |
AT masoudreyhanihamedani androclassaneffectivemethodtoclassifyandroidapplicationsbyapplyingdeepneuralnetworkstocomprehensivefeatures AT dongjinshin androclassaneffectivemethodtoclassifyandroidapplicationsbyapplyingdeepneuralnetworkstocomprehensivefeatures AT myeonggeonlee androclassaneffectivemethodtoclassifyandroidapplicationsbyapplyingdeepneuralnetworkstocomprehensivefeatures AT seongjecho androclassaneffectivemethodtoclassifyandroidapplicationsbyapplyingdeepneuralnetworkstocomprehensivefeatures AT changhahwang androclassaneffectivemethodtoclassifyandroidapplicationsbyapplyingdeepneuralnetworkstocomprehensivefeatures |
_version_ |
1725792889239240704 |