AndroClass: An Effective Method to Classify Android Applications by Applying Deep Neural Networks to Comprehensive Features

Android application (app) stores contain a huge number of apps, which are manually classified based on the apps’ descriptions into various categories. However, the predefined categories or apps descriptions are usually not very accurate to reflect the real functionalities of apps, thereby leading to...

Full description

Bibliographic Details
Main Authors: Masoud Reyhani Hamedani, Dongjin Shin, Myeonggeon Lee, Seong-Je Cho, Changha Hwang
Format: Article
Language:English
Published: Hindawi-Wiley 2018-01-01
Series:Wireless Communications and Mobile Computing
Online Access:http://dx.doi.org/10.1155/2018/1250359
id doaj-b28b884d6b1f46109b5c039bc0cffb8e
record_format Article
spelling doaj-b28b884d6b1f46109b5c039bc0cffb8e2020-11-24T22:15:49ZengHindawi-WileyWireless Communications and Mobile Computing1530-86691530-86772018-01-01201810.1155/2018/12503591250359AndroClass: An Effective Method to Classify Android Applications by Applying Deep Neural Networks to Comprehensive FeaturesMasoud Reyhani Hamedani0Dongjin Shin1Myeonggeon Lee2Seong-Je Cho3Changha Hwang4Department of Software Science, Department of Computer Science and Engineering, Dankook University, Yongin, Republic of KoreaDepartment of Applied Statistics, Department of Data Science, Dankook University, Yongin, Republic of KoreaDepartment of Software Science, Department of Computer Science and Engineering, Dankook University, Yongin, Republic of KoreaDepartment of Software Science, Department of Computer Science and Engineering, Dankook University, Yongin, Republic of KoreaDepartment of Applied Statistics, Department of Data Science, Dankook University, Yongin, Republic of KoreaAndroid application (app) stores contain a huge number of apps, which are manually classified based on the apps’ descriptions into various categories. However, the predefined categories or apps descriptions are usually not very accurate to reflect the real functionalities of apps, thereby leading to misclassify the apps, which may cause serious security issues and unreliability problem in the app store. Therefore, the automatic app classification is an important demand to construct a secure, reliable, integrated, and easy to navigate app store. In this paper, we propose an effective method called AndroClass to automatically classify apps based on their real functionalities by using rich and comprehensive features representing the actual functionalities of the apps. AndroClass performs three steps of feature extraction, feature refinement, and classification. In the feature extraction step, we extract 14 various features for each app by utilizing a unified tool suite. In the feature refinement step, we apply Random Forest algorithm to refine the features. In the classification step, we combine refined features into a single one and AndroClass is equipped with K-Nearest Neighbor, Naive Bayes, Support Vector Machine, and Deep Neural Network to classify apps. On the contrary to the existing methods, all the utilized features in AndroClass are stable and clearly represent the actual functionalities of the app, AndroClass does not pose any issues to the user privacy, and our method can be applied to classify unreleased or newly released apps. The results of extensive experiments with two real-world datasets and a dataset constructed by human experts demonstrate the effectiveness of AndroClass where the classification accuracy of AndroClass with the latter dataset is 83.5%.http://dx.doi.org/10.1155/2018/1250359
collection DOAJ
language English
format Article
sources DOAJ
author Masoud Reyhani Hamedani
Dongjin Shin
Myeonggeon Lee
Seong-Je Cho
Changha Hwang
spellingShingle Masoud Reyhani Hamedani
Dongjin Shin
Myeonggeon Lee
Seong-Je Cho
Changha Hwang
AndroClass: An Effective Method to Classify Android Applications by Applying Deep Neural Networks to Comprehensive Features
Wireless Communications and Mobile Computing
author_facet Masoud Reyhani Hamedani
Dongjin Shin
Myeonggeon Lee
Seong-Je Cho
Changha Hwang
author_sort Masoud Reyhani Hamedani
title AndroClass: An Effective Method to Classify Android Applications by Applying Deep Neural Networks to Comprehensive Features
title_short AndroClass: An Effective Method to Classify Android Applications by Applying Deep Neural Networks to Comprehensive Features
title_full AndroClass: An Effective Method to Classify Android Applications by Applying Deep Neural Networks to Comprehensive Features
title_fullStr AndroClass: An Effective Method to Classify Android Applications by Applying Deep Neural Networks to Comprehensive Features
title_full_unstemmed AndroClass: An Effective Method to Classify Android Applications by Applying Deep Neural Networks to Comprehensive Features
title_sort androclass: an effective method to classify android applications by applying deep neural networks to comprehensive features
publisher Hindawi-Wiley
series Wireless Communications and Mobile Computing
issn 1530-8669
1530-8677
publishDate 2018-01-01
description Android application (app) stores contain a huge number of apps, which are manually classified based on the apps’ descriptions into various categories. However, the predefined categories or apps descriptions are usually not very accurate to reflect the real functionalities of apps, thereby leading to misclassify the apps, which may cause serious security issues and unreliability problem in the app store. Therefore, the automatic app classification is an important demand to construct a secure, reliable, integrated, and easy to navigate app store. In this paper, we propose an effective method called AndroClass to automatically classify apps based on their real functionalities by using rich and comprehensive features representing the actual functionalities of the apps. AndroClass performs three steps of feature extraction, feature refinement, and classification. In the feature extraction step, we extract 14 various features for each app by utilizing a unified tool suite. In the feature refinement step, we apply Random Forest algorithm to refine the features. In the classification step, we combine refined features into a single one and AndroClass is equipped with K-Nearest Neighbor, Naive Bayes, Support Vector Machine, and Deep Neural Network to classify apps. On the contrary to the existing methods, all the utilized features in AndroClass are stable and clearly represent the actual functionalities of the app, AndroClass does not pose any issues to the user privacy, and our method can be applied to classify unreleased or newly released apps. The results of extensive experiments with two real-world datasets and a dataset constructed by human experts demonstrate the effectiveness of AndroClass where the classification accuracy of AndroClass with the latter dataset is 83.5%.
url http://dx.doi.org/10.1155/2018/1250359
work_keys_str_mv AT masoudreyhanihamedani androclassaneffectivemethodtoclassifyandroidapplicationsbyapplyingdeepneuralnetworkstocomprehensivefeatures
AT dongjinshin androclassaneffectivemethodtoclassifyandroidapplicationsbyapplyingdeepneuralnetworkstocomprehensivefeatures
AT myeonggeonlee androclassaneffectivemethodtoclassifyandroidapplicationsbyapplyingdeepneuralnetworkstocomprehensivefeatures
AT seongjecho androclassaneffectivemethodtoclassifyandroidapplicationsbyapplyingdeepneuralnetworkstocomprehensivefeatures
AT changhahwang androclassaneffectivemethodtoclassifyandroidapplicationsbyapplyingdeepneuralnetworkstocomprehensivefeatures
_version_ 1725792889239240704