Parallel Community Detection Based on Distance Dynamics for Large-Scale Network

Data mining task is a challenge on finding a high-quality community structure from largescale networks. The distance dynamics model was proved to be active on regular-size network community, but it is difficult to discover the community structure effectively from the large-scale network (0.1-1 billi...

Full description

Bibliographic Details
Main Authors: Tingqin He, Lijun Cai, Tao Meng, Lei Chen, Ziyun Deng, Zehong Cao
Format: Article
Language:English
Published: IEEE 2018-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8419690/
id doaj-82715d9b66ee4c6a85cc54f8b8e5000a
record_format Article
spelling doaj-82715d9b66ee4c6a85cc54f8b8e5000a2021-03-29T21:06:13ZengIEEEIEEE Access2169-35362018-01-016427754278910.1109/ACCESS.2018.28597888419690Parallel Community Detection Based on Distance Dynamics for Large-Scale NetworkTingqin He0https://orcid.org/0000-0001-7890-7567Lijun Cai1Tao Meng2Lei Chen3Ziyun Deng4https://orcid.org/0000-0003-1276-5222Zehong Cao5https://orcid.org/0000-0003-3656-0328College of Computer Science and Electronic Engineering, Hunan University, Changsha, ChinaCollege of Computer Science and Electronic Engineering, Hunan University, Changsha, ChinaCollege of Computer Science and Electronic Engineering, Hunan University, Changsha, ChinaCollege of Electrical and Information Engineering, Hunan University, Changsha, ChinaDepartment of Economics and Trade, Changsha Commerce and Tourism College, Changsha, ChinaCentre for Artificial Intelligence, Faculty of Engineering and Information Technology, University of Technology Sydney, Ultimo, NSW, AustraliaData mining task is a challenge on finding a high-quality community structure from largescale networks. The distance dynamics model was proved to be active on regular-size network community, but it is difficult to discover the community structure effectively from the large-scale network (0.1-1 billion edges), due to the limit of machine hardware and high time complexity. In this paper, we proposed a parallel community detection algorithm based on the distance dynamics model called P-Attractor, which is capable of handling the detection problem of large networks community. Our algorithm first developed a graph partitioning method to divide large network into lots of sub-networks, yet maintaining the complete neighbor structure of the original network. Then, the traditional distance dynamics model was improved by the dynamic interaction process to simulate the distance evolution of each sub-network. Finally, we discovered the real community structure by removing all external edges after evolution process. In our extensive experiments on multiple synthetic networks and real-world networks, the results showed the effectiveness and efficiency of P-Attractor, and the execution time on 4 threads and 32 threads are around 10 and 2 h, respectively. Our proposed algorithm is potential to discover community from a billion-scale network, such as Uk-2007.https://ieeexplore.ieee.org/document/8419690/Community detectioncomplex networkgraph clusteringweb mining
collection DOAJ
language English
format Article
sources DOAJ
author Tingqin He
Lijun Cai
Tao Meng
Lei Chen
Ziyun Deng
Zehong Cao
spellingShingle Tingqin He
Lijun Cai
Tao Meng
Lei Chen
Ziyun Deng
Zehong Cao
Parallel Community Detection Based on Distance Dynamics for Large-Scale Network
IEEE Access
Community detection
complex network
graph clustering
web mining
author_facet Tingqin He
Lijun Cai
Tao Meng
Lei Chen
Ziyun Deng
Zehong Cao
author_sort Tingqin He
title Parallel Community Detection Based on Distance Dynamics for Large-Scale Network
title_short Parallel Community Detection Based on Distance Dynamics for Large-Scale Network
title_full Parallel Community Detection Based on Distance Dynamics for Large-Scale Network
title_fullStr Parallel Community Detection Based on Distance Dynamics for Large-Scale Network
title_full_unstemmed Parallel Community Detection Based on Distance Dynamics for Large-Scale Network
title_sort parallel community detection based on distance dynamics for large-scale network
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2018-01-01
description Data mining task is a challenge on finding a high-quality community structure from largescale networks. The distance dynamics model was proved to be active on regular-size network community, but it is difficult to discover the community structure effectively from the large-scale network (0.1-1 billion edges), due to the limit of machine hardware and high time complexity. In this paper, we proposed a parallel community detection algorithm based on the distance dynamics model called P-Attractor, which is capable of handling the detection problem of large networks community. Our algorithm first developed a graph partitioning method to divide large network into lots of sub-networks, yet maintaining the complete neighbor structure of the original network. Then, the traditional distance dynamics model was improved by the dynamic interaction process to simulate the distance evolution of each sub-network. Finally, we discovered the real community structure by removing all external edges after evolution process. In our extensive experiments on multiple synthetic networks and real-world networks, the results showed the effectiveness and efficiency of P-Attractor, and the execution time on 4 threads and 32 threads are around 10 and 2 h, respectively. Our proposed algorithm is potential to discover community from a billion-scale network, such as Uk-2007.
topic Community detection
complex network
graph clustering
web mining
url https://ieeexplore.ieee.org/document/8419690/
work_keys_str_mv AT tingqinhe parallelcommunitydetectionbasedondistancedynamicsforlargescalenetwork
AT lijuncai parallelcommunitydetectionbasedondistancedynamicsforlargescalenetwork
AT taomeng parallelcommunitydetectionbasedondistancedynamicsforlargescalenetwork
AT leichen parallelcommunitydetectionbasedondistancedynamicsforlargescalenetwork
AT ziyundeng parallelcommunitydetectionbasedondistancedynamicsforlargescalenetwork
AT zehongcao parallelcommunitydetectionbasedondistancedynamicsforlargescalenetwork
_version_ 1724193548477136896