Parallel Community Detection Based on Distance Dynamics for Large-Scale Network
Data mining task is a challenge on finding a high-quality community structure from largescale networks. The distance dynamics model was proved to be active on regular-size network community, but it is difficult to discover the community structure effectively from the large-scale network (0.1-1 billi...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2018-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8419690/ |
id |
doaj-82715d9b66ee4c6a85cc54f8b8e5000a |
---|---|
record_format |
Article |
spelling |
doaj-82715d9b66ee4c6a85cc54f8b8e5000a2021-03-29T21:06:13ZengIEEEIEEE Access2169-35362018-01-016427754278910.1109/ACCESS.2018.28597888419690Parallel Community Detection Based on Distance Dynamics for Large-Scale NetworkTingqin He0https://orcid.org/0000-0001-7890-7567Lijun Cai1Tao Meng2Lei Chen3Ziyun Deng4https://orcid.org/0000-0003-1276-5222Zehong Cao5https://orcid.org/0000-0003-3656-0328College of Computer Science and Electronic Engineering, Hunan University, Changsha, ChinaCollege of Computer Science and Electronic Engineering, Hunan University, Changsha, ChinaCollege of Computer Science and Electronic Engineering, Hunan University, Changsha, ChinaCollege of Electrical and Information Engineering, Hunan University, Changsha, ChinaDepartment of Economics and Trade, Changsha Commerce and Tourism College, Changsha, ChinaCentre for Artificial Intelligence, Faculty of Engineering and Information Technology, University of Technology Sydney, Ultimo, NSW, AustraliaData mining task is a challenge on finding a high-quality community structure from largescale networks. The distance dynamics model was proved to be active on regular-size network community, but it is difficult to discover the community structure effectively from the large-scale network (0.1-1 billion edges), due to the limit of machine hardware and high time complexity. In this paper, we proposed a parallel community detection algorithm based on the distance dynamics model called P-Attractor, which is capable of handling the detection problem of large networks community. Our algorithm first developed a graph partitioning method to divide large network into lots of sub-networks, yet maintaining the complete neighbor structure of the original network. Then, the traditional distance dynamics model was improved by the dynamic interaction process to simulate the distance evolution of each sub-network. Finally, we discovered the real community structure by removing all external edges after evolution process. In our extensive experiments on multiple synthetic networks and real-world networks, the results showed the effectiveness and efficiency of P-Attractor, and the execution time on 4 threads and 32 threads are around 10 and 2 h, respectively. Our proposed algorithm is potential to discover community from a billion-scale network, such as Uk-2007.https://ieeexplore.ieee.org/document/8419690/Community detectioncomplex networkgraph clusteringweb mining |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Tingqin He Lijun Cai Tao Meng Lei Chen Ziyun Deng Zehong Cao |
spellingShingle |
Tingqin He Lijun Cai Tao Meng Lei Chen Ziyun Deng Zehong Cao Parallel Community Detection Based on Distance Dynamics for Large-Scale Network IEEE Access Community detection complex network graph clustering web mining |
author_facet |
Tingqin He Lijun Cai Tao Meng Lei Chen Ziyun Deng Zehong Cao |
author_sort |
Tingqin He |
title |
Parallel Community Detection Based on Distance Dynamics for Large-Scale Network |
title_short |
Parallel Community Detection Based on Distance Dynamics for Large-Scale Network |
title_full |
Parallel Community Detection Based on Distance Dynamics for Large-Scale Network |
title_fullStr |
Parallel Community Detection Based on Distance Dynamics for Large-Scale Network |
title_full_unstemmed |
Parallel Community Detection Based on Distance Dynamics for Large-Scale Network |
title_sort |
parallel community detection based on distance dynamics for large-scale network |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2018-01-01 |
description |
Data mining task is a challenge on finding a high-quality community structure from largescale networks. The distance dynamics model was proved to be active on regular-size network community, but it is difficult to discover the community structure effectively from the large-scale network (0.1-1 billion edges), due to the limit of machine hardware and high time complexity. In this paper, we proposed a parallel community detection algorithm based on the distance dynamics model called P-Attractor, which is capable of handling the detection problem of large networks community. Our algorithm first developed a graph partitioning method to divide large network into lots of sub-networks, yet maintaining the complete neighbor structure of the original network. Then, the traditional distance dynamics model was improved by the dynamic interaction process to simulate the distance evolution of each sub-network. Finally, we discovered the real community structure by removing all external edges after evolution process. In our extensive experiments on multiple synthetic networks and real-world networks, the results showed the effectiveness and efficiency of P-Attractor, and the execution time on 4 threads and 32 threads are around 10 and 2 h, respectively. Our proposed algorithm is potential to discover community from a billion-scale network, such as Uk-2007. |
topic |
Community detection complex network graph clustering web mining |
url |
https://ieeexplore.ieee.org/document/8419690/ |
work_keys_str_mv |
AT tingqinhe parallelcommunitydetectionbasedondistancedynamicsforlargescalenetwork AT lijuncai parallelcommunitydetectionbasedondistancedynamicsforlargescalenetwork AT taomeng parallelcommunitydetectionbasedondistancedynamicsforlargescalenetwork AT leichen parallelcommunitydetectionbasedondistancedynamicsforlargescalenetwork AT ziyundeng parallelcommunitydetectionbasedondistancedynamicsforlargescalenetwork AT zehongcao parallelcommunitydetectionbasedondistancedynamicsforlargescalenetwork |
_version_ |
1724193548477136896 |