Label Propagation-Based Parallel Graph Partitioning for Large-Scale Graph Data
The increasing importance of graph data in various fields requires large-scale graph data to be processed efficiently. Furthermore, well-balanced graph partitioning is a vital component of parallel/distributed graph processing. The goal of graph partitioning is to obtain a well-balanced graph topolo...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9064577/ |
id |
doaj-27bc76600ff242cb9003a12702f3949e |
---|---|
record_format |
Article |
spelling |
doaj-27bc76600ff242cb9003a12702f3949e2021-03-30T01:40:45ZengIEEEIEEE Access2169-35362020-01-018728017281310.1109/ACCESS.2020.29873559064577Label Propagation-Based Parallel Graph Partitioning for Large-Scale Graph DataMinho Bae0Minjoong Jeong1Sangyoon Oh2https://orcid.org/0000-0001-5854-149XComputer Engineering, Ajou University, Suwon, South KoreaSupercomputing Department, Korea Institute of Science and Technology Information, Daejeon, South KoreaComputer Engineering, Ajou University, Suwon, South KoreaThe increasing importance of graph data in various fields requires large-scale graph data to be processed efficiently. Furthermore, well-balanced graph partitioning is a vital component of parallel/distributed graph processing. The goal of graph partitioning is to obtain a well-balanced graph topology, where the size of each partition is balanced while the number of edge cuts is reduced. Moreover, a graph-partitioning algorithm should achieve high performance and scalability. In this study, we present a novel graph-partitioning algorithm that ensures a high edge cutting quality and excellent parallel processing performance. We apply formulas based on the label propagation algorithm to improve the quality of edge cuts and achieve fast convergence. In our approach, the necessity of applying the label propagation process for all vertices is removed, and the process is applied only for candidate vertices based on a score metric. Our proposed algorithm introduces a stabilization phase in which remote and highly connected vertices are relocated to prevent the algorithm from becoming trapped in local optima. Comparison results show that a prototype based on the proposed algorithm outperforms well-known parallel graph-partitioning frameworks in terms of speed and balance.https://ieeexplore.ieee.org/document/9064577/Data processinggraph dataparallel processingpartitioning algorithms |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Minho Bae Minjoong Jeong Sangyoon Oh |
spellingShingle |
Minho Bae Minjoong Jeong Sangyoon Oh Label Propagation-Based Parallel Graph Partitioning for Large-Scale Graph Data IEEE Access Data processing graph data parallel processing partitioning algorithms |
author_facet |
Minho Bae Minjoong Jeong Sangyoon Oh |
author_sort |
Minho Bae |
title |
Label Propagation-Based Parallel Graph Partitioning for Large-Scale Graph Data |
title_short |
Label Propagation-Based Parallel Graph Partitioning for Large-Scale Graph Data |
title_full |
Label Propagation-Based Parallel Graph Partitioning for Large-Scale Graph Data |
title_fullStr |
Label Propagation-Based Parallel Graph Partitioning for Large-Scale Graph Data |
title_full_unstemmed |
Label Propagation-Based Parallel Graph Partitioning for Large-Scale Graph Data |
title_sort |
label propagation-based parallel graph partitioning for large-scale graph data |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
The increasing importance of graph data in various fields requires large-scale graph data to be processed efficiently. Furthermore, well-balanced graph partitioning is a vital component of parallel/distributed graph processing. The goal of graph partitioning is to obtain a well-balanced graph topology, where the size of each partition is balanced while the number of edge cuts is reduced. Moreover, a graph-partitioning algorithm should achieve high performance and scalability. In this study, we present a novel graph-partitioning algorithm that ensures a high edge cutting quality and excellent parallel processing performance. We apply formulas based on the label propagation algorithm to improve the quality of edge cuts and achieve fast convergence. In our approach, the necessity of applying the label propagation process for all vertices is removed, and the process is applied only for candidate vertices based on a score metric. Our proposed algorithm introduces a stabilization phase in which remote and highly connected vertices are relocated to prevent the algorithm from becoming trapped in local optima. Comparison results show that a prototype based on the proposed algorithm outperforms well-known parallel graph-partitioning frameworks in terms of speed and balance. |
topic |
Data processing graph data parallel processing partitioning algorithms |
url |
https://ieeexplore.ieee.org/document/9064577/ |
work_keys_str_mv |
AT minhobae labelpropagationbasedparallelgraphpartitioningforlargescalegraphdata AT minjoongjeong labelpropagationbasedparallelgraphpartitioningforlargescalegraphdata AT sangyoonoh labelpropagationbasedparallelgraphpartitioningforlargescalegraphdata |
_version_ |
1724186515756548096 |