Label Propagation-Based Parallel Graph Partitioning for Large-Scale Graph Data

The increasing importance of graph data in various fields requires large-scale graph data to be processed efficiently. Furthermore, well-balanced graph partitioning is a vital component of parallel/distributed graph processing. The goal of graph partitioning is to obtain a well-balanced graph topolo...

Full description

Bibliographic Details
Main Authors: Minho Bae, Minjoong Jeong, Sangyoon Oh
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9064577/
id doaj-27bc76600ff242cb9003a12702f3949e
record_format Article
spelling doaj-27bc76600ff242cb9003a12702f3949e2021-03-30T01:40:45ZengIEEEIEEE Access2169-35362020-01-018728017281310.1109/ACCESS.2020.29873559064577Label Propagation-Based Parallel Graph Partitioning for Large-Scale Graph DataMinho Bae0Minjoong Jeong1Sangyoon Oh2https://orcid.org/0000-0001-5854-149XComputer Engineering, Ajou University, Suwon, South KoreaSupercomputing Department, Korea Institute of Science and Technology Information, Daejeon, South KoreaComputer Engineering, Ajou University, Suwon, South KoreaThe increasing importance of graph data in various fields requires large-scale graph data to be processed efficiently. Furthermore, well-balanced graph partitioning is a vital component of parallel/distributed graph processing. The goal of graph partitioning is to obtain a well-balanced graph topology, where the size of each partition is balanced while the number of edge cuts is reduced. Moreover, a graph-partitioning algorithm should achieve high performance and scalability. In this study, we present a novel graph-partitioning algorithm that ensures a high edge cutting quality and excellent parallel processing performance. We apply formulas based on the label propagation algorithm to improve the quality of edge cuts and achieve fast convergence. In our approach, the necessity of applying the label propagation process for all vertices is removed, and the process is applied only for candidate vertices based on a score metric. Our proposed algorithm introduces a stabilization phase in which remote and highly connected vertices are relocated to prevent the algorithm from becoming trapped in local optima. Comparison results show that a prototype based on the proposed algorithm outperforms well-known parallel graph-partitioning frameworks in terms of speed and balance.https://ieeexplore.ieee.org/document/9064577/Data processinggraph dataparallel processingpartitioning algorithms
collection DOAJ
language English
format Article
sources DOAJ
author Minho Bae
Minjoong Jeong
Sangyoon Oh
spellingShingle Minho Bae
Minjoong Jeong
Sangyoon Oh
Label Propagation-Based Parallel Graph Partitioning for Large-Scale Graph Data
IEEE Access
Data processing
graph data
parallel processing
partitioning algorithms
author_facet Minho Bae
Minjoong Jeong
Sangyoon Oh
author_sort Minho Bae
title Label Propagation-Based Parallel Graph Partitioning for Large-Scale Graph Data
title_short Label Propagation-Based Parallel Graph Partitioning for Large-Scale Graph Data
title_full Label Propagation-Based Parallel Graph Partitioning for Large-Scale Graph Data
title_fullStr Label Propagation-Based Parallel Graph Partitioning for Large-Scale Graph Data
title_full_unstemmed Label Propagation-Based Parallel Graph Partitioning for Large-Scale Graph Data
title_sort label propagation-based parallel graph partitioning for large-scale graph data
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description The increasing importance of graph data in various fields requires large-scale graph data to be processed efficiently. Furthermore, well-balanced graph partitioning is a vital component of parallel/distributed graph processing. The goal of graph partitioning is to obtain a well-balanced graph topology, where the size of each partition is balanced while the number of edge cuts is reduced. Moreover, a graph-partitioning algorithm should achieve high performance and scalability. In this study, we present a novel graph-partitioning algorithm that ensures a high edge cutting quality and excellent parallel processing performance. We apply formulas based on the label propagation algorithm to improve the quality of edge cuts and achieve fast convergence. In our approach, the necessity of applying the label propagation process for all vertices is removed, and the process is applied only for candidate vertices based on a score metric. Our proposed algorithm introduces a stabilization phase in which remote and highly connected vertices are relocated to prevent the algorithm from becoming trapped in local optima. Comparison results show that a prototype based on the proposed algorithm outperforms well-known parallel graph-partitioning frameworks in terms of speed and balance.
topic Data processing
graph data
parallel processing
partitioning algorithms
url https://ieeexplore.ieee.org/document/9064577/
work_keys_str_mv AT minhobae labelpropagationbasedparallelgraphpartitioningforlargescalegraphdata
AT minjoongjeong labelpropagationbasedparallelgraphpartitioningforlargescalegraphdata
AT sangyoonoh labelpropagationbasedparallelgraphpartitioningforlargescalegraphdata
_version_ 1724186515756548096