HcRPC: Highly Compact Reachability Preserving Graph Compression With Corrections

Graphs are used in numerous applications to model real-world systems and phenomena. The ever increasing size of graphs makes them difficult to query and analyze. In this paper, we propose HcRPC, a Highly compact Reachability Preserving Graph compression algorithm with Corrections, which is capable o...

Full description

Bibliographic Details
Main Authors: Rui Bing, Huifang Ma, Xiangchun He, Zhixin Li, Lijun Guo
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8839789/
id doaj-ac4e5d417352453eb9758c714c930346
record_format Article
spelling doaj-ac4e5d417352453eb9758c714c9303462021-03-29T23:05:53ZengIEEEIEEE Access2169-35362019-01-01713656813658010.1109/ACCESS.2019.29417668839789HcRPC: Highly Compact Reachability Preserving Graph Compression With CorrectionsRui Bing0Huifang Ma1https://orcid.org/0000-0002-5104-8982Xiangchun He2Zhixin Li3Lijun Guo4College of Computer Science and Engineering, Northwest Normal University, Lanzhou, ChinaCollege of Computer Science and Engineering, Northwest Normal University, Lanzhou, ChinaCollege of Computer Science and Engineering, Northwest Normal University, Lanzhou, ChinaGuangxi Key Lab of Multi-Source Information Mining and Security, Guangxi Normal University, Guilin, ChinaCollege of Information Science and Engineering, Ningbo University, Ningbo, ChinaGraphs are used in numerous applications to model real-world systems and phenomena. The ever increasing size of graphs makes them difficult to query and analyze. In this paper, we propose HcRPC, a Highly compact Reachability Preserving Graph compression algorithm with Corrections, which is capable of preserving the reachability relations between the nodes in original graph. The highly compressed representation of a given graph consists of a compressed graph and a set of corrections. The original graph is compressed on the basis of equivalence class obtained via the reachability relations between nodes in the original graph. In the compressed graph, each node corresponds to a set of nodes from the original graph with similar ancestors and descendants, and each edge represents linkage between the original nodes in any two node sets. The corrections portion specifies the set of corrections, including equivalent class-node corrections and node-node corrections. MinHash technique is utilized to speed up checking whether equivalence classes are structure-similar and the pair of equivalence classes with high similarity are thus merged to acquire a highly compressed graph. Besides, we develop an algorithm for preserving compressed graph with a set of corrections in response to changes to the original graph. We evaluate our algorithms on real-life graph data sets and the results indicate that graph data sets can be highly compressed while preserving the reachability relations between nodes.https://ieeexplore.ieee.org/document/8839789/Graph compressionreachability queryMinHashdynamic graph
collection DOAJ
language English
format Article
sources DOAJ
author Rui Bing
Huifang Ma
Xiangchun He
Zhixin Li
Lijun Guo
spellingShingle Rui Bing
Huifang Ma
Xiangchun He
Zhixin Li
Lijun Guo
HcRPC: Highly Compact Reachability Preserving Graph Compression With Corrections
IEEE Access
Graph compression
reachability query
MinHash
dynamic graph
author_facet Rui Bing
Huifang Ma
Xiangchun He
Zhixin Li
Lijun Guo
author_sort Rui Bing
title HcRPC: Highly Compact Reachability Preserving Graph Compression With Corrections
title_short HcRPC: Highly Compact Reachability Preserving Graph Compression With Corrections
title_full HcRPC: Highly Compact Reachability Preserving Graph Compression With Corrections
title_fullStr HcRPC: Highly Compact Reachability Preserving Graph Compression With Corrections
title_full_unstemmed HcRPC: Highly Compact Reachability Preserving Graph Compression With Corrections
title_sort hcrpc: highly compact reachability preserving graph compression with corrections
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description Graphs are used in numerous applications to model real-world systems and phenomena. The ever increasing size of graphs makes them difficult to query and analyze. In this paper, we propose HcRPC, a Highly compact Reachability Preserving Graph compression algorithm with Corrections, which is capable of preserving the reachability relations between the nodes in original graph. The highly compressed representation of a given graph consists of a compressed graph and a set of corrections. The original graph is compressed on the basis of equivalence class obtained via the reachability relations between nodes in the original graph. In the compressed graph, each node corresponds to a set of nodes from the original graph with similar ancestors and descendants, and each edge represents linkage between the original nodes in any two node sets. The corrections portion specifies the set of corrections, including equivalent class-node corrections and node-node corrections. MinHash technique is utilized to speed up checking whether equivalence classes are structure-similar and the pair of equivalence classes with high similarity are thus merged to acquire a highly compressed graph. Besides, we develop an algorithm for preserving compressed graph with a set of corrections in response to changes to the original graph. We evaluate our algorithms on real-life graph data sets and the results indicate that graph data sets can be highly compressed while preserving the reachability relations between nodes.
topic Graph compression
reachability query
MinHash
dynamic graph
url https://ieeexplore.ieee.org/document/8839789/
work_keys_str_mv AT ruibing hcrpchighlycompactreachabilitypreservinggraphcompressionwithcorrections
AT huifangma hcrpchighlycompactreachabilitypreservinggraphcompressionwithcorrections
AT xiangchunhe hcrpchighlycompactreachabilitypreservinggraphcompressionwithcorrections
AT zhixinli hcrpchighlycompactreachabilitypreservinggraphcompressionwithcorrections
AT lijunguo hcrpchighlycompactreachabilitypreservinggraphcompressionwithcorrections
_version_ 1724190134335700992