Par@Graph – a parallel toolbox for the construction and analysis of large complex climate networks

In this paper, we present Par@Graph, a software toolbox to reconstruct and analyze complex climate networks having a large number of nodes (up to at least 10<sup>6</sup>) and edges (up to at least 10<sup>12</sup>). The key innovation is an efficient set of parallel software t...

Full description

Bibliographic Details
Main Authors: H. Ihshaish, A. Tantet, J. C. M. Dijkzeul, H. A. Dijkstra
Format: Article
Language:English
Published: Copernicus Publications 2015-10-01
Series:Geoscientific Model Development
Online Access:http://www.geosci-model-dev.net/8/3321/2015/gmd-8-3321-2015.pdf
id doaj-2b1057dc4f03474ea56ec24b7de4b249
record_format Article
spelling doaj-2b1057dc4f03474ea56ec24b7de4b2492020-11-24T22:32:42ZengCopernicus PublicationsGeoscientific Model Development1991-959X1991-96032015-10-018103321333110.5194/gmd-8-3321-2015Par@Graph – a parallel toolbox for the construction and analysis of large complex climate networksH. Ihshaish0A. Tantet1J. C. M. Dijkzeul2H. A. Dijkstra3VORtech – Scientific Software Engineers, Delft, the NetherlandsInstitute for Marine and Atmospheric research Utrecht, Utrecht University, Utrecht, the NetherlandsVORtech – Scientific Software Engineers, Delft, the NetherlandsInstitute for Marine and Atmospheric research Utrecht, Utrecht University, Utrecht, the NetherlandsIn this paper, we present Par@Graph, a software toolbox to reconstruct and analyze complex climate networks having a large number of nodes (up to at least 10<sup>6</sup>) and edges (up to at least 10<sup>12</sup>). The key innovation is an efficient set of parallel software tools designed to leverage the inherited hybrid parallelism in distributed-memory clusters of multi-core machines. The performance of the toolbox is illustrated through networks derived from sea surface height (SSH) data of a global high-resolution ocean model. Less than 8 min are needed on 90 Intel Xeon E5-4650 processors to reconstruct a climate network including the preprocessing and the correlation of 3 &times; 10<sup>5</sup> SSH time series, resulting in a weighted graph with the same number of vertices and about 3.2 &times; 10<sup>8</sup> edges. In less than 14 min on 30 processors, the resulted graph's degree centrality, strength, connected components, eigenvector centrality, entropy and clustering coefficient metrics were obtained. These results indicate that a complete cycle to construct and analyze a large-scale climate network is available under 22 min Par@Graph therefore facilitates the application of climate network analysis on high-resolution observations and model results, by enabling fast network reconstruct from the calculation of statistical similarities between climate time series. It also enables network analysis at unprecedented scales on a variety of different sizes of input data sets.http://www.geosci-model-dev.net/8/3321/2015/gmd-8-3321-2015.pdf
collection DOAJ
language English
format Article
sources DOAJ
author H. Ihshaish
A. Tantet
J. C. M. Dijkzeul
H. A. Dijkstra
spellingShingle H. Ihshaish
A. Tantet
J. C. M. Dijkzeul
H. A. Dijkstra
Par@Graph – a parallel toolbox for the construction and analysis of large complex climate networks
Geoscientific Model Development
author_facet H. Ihshaish
A. Tantet
J. C. M. Dijkzeul
H. A. Dijkstra
author_sort H. Ihshaish
title Par@Graph – a parallel toolbox for the construction and analysis of large complex climate networks
title_short Par@Graph – a parallel toolbox for the construction and analysis of large complex climate networks
title_full Par@Graph – a parallel toolbox for the construction and analysis of large complex climate networks
title_fullStr Par@Graph – a parallel toolbox for the construction and analysis of large complex climate networks
title_full_unstemmed Par@Graph – a parallel toolbox for the construction and analysis of large complex climate networks
title_sort par@graph – a parallel toolbox for the construction and analysis of large complex climate networks
publisher Copernicus Publications
series Geoscientific Model Development
issn 1991-959X
1991-9603
publishDate 2015-10-01
description In this paper, we present Par@Graph, a software toolbox to reconstruct and analyze complex climate networks having a large number of nodes (up to at least 10<sup>6</sup>) and edges (up to at least 10<sup>12</sup>). The key innovation is an efficient set of parallel software tools designed to leverage the inherited hybrid parallelism in distributed-memory clusters of multi-core machines. The performance of the toolbox is illustrated through networks derived from sea surface height (SSH) data of a global high-resolution ocean model. Less than 8 min are needed on 90 Intel Xeon E5-4650 processors to reconstruct a climate network including the preprocessing and the correlation of 3 &times; 10<sup>5</sup> SSH time series, resulting in a weighted graph with the same number of vertices and about 3.2 &times; 10<sup>8</sup> edges. In less than 14 min on 30 processors, the resulted graph's degree centrality, strength, connected components, eigenvector centrality, entropy and clustering coefficient metrics were obtained. These results indicate that a complete cycle to construct and analyze a large-scale climate network is available under 22 min Par@Graph therefore facilitates the application of climate network analysis on high-resolution observations and model results, by enabling fast network reconstruct from the calculation of statistical similarities between climate time series. It also enables network analysis at unprecedented scales on a variety of different sizes of input data sets.
url http://www.geosci-model-dev.net/8/3321/2015/gmd-8-3321-2015.pdf
work_keys_str_mv AT hihshaish pargraphaparalleltoolboxfortheconstructionandanalysisoflargecomplexclimatenetworks
AT atantet pargraphaparalleltoolboxfortheconstructionandanalysisoflargecomplexclimatenetworks
AT jcmdijkzeul pargraphaparalleltoolboxfortheconstructionandanalysisoflargecomplexclimatenetworks
AT hadijkstra pargraphaparalleltoolboxfortheconstructionandanalysisoflargecomplexclimatenetworks
_version_ 1725732717454163968