A hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clusters

A hybrid of message passing and shared memory techniques is presented for scalable parallelization of the adaptive integral method (AIM), an FFT based algorithm, on clusters of identical multi-core processors. The proposed hybrid MPI/OpenMP parallelization scheme is based on a nested one-dimensional...

Full description

Bibliographic Details
Main Author: Wei, Fangzhou
Other Authors: Yilmaz, Ali E.
Format: Others
Language:English
Published: 2011
Subjects:
AIM
Online Access:http://hdl.handle.net/2152/ETD-UT-2009-12-703
id ndltd-UTEXAS-oai-repositories.lib.utexas.edu-2152-ETD-UT-2009-12-703
record_format oai_dc
spelling ndltd-UTEXAS-oai-repositories.lib.utexas.edu-2152-ETD-UT-2009-12-7032015-09-20T17:02:22ZA hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clustersWei, FangzhouAIMCFIEMPI/OpenMPMulti-core processor clusterMulti-core processingComputer architectureA hybrid of message passing and shared memory techniques is presented for scalable parallelization of the adaptive integral method (AIM), an FFT based algorithm, on clusters of identical multi-core processors. The proposed hybrid MPI/OpenMP parallelization scheme is based on a nested one-dimensional (1-D) slab decomposition of the 3-D auxiliary uniform grid and the associated AIM calculations: If there are M processors and T cores per processor, the scheme (i) divides the uniform grid into M slabs and MT sub-slabs, (ii) assigns each slab/sub-slab and the associated operations to one of the processors/cores, and (iii) uses MPI for inter-processor data communication and OpenMP for intra-processor data exchange. The MPI/OpenMP parallel AIM is used to accelerate the MOM solution of combined-field integral equations pertinent to the analysis of scattering from perfectly conducting surfaces. The scalability and efficiency of the implementation are investigated theoretically and verified numerically by solving benchmark scattering problems on a (near) petaflop supercomputing cluster of quad-core processors. The timing and speedup results on up to 1024 processors show that the proposed hybrid MPI/OpenMP parallelization exhibits better strong scalability (fixed problem size speedup) compared to pure MPI parallelization when multiple cores are used on each processor.textYilmaz, Ali E.2011-08-02T16:25:46Z2011-08-02T16:25:46Z2009-122011-08-02December 20092011-08-02T16:26:08Zthesisapplication/pdfhttp://hdl.handle.net/2152/ETD-UT-2009-12-7032152/ETD-UT-2009-12-703eng
collection NDLTD
language English
format Others
sources NDLTD
topic AIM
CFIE
MPI/OpenMP
Multi-core processor cluster
Multi-core processing
Computer architecture
spellingShingle AIM
CFIE
MPI/OpenMP
Multi-core processor cluster
Multi-core processing
Computer architecture
Wei, Fangzhou
A hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clusters
description A hybrid of message passing and shared memory techniques is presented for scalable parallelization of the adaptive integral method (AIM), an FFT based algorithm, on clusters of identical multi-core processors. The proposed hybrid MPI/OpenMP parallelization scheme is based on a nested one-dimensional (1-D) slab decomposition of the 3-D auxiliary uniform grid and the associated AIM calculations: If there are M processors and T cores per processor, the scheme (i) divides the uniform grid into M slabs and MT sub-slabs, (ii) assigns each slab/sub-slab and the associated operations to one of the processors/cores, and (iii) uses MPI for inter-processor data communication and OpenMP for intra-processor data exchange. The MPI/OpenMP parallel AIM is used to accelerate the MOM solution of combined-field integral equations pertinent to the analysis of scattering from perfectly conducting surfaces. The scalability and efficiency of the implementation are investigated theoretically and verified numerically by solving benchmark scattering problems on a (near) petaflop supercomputing cluster of quad-core processors. The timing and speedup results on up to 1024 processors show that the proposed hybrid MPI/OpenMP parallelization exhibits better strong scalability (fixed problem size speedup) compared to pure MPI parallelization when multiple cores are used on each processor. === text
author2 Yilmaz, Ali E.
author_facet Yilmaz, Ali E.
Wei, Fangzhou
author Wei, Fangzhou
author_sort Wei, Fangzhou
title A hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clusters
title_short A hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clusters
title_full A hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clusters
title_fullStr A hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clusters
title_full_unstemmed A hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clusters
title_sort hybrid mpi/openmp parallelization of the adaptive integral method for multi-core clusters
publishDate 2011
url http://hdl.handle.net/2152/ETD-UT-2009-12-703
work_keys_str_mv AT weifangzhou ahybridmpiopenmpparallelizationoftheadaptiveintegralmethodformulticoreclusters
AT weifangzhou hybridmpiopenmpparallelizationoftheadaptiveintegralmethodformulticoreclusters
_version_ 1716821897381937152