A hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clusters
A hybrid of message passing and shared memory techniques is presented for scalable parallelization of the adaptive integral method (AIM), an FFT based algorithm, on clusters of identical multi-core processors. The proposed hybrid MPI/OpenMP parallelization scheme is based on a nested one-dimensional...
Main Author: | |
---|---|
Other Authors: | |
Format: | Others |
Language: | English |
Published: |
2011
|
Subjects: | |
Online Access: | http://hdl.handle.net/2152/ETD-UT-2009-12-703 |
id |
ndltd-UTEXAS-oai-repositories.lib.utexas.edu-2152-ETD-UT-2009-12-703 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-UTEXAS-oai-repositories.lib.utexas.edu-2152-ETD-UT-2009-12-7032015-09-20T17:02:22ZA hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clustersWei, FangzhouAIMCFIEMPI/OpenMPMulti-core processor clusterMulti-core processingComputer architectureA hybrid of message passing and shared memory techniques is presented for scalable parallelization of the adaptive integral method (AIM), an FFT based algorithm, on clusters of identical multi-core processors. The proposed hybrid MPI/OpenMP parallelization scheme is based on a nested one-dimensional (1-D) slab decomposition of the 3-D auxiliary uniform grid and the associated AIM calculations: If there are M processors and T cores per processor, the scheme (i) divides the uniform grid into M slabs and MT sub-slabs, (ii) assigns each slab/sub-slab and the associated operations to one of the processors/cores, and (iii) uses MPI for inter-processor data communication and OpenMP for intra-processor data exchange. The MPI/OpenMP parallel AIM is used to accelerate the MOM solution of combined-field integral equations pertinent to the analysis of scattering from perfectly conducting surfaces. The scalability and efficiency of the implementation are investigated theoretically and verified numerically by solving benchmark scattering problems on a (near) petaflop supercomputing cluster of quad-core processors. The timing and speedup results on up to 1024 processors show that the proposed hybrid MPI/OpenMP parallelization exhibits better strong scalability (fixed problem size speedup) compared to pure MPI parallelization when multiple cores are used on each processor.textYilmaz, Ali E.2011-08-02T16:25:46Z2011-08-02T16:25:46Z2009-122011-08-02December 20092011-08-02T16:26:08Zthesisapplication/pdfhttp://hdl.handle.net/2152/ETD-UT-2009-12-7032152/ETD-UT-2009-12-703eng |
collection |
NDLTD |
language |
English |
format |
Others
|
sources |
NDLTD |
topic |
AIM CFIE MPI/OpenMP Multi-core processor cluster Multi-core processing Computer architecture |
spellingShingle |
AIM CFIE MPI/OpenMP Multi-core processor cluster Multi-core processing Computer architecture Wei, Fangzhou A hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clusters |
description |
A hybrid of message passing and shared memory techniques is presented for scalable parallelization of the adaptive integral method (AIM), an FFT based algorithm, on clusters of identical multi-core processors. The proposed hybrid MPI/OpenMP parallelization scheme is based on a nested one-dimensional (1-D) slab decomposition of the 3-D auxiliary uniform grid and the associated AIM calculations: If there are M processors and T cores per processor, the scheme (i) divides the uniform grid into M slabs and MT sub-slabs, (ii) assigns each slab/sub-slab and the associated operations to one of the processors/cores, and (iii) uses MPI for inter-processor data communication and OpenMP for intra-processor data exchange. The MPI/OpenMP parallel AIM is used to accelerate the MOM solution of combined-field integral equations pertinent to the analysis of scattering from perfectly conducting surfaces. The scalability and efficiency of the implementation are investigated theoretically and verified numerically by solving benchmark scattering problems on a (near) petaflop supercomputing cluster of quad-core processors. The timing and speedup results on up to 1024 processors show that the proposed hybrid MPI/OpenMP parallelization exhibits better strong scalability (fixed problem size speedup) compared to pure MPI parallelization when multiple cores are used on each processor. === text |
author2 |
Yilmaz, Ali E. |
author_facet |
Yilmaz, Ali E. Wei, Fangzhou |
author |
Wei, Fangzhou |
author_sort |
Wei, Fangzhou |
title |
A hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clusters |
title_short |
A hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clusters |
title_full |
A hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clusters |
title_fullStr |
A hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clusters |
title_full_unstemmed |
A hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clusters |
title_sort |
hybrid mpi/openmp parallelization of the adaptive integral method for multi-core clusters |
publishDate |
2011 |
url |
http://hdl.handle.net/2152/ETD-UT-2009-12-703 |
work_keys_str_mv |
AT weifangzhou ahybridmpiopenmpparallelizationoftheadaptiveintegralmethodformulticoreclusters AT weifangzhou hybridmpiopenmpparallelizationoftheadaptiveintegralmethodformulticoreclusters |
_version_ |
1716821897381937152 |