A hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clusters

A hybrid of message passing and shared memory techniques is presented for scalable parallelization of the adaptive integral method (AIM), an FFT based algorithm, on clusters of identical multi-core processors. The proposed hybrid MPI/OpenMP parallelization scheme is based on a nested one-dimensional...

Full description

Bibliographic Details
Main Author:	Wei, Fangzhou
Other Authors:	Yilmaz, Ali E.
Format:	Others
Language:	English
Published:	2011
Subjects:	AIM CFIE MPI/OpenMP Multi-core processor cluster Multi-core processing Computer architecture
Online Access:	http://hdl.handle.net/2152/ETD-UT-2009-12-703

id	ndltd-UTEXAS-oai-repositories.lib.utexas.edu-2152-ETD-UT-2009-12-703
record_format	oai_dc
spelling	ndltd-UTEXAS-oai-repositories.lib.utexas.edu-2152-ETD-UT-2009-12-7032015-09-20T17:02:22ZA hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clustersWei, FangzhouAIMCFIEMPI/OpenMPMulti-core processor clusterMulti-core processingComputer architectureA hybrid of message passing and shared memory techniques is presented for scalable parallelization of the adaptive integral method (AIM), an FFT based algorithm, on clusters of identical multi-core processors. The proposed hybrid MPI/OpenMP parallelization scheme is based on a nested one-dimensional (1-D) slab decomposition of the 3-D auxiliary uniform grid and the associated AIM calculations: If there are M processors and T cores per processor, the scheme (i) divides the uniform grid into M slabs and MT sub-slabs, (ii) assigns each slab/sub-slab and the associated operations to one of the processors/cores, and (iii) uses MPI for inter-processor data communication and OpenMP for intra-processor data exchange. The MPI/OpenMP parallel AIM is used to accelerate the MOM solution of combined-field integral equations pertinent to the analysis of scattering from perfectly conducting surfaces. The scalability and efficiency of the implementation are investigated theoretically and verified numerically by solving benchmark scattering problems on a (near) petaflop supercomputing cluster of quad-core processors. The timing and speedup results on up to 1024 processors show that the proposed hybrid MPI/OpenMP parallelization exhibits better strong scalability (fixed problem size speedup) compared to pure MPI parallelization when multiple cores are used on each processor.textYilmaz, Ali E.2011-08-02T16:25:46Z2011-08-02T16:25:46Z2009-122011-08-02December 20092011-08-02T16:26:08Zthesisapplication/pdfhttp://hdl.handle.net/2152/ETD-UT-2009-12-7032152/ETD-UT-2009-12-703eng
collection	NDLTD
language	English
format	Others
sources	NDLTD
topic	AIM CFIE MPI/OpenMP Multi-core processor cluster Multi-core processing Computer architecture
spellingShingle	AIM CFIE MPI/OpenMP Multi-core processor cluster Multi-core processing Computer architecture Wei, Fangzhou A hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clusters
description	A hybrid of message passing and shared memory techniques is presented for scalable parallelization of the adaptive integral method (AIM), an FFT based algorithm, on clusters of identical multi-core processors. The proposed hybrid MPI/OpenMP parallelization scheme is based on a nested one-dimensional (1-D) slab decomposition of the 3-D auxiliary uniform grid and the associated AIM calculations: If there are M processors and T cores per processor, the scheme (i) divides the uniform grid into M slabs and MT sub-slabs, (ii) assigns each slab/sub-slab and the associated operations to one of the processors/cores, and (iii) uses MPI for inter-processor data communication and OpenMP for intra-processor data exchange. The MPI/OpenMP parallel AIM is used to accelerate the MOM solution of combined-field integral equations pertinent to the analysis of scattering from perfectly conducting surfaces. The scalability and efficiency of the implementation are investigated theoretically and verified numerically by solving benchmark scattering problems on a (near) petaflop supercomputing cluster of quad-core processors. The timing and speedup results on up to 1024 processors show that the proposed hybrid MPI/OpenMP parallelization exhibits better strong scalability (fixed problem size speedup) compared to pure MPI parallelization when multiple cores are used on each processor. === text
author2	Yilmaz, Ali E.
author_facet	Yilmaz, Ali E. Wei, Fangzhou
author	Wei, Fangzhou
author_sort	Wei, Fangzhou
title	A hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clusters
title_short	A hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clusters
title_full	A hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clusters
title_fullStr	A hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clusters
title_full_unstemmed	A hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clusters
title_sort	hybrid mpi/openmp parallelization of the adaptive integral method for multi-core clusters
publishDate	2011
url	http://hdl.handle.net/2152/ETD-UT-2009-12-703
work_keys_str_mv	AT weifangzhou ahybridmpiopenmpparallelizationoftheadaptiveintegralmethodformulticoreclusters AT weifangzhou hybridmpiopenmpparallelizationoftheadaptiveintegralmethodformulticoreclusters
_version_	1716821897381937152

A hybrid MPI/OpenMP parallelization of the adaptive integral method for multi-core clusters

Similar Items