Optimizing Applications and Message-Passing Libraries for the QPACE Architecture

The goal of the QPACE project is to build a novel cost-efficient massive parallel supercomputer optimized for LQCD (Lattice Quantum Chromodynamics) applications. Unlike previous projects which use custom ASICs, this is accomplished by using the general purpose multi-core CPU PowerXCell 8i processor...

Full description

Bibliographic Details
Main Author: Wunderlich, Simon
Other Authors: TU Chemnitz, Fakultät für Informatik
Format: Dissertation
Language:English
Published: Universitätsbibliothek Chemnitz 2012
Subjects:
PPE
SPE
MPI
QCD
QMP
HPL
Online Access:http://nbn-resolving.de/urn:nbn:de:bsz:ch1-qucosa-89975
http://nbn-resolving.de/urn:nbn:de:bsz:ch1-qucosa-89975
http://www.qucosa.de/fileadmin/data/qucosa/documents/8997/Diplomarbeit_Simon_Wunderlich.pdf
http://www.qucosa.de/fileadmin/data/qucosa/documents/8997/signatur.txt.asc
id ndltd-DRESDEN-oai-qucosa.de-bsz-ch1-qucosa-89975
record_format oai_dc
spelling ndltd-DRESDEN-oai-qucosa.de-bsz-ch1-qucosa-899752013-01-07T20:05:13Z Optimizing Applications and Message-Passing Libraries for the QPACE Architecture Wunderlich, Simon PowerXCell PowerXCell 8i QPACE Cell PPE SPE MPI QCD QMP NICOLL Torus parallel supercomputer PowerXCell PowerXCell 8i QPACE Cell PPE SPE MPI QCD QMP NICOLL Torus parallel supercomputer ddc:000 Quantenchromodynamik HPL Field programmable gate array The goal of the QPACE project is to build a novel cost-efficient massive parallel supercomputer optimized for LQCD (Lattice Quantum Chromodynamics) applications. Unlike previous projects which use custom ASICs, this is accomplished by using the general purpose multi-core CPU PowerXCell 8i processor tightly coupled with a custom network processor implemented on a modern FPGA. The heterogeneous architecture of the PowerXCell 8i processor and its core-independent OS-bypassing access to the custom network hardware and application-oriented 3D torus topology pose interesting challenges for the implementation of the applications. This work will describe and evaluate the implementation possibilities of message passing APIs: the more general MPI, and the more QCD-oriented QMP, and their performance in PPE centric or SPE centric scenarios. These results will then be employed to optimize HPL for the QPACE architecture. Finally, the developed approaches and concepts will be briefly discussed regarding their applicability to heterogeneous node/network architectures as is the case in the "High-speed Network Interface with Collective Operation Support for Cell BE (NICOLL)" project. Universitätsbibliothek Chemnitz TU Chemnitz, Fakultät für Informatik Dipl-Inf. Torsten Mehlan Dipl-Ing. (FH) Hans Böttiger Dipl-Ing. (FH) Heiko J. Schick Prof. Dr-Ing. Wolfgang Rehm Dipl-Inf. Torsten Mehlan Dipl-Ing. (FH) Hans Böttiger Dipl-Ing. (FH) Heiko J. Schick 2012-07-18 doc-type:masterThesis application/pdf text/plain application/zip http://nbn-resolving.de/urn:nbn:de:bsz:ch1-qucosa-89975 urn:nbn:de:bsz:ch1-qucosa-89975 http://www.qucosa.de/fileadmin/data/qucosa/documents/8997/Diplomarbeit_Simon_Wunderlich.pdf http://www.qucosa.de/fileadmin/data/qucosa/documents/8997/signatur.txt.asc eng
collection NDLTD
language English
format Dissertation
sources NDLTD
topic PowerXCell
PowerXCell 8i
QPACE
Cell
PPE
SPE
MPI
QCD
QMP
NICOLL
Torus
parallel
supercomputer
PowerXCell
PowerXCell 8i
QPACE
Cell
PPE
SPE
MPI
QCD
QMP
NICOLL
Torus
parallel
supercomputer
ddc:000
Quantenchromodynamik
HPL
Field programmable gate array
spellingShingle PowerXCell
PowerXCell 8i
QPACE
Cell
PPE
SPE
MPI
QCD
QMP
NICOLL
Torus
parallel
supercomputer
PowerXCell
PowerXCell 8i
QPACE
Cell
PPE
SPE
MPI
QCD
QMP
NICOLL
Torus
parallel
supercomputer
ddc:000
Quantenchromodynamik
HPL
Field programmable gate array
Wunderlich, Simon
Optimizing Applications and Message-Passing Libraries for the QPACE Architecture
description The goal of the QPACE project is to build a novel cost-efficient massive parallel supercomputer optimized for LQCD (Lattice Quantum Chromodynamics) applications. Unlike previous projects which use custom ASICs, this is accomplished by using the general purpose multi-core CPU PowerXCell 8i processor tightly coupled with a custom network processor implemented on a modern FPGA. The heterogeneous architecture of the PowerXCell 8i processor and its core-independent OS-bypassing access to the custom network hardware and application-oriented 3D torus topology pose interesting challenges for the implementation of the applications. This work will describe and evaluate the implementation possibilities of message passing APIs: the more general MPI, and the more QCD-oriented QMP, and their performance in PPE centric or SPE centric scenarios. These results will then be employed to optimize HPL for the QPACE architecture. Finally, the developed approaches and concepts will be briefly discussed regarding their applicability to heterogeneous node/network architectures as is the case in the "High-speed Network Interface with Collective Operation Support for Cell BE (NICOLL)" project.
author2 TU Chemnitz, Fakultät für Informatik
author_facet TU Chemnitz, Fakultät für Informatik
Wunderlich, Simon
author Wunderlich, Simon
author_sort Wunderlich, Simon
title Optimizing Applications and Message-Passing Libraries for the QPACE Architecture
title_short Optimizing Applications and Message-Passing Libraries for the QPACE Architecture
title_full Optimizing Applications and Message-Passing Libraries for the QPACE Architecture
title_fullStr Optimizing Applications and Message-Passing Libraries for the QPACE Architecture
title_full_unstemmed Optimizing Applications and Message-Passing Libraries for the QPACE Architecture
title_sort optimizing applications and message-passing libraries for the qpace architecture
publisher Universitätsbibliothek Chemnitz
publishDate 2012
url http://nbn-resolving.de/urn:nbn:de:bsz:ch1-qucosa-89975
http://nbn-resolving.de/urn:nbn:de:bsz:ch1-qucosa-89975
http://www.qucosa.de/fileadmin/data/qucosa/documents/8997/Diplomarbeit_Simon_Wunderlich.pdf
http://www.qucosa.de/fileadmin/data/qucosa/documents/8997/signatur.txt.asc
work_keys_str_mv AT wunderlichsimon optimizingapplicationsandmessagepassinglibrariesfortheqpacearchitecture
_version_ 1716473273077727232