Performance Evaluation and Prediction of Parallel Applications

Analyzing and understanding the performance behavior of parallel applicationson various compute infrastructures is a long-standing concern in the HighPerformance Computing community. When the targeted execution environments arenot available, simulation is a reasonable approach to obtain objectiveper...

Full description

Bibliographic Details
Main Author: Markomanolis, Georgios
Language:ENG
Published: Ecole normale supérieure de lyon - ENS LYON 2014
Subjects:
MPI
Online Access:http://tel.archives-ouvertes.fr/tel-00951125
http://tel.archives-ouvertes.fr/docs/00/95/11/25/PDF/MARKOMANOLIS_Georgios_2014_These.pdf
id ndltd-CCSD-oai-tel.archives-ouvertes.fr-tel-00951125
record_format oai_dc
spelling ndltd-CCSD-oai-tel.archives-ouvertes.fr-tel-009511252014-02-26T03:21:31Z http://tel.archives-ouvertes.fr/tel-00951125 2014ENSL0880 http://tel.archives-ouvertes.fr/docs/00/95/11/25/PDF/MARKOMANOLIS_Georgios_2014_These.pdf Performance Evaluation and Prediction of Parallel Applications Markomanolis, Georgios [INFO:INFO_OH] Computer Science/Other [INFO:INFO_OH] Informatique/Autre Trace acquisition Off-line simulation Performance evaluation Performance prediction MPI Analyzing and understanding the performance behavior of parallel applicationson various compute infrastructures is a long-standing concern in the HighPerformance Computing community. When the targeted execution environments arenot available, simulation is a reasonable approach to obtain objectiveperformance indicators and explore various ''what-if?'' scenarios. In thiswork we present a framework for the off-line simulation of MPIapplications. The main originality of our work with regard to the literature is to rely on\tit execution traces. This allows for an extreme scalability as heterogeneousand distributed resources can be used to acquire a trace. We propose a formatwhere for each event that occurs during the execution of an application we logthe volume of instructions for a computation phase or the bytes and the type ofa communication. To acquire time-independent traces of the execution of MPI applications, wehave to instrument them to log the required data. There exist many profilingtools which can instrument an application. We propose a scoring system thatcorresponds to our framework specific requirements and evaluate the mostwell-known and open source profiling tools according to it. Furthermore weintroduce an original tool called Minimal Instrumentation that was designed tofulfill the requirements of our framework. We study different instrumentationmethods and we also investigate several acquisition strategies. We detail thetools that extract the \tit traces from the instrumentation traces of somewell-known profiling tools. Finally we evaluate the whole acquisition procedureand we present the acquisition of large scale instances. We describe in detail the procedure to provide a realistic simulated platformfile to our trace replay tool taking under consideration the topology of thereal platform and the calibration procedure with regard to the application thatis going to be simulated. Moreover we present the implemented trace replaytools that we used during this work. We show that our simulator can predictthe performance of some MPI benchmarks with less than 11\% relativeerror between the real execution and simulation for the cases that there is noperformance issue. Finally, we identify the reasons of the performance issuesand we propose solutions. 2014-01-20 ENG PhD thesis Ecole normale supérieure de lyon - ENS LYON
collection NDLTD
language ENG
sources NDLTD
topic [INFO:INFO_OH] Computer Science/Other
[INFO:INFO_OH] Informatique/Autre
Trace acquisition
Off-line simulation
Performance evaluation
Performance prediction
MPI
spellingShingle [INFO:INFO_OH] Computer Science/Other
[INFO:INFO_OH] Informatique/Autre
Trace acquisition
Off-line simulation
Performance evaluation
Performance prediction
MPI
Markomanolis, Georgios
Performance Evaluation and Prediction of Parallel Applications
description Analyzing and understanding the performance behavior of parallel applicationson various compute infrastructures is a long-standing concern in the HighPerformance Computing community. When the targeted execution environments arenot available, simulation is a reasonable approach to obtain objectiveperformance indicators and explore various ''what-if?'' scenarios. In thiswork we present a framework for the off-line simulation of MPIapplications. The main originality of our work with regard to the literature is to rely on\tit execution traces. This allows for an extreme scalability as heterogeneousand distributed resources can be used to acquire a trace. We propose a formatwhere for each event that occurs during the execution of an application we logthe volume of instructions for a computation phase or the bytes and the type ofa communication. To acquire time-independent traces of the execution of MPI applications, wehave to instrument them to log the required data. There exist many profilingtools which can instrument an application. We propose a scoring system thatcorresponds to our framework specific requirements and evaluate the mostwell-known and open source profiling tools according to it. Furthermore weintroduce an original tool called Minimal Instrumentation that was designed tofulfill the requirements of our framework. We study different instrumentationmethods and we also investigate several acquisition strategies. We detail thetools that extract the \tit traces from the instrumentation traces of somewell-known profiling tools. Finally we evaluate the whole acquisition procedureand we present the acquisition of large scale instances. We describe in detail the procedure to provide a realistic simulated platformfile to our trace replay tool taking under consideration the topology of thereal platform and the calibration procedure with regard to the application thatis going to be simulated. Moreover we present the implemented trace replaytools that we used during this work. We show that our simulator can predictthe performance of some MPI benchmarks with less than 11\% relativeerror between the real execution and simulation for the cases that there is noperformance issue. Finally, we identify the reasons of the performance issuesand we propose solutions.
author Markomanolis, Georgios
author_facet Markomanolis, Georgios
author_sort Markomanolis, Georgios
title Performance Evaluation and Prediction of Parallel Applications
title_short Performance Evaluation and Prediction of Parallel Applications
title_full Performance Evaluation and Prediction of Parallel Applications
title_fullStr Performance Evaluation and Prediction of Parallel Applications
title_full_unstemmed Performance Evaluation and Prediction of Parallel Applications
title_sort performance evaluation and prediction of parallel applications
publisher Ecole normale supérieure de lyon - ENS LYON
publishDate 2014
url http://tel.archives-ouvertes.fr/tel-00951125
http://tel.archives-ouvertes.fr/docs/00/95/11/25/PDF/MARKOMANOLIS_Georgios_2014_These.pdf
work_keys_str_mv AT markomanolisgeorgios performanceevaluationandpredictionofparallelapplications
_version_ 1716648478309875712