Automatic Empirical Techniques for Developing Efficient MPI Collective Communication Routines
Due to the wide use of collective operations in Message Passing Interface (MPI) applications, developing efficient collective communication routines is essential. Despite numerous research efforts for optimizing MPI collective operations, it is still not clear how to obtain MPI collective routines t...
Other Authors: | Faraj, Ahmad |
---|---|
Format: | Others |
Language: | English |
Published: | Florida State University |
Subjects: | Computer science |
Online Access: | http://purl.flvc.org/fsu/fd/FSU_migr_etd-4504 |
id |
ndltd-fsu.edu-oai-fsu.digital.flvc.org-fsu_182623 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-fsu.edu-oai-fsu.digital.flvc.org-fsu_182623 2020-06-13T03:09:00Z Automatic Empirical Techniques for Developing Efficient MPI Collective Communication Routines Faraj, Ahmad (author) Yuan, Xin (professor directing dissertation) Liu, Guosheng (outside committee member) Whalley, David (committee member) Gallivan, Kyle (committee member) Srinivasan, Ashok (committee member) Department of Computer Science (degree granting department) Florida State University (degree granting institution) Text text Florida State University Florida State University English eng 1 online resource computer application/pdf Computer science FSU_migr_etd-4504 http://purl.flvc.org/fsu/fd/FSU_migr_etd-4504 This Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s). The copyright in theses and dissertations completed at Florida State University is held by the students who author them.
http://diginole.lib.fsu.edu/islandora/object/fsu%3A182623/datastream/TN/view/Automatic%20Empirical%20Techniques%20for%20Developing%20Efficient%20MPI%20Collective%20Communication%20Routines.jpg |
collection |
NDLTD |
language |
English |
format |
Others |
sources |
NDLTD |
topic |
Computer science |
description |
Due to the wide use of collective operations in Message Passing Interface (MPI) applications, developing efficient collective communication routines is essential. Despite numerous research efforts to optimize MPI collective operations, it is still not clear how to obtain MPI collective routines that achieve high performance across platforms and applications. In particular, while it may not be extremely difficult to develop an efficient communication algorithm for a given platform and a given application, including such an algorithm in an MPI library poses a significant challenge: the communication library is general-purpose and must provide efficient routines for different platforms and applications. In this research, a new library implementation paradigm called delayed finalization of MPI collective communication routines (DF) is proposed for realizing efficient MPI collective routines across platforms and applications. The idea is to postpone the decision of which algorithm to use for a collective operation until the platform and/or application are known. Using the DF approach, the MPI library can maintain, for each communication operation, an extensive set of algorithms, and use an automatic algorithm selection mechanism to decide the appropriate algorithm for a given platform and a given application. Hence, a DF-based library can adapt to platforms and applications. To verify that the DF approach is effective and practical, Ethernet-switched clusters are selected as the experimental platform and two DF-based MPI libraries, STAGE-MPI and STAR-MPI, are developed and evaluated. In the development of the DF-based libraries, topology-specific algorithms for all-to-all, all-gather, and broadcast operations are designed for Ethernet-switched clusters. 
The experimental results indicate that both STAGE-MPI and STAR-MPI significantly outperform traditional MPI libraries, including LAM/MPI and MPICH, in many cases. This demonstrates that the performance of MPI collective library routines can be significantly improved by (1) incorporating platform/application-specific communication algorithms in the MPI library, and (2) making the library adaptable to platforms and applications. === A Dissertation submitted to the Department of Computer Science in partial fulfillment of the requirements for the degree of Doctor of Philosophy. === Summer Semester, 2006. === July 7, 2006. === MPI, High, Empirical Techniques, Adaptability, Collective Routines === Includes bibliographical references. === Xin Yuan, Professor Directing Dissertation; Guosheng Liu, Outside Committee Member; David Whalley, Committee Member; Kyle Gallivan, Committee Member; Ashok Srinivasan, Committee Member. |
author2 |
Faraj, Ahmad (author) |
author_facet |
Faraj, Ahmad (author) |
title |
Automatic Empirical Techniques for Developing Efficient MPI Collective Communication Routines |
publisher |
Florida State University |
url |
http://purl.flvc.org/fsu/fd/FSU_migr_etd-4504 |
_version_ |
1719319388868313088 |
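
The delayed-finalization idea in the description (keep several candidate algorithms per collective operation and choose one only once the platform and workload are known) can be illustrated with a small sketch. This is not the code of STAGE-MPI or STAR-MPI and it does not use MPI at all: the all-gather variants are simulated on in-memory lists, and every name here (`flat_allgather`, `ring_allgather`, `select_algorithm`, `DelayedCollective`, the `measure` hook) is a hypothetical stand-in chosen for illustration.

```python
import time
from typing import Callable, Dict, List, Optional, Sequence

Algorithm = Callable[[Sequence[list]], list]
Measure = Callable[[Algorithm, Sequence[list]], float]


def flat_allgather(blocks: Sequence[list]) -> list:
    """Simulated all-gather: concatenate every rank's block in rank order."""
    out: list = []
    for block in blocks:
        out.extend(block)
    return out


def ring_allgather(blocks: Sequence[list]) -> list:
    """Simulated ring all-gather: p-1 steps, each rank receives one block
    from its left neighbor per step. Returns rank 0's final view."""
    p = len(blocks)
    if p == 0:
        return []
    held: List[dict] = [{r: blocks[r]} for r in range(p)]
    for step in range(1, p):
        # Collect all transfers first so the step behaves as if simultaneous.
        transfers = []
        for r in range(p):
            src = (r - 1) % p
            idx = (r - step) % p
            transfers.append((r, idx, held[src][idx]))
        for r, idx, block in transfers:
            held[r][idx] = block
    return [x for i in range(p) for x in held[0][i]]


def wall_clock(fn: Algorithm, sample: Sequence[list]) -> float:
    """Default cost measure: elapsed time of one run on this platform."""
    start = time.perf_counter()
    fn(sample)
    return time.perf_counter() - start


def select_algorithm(candidates: Dict[str, Algorithm],
                     sample: Sequence[list],
                     measure: Optional[Measure] = None,
                     trials: int = 3) -> str:
    """Empirically pick the candidate with the lowest measured cost."""
    measure = measure or wall_clock
    best_name, best_cost = None, float("inf")
    for name, fn in candidates.items():
        cost = min(measure(fn, sample) for _ in range(trials))
        if cost < best_cost:
            best_name, best_cost = name, cost
    if best_name is None:
        raise ValueError("no candidate algorithms supplied")
    return best_name


class DelayedCollective:
    """Defers the algorithm choice until the first real call."""

    def __init__(self, candidates: Dict[str, Algorithm],
                 measure: Optional[Measure] = None) -> None:
        self.candidates = dict(candidates)
        self.measure = measure
        self.chosen: Optional[str] = None

    def __call__(self, blocks: Sequence[list]) -> list:
        if self.chosen is None:
            # The decision is made here, with the actual input in hand.
            self.chosen = select_algorithm(self.candidates, blocks, self.measure)
        return self.candidates[self.chosen](blocks)
```

In this sketch the `measure` hook is injectable so that the selection logic can be exercised deterministically; a real library would time actual communication on the target cluster rather than list operations.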