Integrating Multiple Clusters for Compute-intensive Applications

Multicluster grids provide one promising solution to satisfying the growing computational demands of compute-intensive applications. However, it is challenging to seamlessly integrate all participating clusters in different domains into a single virtual computational platform. In order to fully util...

Full description

Bibliographic Details
Main Author: Yun, Zhifeng
Other Authors: Katz, Daniel S
Format: Others
Language:en
Published: LSU 2011
Subjects:
Online Access:http://etd.lsu.edu/docs/available/etd-07042011-201553/
id ndltd-LSU-oai-etd.lsu.edu-etd-07042011-201553
record_format oai_dc
spelling ndltd-LSU-oai-etd.lsu.edu-etd-07042011-2015532013-01-07T22:53:33Z Integrating Multiple Clusters for Compute-intensive Applications Yun, Zhifeng Electrical & Computer Engineering Multicluster grids provide one promising solution to satisfying the growing computational demands of compute-intensive applications. However, it is challenging to seamlessly integrate all participating clusters in different domains into a single virtual computational platform. In order to fully utilize the capabilities of multicluster grids, computer scientists need to deal with the issue of joining together participating autonomic systems practically and efficiently to execute grid-enabled applications. Driven by several compute-intensive applications, this theses develops a multicluster grid management toolkit called Pelecanus to bridge the gap between user's needs and the system's heterogeneity. Application scientists will be able to conduct very large-scale execution across multiclusters with transparent QoS assurance. A novel model called DA-TC (Dynamic Assignment with Task Containers) is developed and is integrated into Pelecanus. This model uses the concept of a task container that allows one to decouple resource allocation from resource binding. It employs static load balancing for task container distribution and dynamic load balancing for task assignment. The slowest resources become useful rather than be bottlenecks in this manner. A cluster abstraction is implemented, which not only provides various cluster information for the DA-TC execution model, but also can be used as a standalone toolkit to monitor and evaluate the clusters' functionality and performance. The performance of the proposed DA-TC model is evaluated both theoretically and experimentally. Results demonstrate the importance of reducing queuing time in decreasing the total turnaround time for an application. Experiments were conducted to understand the performance of various aspects of the DA-TC model. Experiments showed that our model could significantly reduce turnaround time and increase resource utilization for our targeted application scenarios. Four applications are implemented as case studies to determine the applicability of the DA-TC model. In each case the turnaround time is greatly reduced, which demonstrates that the DA-TC model is efficient for assisting application scientists in conducting their research. In addition, virtual resources were integrated into the DA-TC model for application execution. Experiments show that the execution model proposed in this thesis can work seamlessly with multiple hybrid grid/cloud resources to achieve reduced turnaround time. Katz, Daniel S Ramanujam, Jagannathan Ditusa, John F Vaidyanathan, Ramachandran Chen, Jianhua Allen, Gabrielle D LSU 2011-07-06 text application/pdf http://etd.lsu.edu/docs/available/etd-07042011-201553/ http://etd.lsu.edu/docs/available/etd-07042011-201553/ en restricted I hereby certify that, if appropriate, I have obtained and attached herein a written permission statement from the owner(s) of each third party copyrighted matter to be included in my thesis, dissertation, or project report, allowing distribution as specified below. I certify that the version I submitted is the same as that approved by my advisory committee. I hereby grant to LSU or its agents the non-exclusive license to archive and make accessible, under the conditions specified below and in appropriate University policies, my thesis, dissertation, or project report in whole or in part in all forms of media, now or hereafter known. I retain all other ownership rights to the copyright of the thesis, dissertation or project report. I also retain the right to use in future works (such as articles or books) all or part of this thesis, dissertation, or project report.
collection NDLTD
language en
format Others
sources NDLTD
topic Electrical & Computer Engineering
spellingShingle Electrical & Computer Engineering
Yun, Zhifeng
Integrating Multiple Clusters for Compute-intensive Applications
description Multicluster grids provide one promising solution to satisfying the growing computational demands of compute-intensive applications. However, it is challenging to seamlessly integrate all participating clusters in different domains into a single virtual computational platform. In order to fully utilize the capabilities of multicluster grids, computer scientists need to deal with the issue of joining together participating autonomic systems practically and efficiently to execute grid-enabled applications. Driven by several compute-intensive applications, this theses develops a multicluster grid management toolkit called Pelecanus to bridge the gap between user's needs and the system's heterogeneity. Application scientists will be able to conduct very large-scale execution across multiclusters with transparent QoS assurance. A novel model called DA-TC (Dynamic Assignment with Task Containers) is developed and is integrated into Pelecanus. This model uses the concept of a task container that allows one to decouple resource allocation from resource binding. It employs static load balancing for task container distribution and dynamic load balancing for task assignment. The slowest resources become useful rather than be bottlenecks in this manner. A cluster abstraction is implemented, which not only provides various cluster information for the DA-TC execution model, but also can be used as a standalone toolkit to monitor and evaluate the clusters' functionality and performance. The performance of the proposed DA-TC model is evaluated both theoretically and experimentally. Results demonstrate the importance of reducing queuing time in decreasing the total turnaround time for an application. Experiments were conducted to understand the performance of various aspects of the DA-TC model. Experiments showed that our model could significantly reduce turnaround time and increase resource utilization for our targeted application scenarios. Four applications are implemented as case studies to determine the applicability of the DA-TC model. In each case the turnaround time is greatly reduced, which demonstrates that the DA-TC model is efficient for assisting application scientists in conducting their research. In addition, virtual resources were integrated into the DA-TC model for application execution. Experiments show that the execution model proposed in this thesis can work seamlessly with multiple hybrid grid/cloud resources to achieve reduced turnaround time.
author2 Katz, Daniel S
author_facet Katz, Daniel S
Yun, Zhifeng
author Yun, Zhifeng
author_sort Yun, Zhifeng
title Integrating Multiple Clusters for Compute-intensive Applications
title_short Integrating Multiple Clusters for Compute-intensive Applications
title_full Integrating Multiple Clusters for Compute-intensive Applications
title_fullStr Integrating Multiple Clusters for Compute-intensive Applications
title_full_unstemmed Integrating Multiple Clusters for Compute-intensive Applications
title_sort integrating multiple clusters for compute-intensive applications
publisher LSU
publishDate 2011
url http://etd.lsu.edu/docs/available/etd-07042011-201553/
work_keys_str_mv AT yunzhifeng integratingmultipleclustersforcomputeintensiveapplications
_version_ 1716478167227564032