Integrating Multiple Clusters for Compute-intensive Applications

Multicluster grids provide one promising solution to satisfying the growing computational demands of compute-intensive applications. However, it is challenging to seamlessly integrate all participating clusters in different domains into a single virtual computational platform. In order to fully util...

Full description

Bibliographic Details
Main Author:	Yun, Zhifeng
Other Authors:	Katz, Daniel S
Format:	Others
Language:	en
Published:	LSU 2011
Subjects:	Electrical & Computer Engineering
Online Access:	http://etd.lsu.edu/docs/available/etd-07042011-201553/

id	ndltd-LSU-oai-etd.lsu.edu-etd-07042011-201553
record_format	oai_dc
spelling	ndltd-LSU-oai-etd.lsu.edu-etd-07042011-2015532013-01-07T22:53:33Z Integrating Multiple Clusters for Compute-intensive Applications Yun, Zhifeng Electrical & Computer Engineering Multicluster grids provide one promising solution to satisfying the growing computational demands of compute-intensive applications. However, it is challenging to seamlessly integrate all participating clusters in different domains into a single virtual computational platform. In order to fully utilize the capabilities of multicluster grids, computer scientists need to deal with the issue of joining together participating autonomic systems practically and efficiently to execute grid-enabled applications. Driven by several compute-intensive applications, this theses develops a multicluster grid management toolkit called Pelecanus to bridge the gap between user's needs and the system's heterogeneity. Application scientists will be able to conduct very large-scale execution across multiclusters with transparent QoS assurance. A novel model called DA-TC (Dynamic Assignment with Task Containers) is developed and is integrated into Pelecanus. This model uses the concept of a task container that allows one to decouple resource allocation from resource binding. It employs static load balancing for task container distribution and dynamic load balancing for task assignment. The slowest resources become useful rather than be bottlenecks in this manner. A cluster abstraction is implemented, which not only provides various cluster information for the DA-TC execution model, but also can be used as a standalone toolkit to monitor and evaluate the clusters' functionality and performance. The performance of the proposed DA-TC model is evaluated both theoretically and experimentally. Results demonstrate the importance of reducing queuing time in decreasing the total turnaround time for an application. Experiments were conducted to understand the performance of various aspects of the DA-TC model. Experiments showed that our model could significantly reduce turnaround time and increase resource utilization for our targeted application scenarios. Four applications are implemented as case studies to determine the applicability of the DA-TC model. In each case the turnaround time is greatly reduced, which demonstrates that the DA-TC model is efficient for assisting application scientists in conducting their research. In addition, virtual resources were integrated into the DA-TC model for application execution. Experiments show that the execution model proposed in this thesis can work seamlessly with multiple hybrid grid/cloud resources to achieve reduced turnaround time. Katz, Daniel S Ramanujam, Jagannathan Ditusa, John F Vaidyanathan, Ramachandran Chen, Jianhua Allen, Gabrielle D LSU 2011-07-06 text application/pdf http://etd.lsu.edu/docs/available/etd-07042011-201553/ http://etd.lsu.edu/docs/available/etd-07042011-201553/ en restricted I hereby certify that, if appropriate, I have obtained and attached herein a written permission statement from the owner(s) of each third party copyrighted matter to be included in my thesis, dissertation, or project report, allowing distribution as specified below. I certify that the version I submitted is the same as that approved by my advisory committee. I hereby grant to LSU or its agents the non-exclusive license to archive and make accessible, under the conditions specified below and in appropriate University policies, my thesis, dissertation, or project report in whole or in part in all forms of media, now or hereafter known. I retain all other ownership rights to the copyright of the thesis, dissertation or project report. I also retain the right to use in future works (such as articles or books) all or part of this thesis, dissertation, or project report.
collection	NDLTD
language	en
format	Others
sources	NDLTD
topic	Electrical & Computer Engineering
spellingShingle	Electrical & Computer Engineering Yun, Zhifeng Integrating Multiple Clusters for Compute-intensive Applications
description	Multicluster grids provide one promising solution to satisfying the growing computational demands of compute-intensive applications. However, it is challenging to seamlessly integrate all participating clusters in different domains into a single virtual computational platform. In order to fully utilize the capabilities of multicluster grids, computer scientists need to deal with the issue of joining together participating autonomic systems practically and efficiently to execute grid-enabled applications. Driven by several compute-intensive applications, this theses develops a multicluster grid management toolkit called Pelecanus to bridge the gap between user's needs and the system's heterogeneity. Application scientists will be able to conduct very large-scale execution across multiclusters with transparent QoS assurance. A novel model called DA-TC (Dynamic Assignment with Task Containers) is developed and is integrated into Pelecanus. This model uses the concept of a task container that allows one to decouple resource allocation from resource binding. It employs static load balancing for task container distribution and dynamic load balancing for task assignment. The slowest resources become useful rather than be bottlenecks in this manner. A cluster abstraction is implemented, which not only provides various cluster information for the DA-TC execution model, but also can be used as a standalone toolkit to monitor and evaluate the clusters' functionality and performance. The performance of the proposed DA-TC model is evaluated both theoretically and experimentally. Results demonstrate the importance of reducing queuing time in decreasing the total turnaround time for an application. Experiments were conducted to understand the performance of various aspects of the DA-TC model. Experiments showed that our model could significantly reduce turnaround time and increase resource utilization for our targeted application scenarios. Four applications are implemented as case studies to determine the applicability of the DA-TC model. In each case the turnaround time is greatly reduced, which demonstrates that the DA-TC model is efficient for assisting application scientists in conducting their research. In addition, virtual resources were integrated into the DA-TC model for application execution. Experiments show that the execution model proposed in this thesis can work seamlessly with multiple hybrid grid/cloud resources to achieve reduced turnaround time.
author2	Katz, Daniel S
author_facet	Katz, Daniel S Yun, Zhifeng
author	Yun, Zhifeng
author_sort	Yun, Zhifeng
title	Integrating Multiple Clusters for Compute-intensive Applications
title_short	Integrating Multiple Clusters for Compute-intensive Applications
title_full	Integrating Multiple Clusters for Compute-intensive Applications
title_fullStr	Integrating Multiple Clusters for Compute-intensive Applications
title_full_unstemmed	Integrating Multiple Clusters for Compute-intensive Applications
title_sort	integrating multiple clusters for compute-intensive applications
publisher	LSU
publishDate	2011
url	http://etd.lsu.edu/docs/available/etd-07042011-201553/
work_keys_str_mv	AT yunzhifeng integratingmultipleclustersforcomputeintensiveapplications
_version_	1716478167227564032

Integrating Multiple Clusters for Compute-intensive Applications

Similar Items