Computation and Communication Schedule Optimization for Jobs with Shared Data
碩士 === 臺灣大學 === 資訊工程學研究所 === 95 === Almost every computation job in the cluster or grid systems requires input data in order to find the solution, and the computation cannot proceed without the required data become available. As a result a proper interleaving of data transfer and job execution has...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2007
|
Online Access: | http://ndltd.ncl.edu.tw/handle/33141578768478723083 |
id |
ndltd-TW-095NTU05392138 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-095NTU053921382015-10-13T13:55:54Z http://ndltd.ncl.edu.tw/handle/33141578768478723083 Computation and Communication Schedule Optimization for Jobs with Shared Data 互享資料工作之計算與通訊排程最佳化 En-Jan Chou 周恩冉 碩士 臺灣大學 資訊工程學研究所 95 Almost every computation job in the cluster or grid systems requires input data in order to find the solution, and the computation cannot proceed without the required data become available. As a result a proper interleaving of data transfer and job execution has a significant impact on the overall efficiency. In this paper we analyze the computational complexity of the shared data job scheduling problem, with and without consideration of storage capacity constraint. We show that if there is an upper bound on the server capacity, the problem is NP-complete, even when each job depends on at most three data. On the other hand, if there is no upper bound on the server capacity, we show that there exists an efficient algorithm that gives optimal job schedule when each job depends on at most two data. We also give an effective heuristic algorithm that gives good schedule for cases where there is no limit on the number of data a job may access. Experimental results indicate that this heuristic algorithm performs very well, and gives near optimal solutions. Pangfeng Liu 劉邦鋒 2007 學位論文 ; thesis 33 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 臺灣大學 === 資訊工程學研究所 === 95 === Almost every computation job in the cluster or grid systems requires input data in order to find the solution, and the computation cannot proceed without the required data become available. As a result a proper interleaving of data transfer and job execution has a significant impact on the overall efficiency. In this paper we analyze the computational complexity of the shared data job scheduling problem, with and without consideration of storage capacity constraint. We show that if there is an upper bound on the server capacity, the problem is NP-complete, even when each job depends on at most three data. On the other hand, if there is no upper bound on the server capacity, we show that there exists an efficient algorithm that gives optimal job schedule when each job depends on at most two data. We also give an effective heuristic algorithm that gives good schedule for cases where there is no limit on the number of data a job may access. Experimental results indicate that this heuristic algorithm performs very well, and gives near optimal solutions.
|
author2 |
Pangfeng Liu |
author_facet |
Pangfeng Liu En-Jan Chou 周恩冉 |
author |
En-Jan Chou 周恩冉 |
spellingShingle |
En-Jan Chou 周恩冉 Computation and Communication Schedule Optimization for Jobs with Shared Data |
author_sort |
En-Jan Chou |
title |
Computation and Communication Schedule Optimization for Jobs with Shared Data |
title_short |
Computation and Communication Schedule Optimization for Jobs with Shared Data |
title_full |
Computation and Communication Schedule Optimization for Jobs with Shared Data |
title_fullStr |
Computation and Communication Schedule Optimization for Jobs with Shared Data |
title_full_unstemmed |
Computation and Communication Schedule Optimization for Jobs with Shared Data |
title_sort |
computation and communication schedule optimization for jobs with shared data |
publishDate |
2007 |
url |
http://ndltd.ncl.edu.tw/handle/33141578768478723083 |
work_keys_str_mv |
AT enjanchou computationandcommunicationscheduleoptimizationforjobswithshareddata AT zhōuēnrǎn computationandcommunicationscheduleoptimizationforjobswithshareddata AT enjanchou hùxiǎngzīliàogōngzuòzhījìsuànyǔtōngxùnpáichéngzuìjiāhuà AT zhōuēnrǎn hùxiǎngzīliàogōngzuòzhījìsuànyǔtōngxùnpáichéngzuìjiāhuà |
_version_ |
1717745189694472192 |