Computation and Communication Schedule Optimization for Jobs with Shared Data

碩士 === 臺灣大學 === 資訊工程學研究所 === 95 === Almost every computation job in the cluster or grid systems requires input data in order to find the solution, and the computation cannot proceed without the required data become available. As a result a proper interleaving of data transfer and job execution has...

Full description

Bibliographic Details
Main Authors: En-Jan Chou, 周恩冉
Other Authors: Pangfeng Liu
Format: Others
Language:en_US
Published: 2007
Online Access:http://ndltd.ncl.edu.tw/handle/33141578768478723083
id ndltd-TW-095NTU05392138
record_format oai_dc
spelling ndltd-TW-095NTU053921382015-10-13T13:55:54Z http://ndltd.ncl.edu.tw/handle/33141578768478723083 Computation and Communication Schedule Optimization for Jobs with Shared Data 互享資料工作之計算與通訊排程最佳化 En-Jan Chou 周恩冉 碩士 臺灣大學 資訊工程學研究所 95 Almost every computation job in the cluster or grid systems requires input data in order to find the solution, and the computation cannot proceed without the required data become available. As a result a proper interleaving of data transfer and job execution has a significant impact on the overall efficiency. In this paper we analyze the computational complexity of the shared data job scheduling problem, with and without consideration of storage capacity constraint. We show that if there is an upper bound on the server capacity, the problem is NP-complete, even when each job depends on at most three data. On the other hand, if there is no upper bound on the server capacity, we show that there exists an efficient algorithm that gives optimal job schedule when each job depends on at most two data. We also give an effective heuristic algorithm that gives good schedule for cases where there is no limit on the number of data a job may access. Experimental results indicate that this heuristic algorithm performs very well, and gives near optimal solutions. Pangfeng Liu 劉邦鋒 2007 學位論文 ; thesis 33 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 臺灣大學 === 資訊工程學研究所 === 95 === Almost every computation job in the cluster or grid systems requires input data in order to find the solution, and the computation cannot proceed without the required data become available. As a result a proper interleaving of data transfer and job execution has a significant impact on the overall efficiency. In this paper we analyze the computational complexity of the shared data job scheduling problem, with and without consideration of storage capacity constraint. We show that if there is an upper bound on the server capacity, the problem is NP-complete, even when each job depends on at most three data. On the other hand, if there is no upper bound on the server capacity, we show that there exists an efficient algorithm that gives optimal job schedule when each job depends on at most two data. We also give an effective heuristic algorithm that gives good schedule for cases where there is no limit on the number of data a job may access. Experimental results indicate that this heuristic algorithm performs very well, and gives near optimal solutions.
author2 Pangfeng Liu
author_facet Pangfeng Liu
En-Jan Chou
周恩冉
author En-Jan Chou
周恩冉
spellingShingle En-Jan Chou
周恩冉
Computation and Communication Schedule Optimization for Jobs with Shared Data
author_sort En-Jan Chou
title Computation and Communication Schedule Optimization for Jobs with Shared Data
title_short Computation and Communication Schedule Optimization for Jobs with Shared Data
title_full Computation and Communication Schedule Optimization for Jobs with Shared Data
title_fullStr Computation and Communication Schedule Optimization for Jobs with Shared Data
title_full_unstemmed Computation and Communication Schedule Optimization for Jobs with Shared Data
title_sort computation and communication schedule optimization for jobs with shared data
publishDate 2007
url http://ndltd.ncl.edu.tw/handle/33141578768478723083
work_keys_str_mv AT enjanchou computationandcommunicationscheduleoptimizationforjobswithshareddata
AT zhōuēnrǎn computationandcommunicationscheduleoptimizationforjobswithshareddata
AT enjanchou hùxiǎngzīliàogōngzuòzhījìsuànyǔtōngxùnpáichéngzuìjiāhuà
AT zhōuēnrǎn hùxiǎngzīliàogōngzuòzhījìsuànyǔtōngxùnpáichéngzuìjiāhuà
_version_ 1717745189694472192