Efficient Inter-Device Task Scheduling Schemes for Multi-Device Co-Processing of Data-Parallel Kernels on Heterogeneous Systems
Heterogeneous systems consisting of multiple multi-core CPUs and many-core accelerators have recently come into wide use, and more and more parallel applications are developed in such a heterogeneous system. To fully utilize multiple compute devices to cooperatively and concurrently execute data-par...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2021-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9406583/ |
id |
doaj-98ed39d48a2b4dc397b21ed5de06e0ef |
---|---|
record_format |
Article |
spelling |
doaj-98ed39d48a2b4dc397b21ed5de06e0ef2021-04-23T23:00:25ZengIEEEIEEE Access2169-35362021-01-019599685997810.1109/ACCESS.2021.30739559406583Efficient Inter-Device Task Scheduling Schemes for Multi-Device Co-Processing of Data-Parallel Kernels on Heterogeneous SystemsLanjun Wan0https://orcid.org/0000-0001-7236-3589Weihua Zheng1Xinpan Yuan2School of Computer Science, Hunan University of Technology, Zhuzhou, ChinaCollege of Electrical and Information Engineering, Hunan University of Technology, Zhuzhou, ChinaSchool of Computer Science, Hunan University of Technology, Zhuzhou, ChinaHeterogeneous systems consisting of multiple multi-core CPUs and many-core accelerators have recently come into wide use, and more and more parallel applications are developed in such a heterogeneous system. To fully utilize multiple compute devices to cooperatively and concurrently execute data-parallel kernels on heterogeneous systems, a feedback-based dynamic and elastic task scheduling scheme is proposed, which can provide a better load balance, a greater device utilization, and a lower scheduling overhead by flexibly and dynamically adjusting the workload between devices during execution. The proposed method is more suitable for data-parallel kernels whose computation and data are uniformly distributed, but is less suitable for data-parallel kernels whose computation and data are non-uniformly distributed. Thus, an asynchronous-based dynamic and elastic task scheduling scheme is proposed, which can avoid device underutilization, load imbalance across devices, and frequent kernel launches, inter-device data transfers and inter-device synchronizations by dynamically adjusting the chunk size according to the performance change during runtime. A series of experiments are conducted with 8 representative parallel applications on a hybrid CPU-GPU-MIC system, the results show that the proposed two inter-device task scheduling schemes can achieve the efficient CPU-GPU-MIC co-processing of different parallel applications by effectively partitioning work across devices.https://ieeexplore.ieee.org/document/9406583/Data-parallel kernelsheterogeneous systemsmany-core acceleratorsmulti-core CPUsmulti-device co-processingparallel applications |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Lanjun Wan Weihua Zheng Xinpan Yuan |
spellingShingle |
Lanjun Wan Weihua Zheng Xinpan Yuan Efficient Inter-Device Task Scheduling Schemes for Multi-Device Co-Processing of Data-Parallel Kernels on Heterogeneous Systems IEEE Access Data-parallel kernels heterogeneous systems many-core accelerators multi-core CPUs multi-device co-processing parallel applications |
author_facet |
Lanjun Wan Weihua Zheng Xinpan Yuan |
author_sort |
Lanjun Wan |
title |
Efficient Inter-Device Task Scheduling Schemes for Multi-Device Co-Processing of Data-Parallel Kernels on Heterogeneous Systems |
title_short |
Efficient Inter-Device Task Scheduling Schemes for Multi-Device Co-Processing of Data-Parallel Kernels on Heterogeneous Systems |
title_full |
Efficient Inter-Device Task Scheduling Schemes for Multi-Device Co-Processing of Data-Parallel Kernels on Heterogeneous Systems |
title_fullStr |
Efficient Inter-Device Task Scheduling Schemes for Multi-Device Co-Processing of Data-Parallel Kernels on Heterogeneous Systems |
title_full_unstemmed |
Efficient Inter-Device Task Scheduling Schemes for Multi-Device Co-Processing of Data-Parallel Kernels on Heterogeneous Systems |
title_sort |
efficient inter-device task scheduling schemes for multi-device co-processing of data-parallel kernels on heterogeneous systems |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2021-01-01 |
description |
Heterogeneous systems consisting of multiple multi-core CPUs and many-core accelerators have recently come into wide use, and more and more parallel applications are developed in such a heterogeneous system. To fully utilize multiple compute devices to cooperatively and concurrently execute data-parallel kernels on heterogeneous systems, a feedback-based dynamic and elastic task scheduling scheme is proposed, which can provide a better load balance, a greater device utilization, and a lower scheduling overhead by flexibly and dynamically adjusting the workload between devices during execution. The proposed method is more suitable for data-parallel kernels whose computation and data are uniformly distributed, but is less suitable for data-parallel kernels whose computation and data are non-uniformly distributed. Thus, an asynchronous-based dynamic and elastic task scheduling scheme is proposed, which can avoid device underutilization, load imbalance across devices, and frequent kernel launches, inter-device data transfers and inter-device synchronizations by dynamically adjusting the chunk size according to the performance change during runtime. A series of experiments are conducted with 8 representative parallel applications on a hybrid CPU-GPU-MIC system, the results show that the proposed two inter-device task scheduling schemes can achieve the efficient CPU-GPU-MIC co-processing of different parallel applications by effectively partitioning work across devices. |
topic |
Data-parallel kernels heterogeneous systems many-core accelerators multi-core CPUs multi-device co-processing parallel applications |
url |
https://ieeexplore.ieee.org/document/9406583/ |
work_keys_str_mv |
AT lanjunwan efficientinterdevicetaskschedulingschemesformultidevicecoprocessingofdataparallelkernelsonheterogeneoussystems AT weihuazheng efficientinterdevicetaskschedulingschemesformultidevicecoprocessingofdataparallelkernelsonheterogeneoussystems AT xinpanyuan efficientinterdevicetaskschedulingschemesformultidevicecoprocessingofdataparallelkernelsonheterogeneoussystems |
_version_ |
1721512324017160192 |