Efficient Inter-Device Task Scheduling Schemes for Multi-Device Co-Processing of Data-Parallel Kernels on Heterogeneous Systems

Heterogeneous systems consisting of multiple multi-core CPUs and many-core accelerators have recently come into wide use, and more and more parallel applications are developed in such a heterogeneous system. To fully utilize multiple compute devices to cooperatively and concurrently execute data-par...

Full description

Bibliographic Details
Main Authors: Lanjun Wan, Weihua Zheng, Xinpan Yuan
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9406583/
id doaj-98ed39d48a2b4dc397b21ed5de06e0ef
record_format Article
spelling doaj-98ed39d48a2b4dc397b21ed5de06e0ef2021-04-23T23:00:25ZengIEEEIEEE Access2169-35362021-01-019599685997810.1109/ACCESS.2021.30739559406583Efficient Inter-Device Task Scheduling Schemes for Multi-Device Co-Processing of Data-Parallel Kernels on Heterogeneous SystemsLanjun Wan0https://orcid.org/0000-0001-7236-3589Weihua Zheng1Xinpan Yuan2School of Computer Science, Hunan University of Technology, Zhuzhou, ChinaCollege of Electrical and Information Engineering, Hunan University of Technology, Zhuzhou, ChinaSchool of Computer Science, Hunan University of Technology, Zhuzhou, ChinaHeterogeneous systems consisting of multiple multi-core CPUs and many-core accelerators have recently come into wide use, and more and more parallel applications are developed in such a heterogeneous system. To fully utilize multiple compute devices to cooperatively and concurrently execute data-parallel kernels on heterogeneous systems, a feedback-based dynamic and elastic task scheduling scheme is proposed, which can provide a better load balance, a greater device utilization, and a lower scheduling overhead by flexibly and dynamically adjusting the workload between devices during execution. The proposed method is more suitable for data-parallel kernels whose computation and data are uniformly distributed, but is less suitable for data-parallel kernels whose computation and data are non-uniformly distributed. Thus, an asynchronous-based dynamic and elastic task scheduling scheme is proposed, which can avoid device underutilization, load imbalance across devices, and frequent kernel launches, inter-device data transfers and inter-device synchronizations by dynamically adjusting the chunk size according to the performance change during runtime. A series of experiments are conducted with 8 representative parallel applications on a hybrid CPU-GPU-MIC system, the results show that the proposed two inter-device task scheduling schemes can achieve the efficient CPU-GPU-MIC co-processing of different parallel applications by effectively partitioning work across devices.https://ieeexplore.ieee.org/document/9406583/Data-parallel kernelsheterogeneous systemsmany-core acceleratorsmulti-core CPUsmulti-device co-processingparallel applications
collection DOAJ
language English
format Article
sources DOAJ
author Lanjun Wan
Weihua Zheng
Xinpan Yuan
spellingShingle Lanjun Wan
Weihua Zheng
Xinpan Yuan
Efficient Inter-Device Task Scheduling Schemes for Multi-Device Co-Processing of Data-Parallel Kernels on Heterogeneous Systems
IEEE Access
Data-parallel kernels
heterogeneous systems
many-core accelerators
multi-core CPUs
multi-device co-processing
parallel applications
author_facet Lanjun Wan
Weihua Zheng
Xinpan Yuan
author_sort Lanjun Wan
title Efficient Inter-Device Task Scheduling Schemes for Multi-Device Co-Processing of Data-Parallel Kernels on Heterogeneous Systems
title_short Efficient Inter-Device Task Scheduling Schemes for Multi-Device Co-Processing of Data-Parallel Kernels on Heterogeneous Systems
title_full Efficient Inter-Device Task Scheduling Schemes for Multi-Device Co-Processing of Data-Parallel Kernels on Heterogeneous Systems
title_fullStr Efficient Inter-Device Task Scheduling Schemes for Multi-Device Co-Processing of Data-Parallel Kernels on Heterogeneous Systems
title_full_unstemmed Efficient Inter-Device Task Scheduling Schemes for Multi-Device Co-Processing of Data-Parallel Kernels on Heterogeneous Systems
title_sort efficient inter-device task scheduling schemes for multi-device co-processing of data-parallel kernels on heterogeneous systems
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description Heterogeneous systems consisting of multiple multi-core CPUs and many-core accelerators have recently come into wide use, and more and more parallel applications are developed in such a heterogeneous system. To fully utilize multiple compute devices to cooperatively and concurrently execute data-parallel kernels on heterogeneous systems, a feedback-based dynamic and elastic task scheduling scheme is proposed, which can provide a better load balance, a greater device utilization, and a lower scheduling overhead by flexibly and dynamically adjusting the workload between devices during execution. The proposed method is more suitable for data-parallel kernels whose computation and data are uniformly distributed, but is less suitable for data-parallel kernels whose computation and data are non-uniformly distributed. Thus, an asynchronous-based dynamic and elastic task scheduling scheme is proposed, which can avoid device underutilization, load imbalance across devices, and frequent kernel launches, inter-device data transfers and inter-device synchronizations by dynamically adjusting the chunk size according to the performance change during runtime. A series of experiments are conducted with 8 representative parallel applications on a hybrid CPU-GPU-MIC system, the results show that the proposed two inter-device task scheduling schemes can achieve the efficient CPU-GPU-MIC co-processing of different parallel applications by effectively partitioning work across devices.
topic Data-parallel kernels
heterogeneous systems
many-core accelerators
multi-core CPUs
multi-device co-processing
parallel applications
url https://ieeexplore.ieee.org/document/9406583/
work_keys_str_mv AT lanjunwan efficientinterdevicetaskschedulingschemesformultidevicecoprocessingofdataparallelkernelsonheterogeneoussystems
AT weihuazheng efficientinterdevicetaskschedulingschemesformultidevicecoprocessingofdataparallelkernelsonheterogeneoussystems
AT xinpanyuan efficientinterdevicetaskschedulingschemesformultidevicecoprocessingofdataparallelkernelsonheterogeneoussystems
_version_ 1721512324017160192