A Two-Stage Framework for the Multi-User Multi-Data Center Job Scheduling and Resource Allocation

With the development of artificial intelligence and the Internet of things, the prospects of cloud computing applications have become broader, and the number of users and cloud data centers (CDCs) has exploded. It is a challenge to realize efficient job scheduling and resource allocation of multiple...

Full description

Bibliographic Details
Main Authors: Jianpeng Lin, Delong Cui, Zhiping Peng, Qirui Li, Jieguang He
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9239288/
Description
Summary:With the development of artificial intelligence and the Internet of things, the prospects of cloud computing applications have become broader, and the number of users and cloud data centers (CDCs) has exploded. It is a challenge to realize efficient job scheduling and resource allocation of multiple users and data centers. However, the traditional scheduling model based on heuristic algorithm has some limitations in the complex and changeable cloud environment. In addition, many existing single-agent models rarely consider the multi-objective global optimization of the system. Therefore, this paper proposes a two-stage job scheduling and resource allocation framework that adopts multiple intelligent schedulers to solve the cooperative scheduling problem between job scheduling and resource allocation. A heterogeneous distributed deep learning (HDDL) model is used in the job scheduling stage to schedule multiple jobs to multiple cloud data centers. The deep Q-network (DQN) model is a resource scheduler to deploy virtual machine to physical servers for execution. Extensive numerical results show that both HDDL-baesd job scheduler and DQN-based resource allocator outperform the benchmark algorithm in terms of optimizing energy consumption and job delay. Furthermore, the proposed framework not only can achieve a global near-optimum by achieving local optimization at each stage but also has good scalability and low computation delay.
ISSN:2169-3536