Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning Control

Buildings account for a large proportion of the total energy consumption in many countries and almost half of the energy consumption is caused by the Heating, Ventilation, and air-conditioning (HVAC) systems. The model predictive control of HVAC is a complex task due to the dynamic property of the s...

Full description

Bibliographic Details
Main Authors: Huan Zhao, Junhua Zhao, Ting Shu, Zibin Pan
Format: Article
Language:English
Published: Frontiers Media S.A. 2021-02-01
Series:Frontiers in Energy Research
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fenrg.2020.610518/full
id doaj-d69015406b5c42a0875ec2dd990b1abe
record_format Article
spelling doaj-d69015406b5c42a0875ec2dd990b1abe2021-02-02T16:09:22ZengFrontiers Media S.A.Frontiers in Energy Research2296-598X2021-02-01810.3389/fenrg.2020.610518610518Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning ControlHuan Zhao0Junhua Zhao1Junhua Zhao2Ting Shu3Zibin Pan4School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen, ChinaSchool of Science and Engineering, The Chinese University of Hong Kong, Shenzhen, ChinaShenzhen Research Institute of Big Data, Shenzhen, ChinaGuangdong-Hongkong-Macao Greater Bay Area Weather Research Center for Monitoring Warning and Forecasting, Shenzhen, ChinaSchool of Science and Engineering, The Chinese University of Hong Kong, Shenzhen, ChinaBuildings account for a large proportion of the total energy consumption in many countries and almost half of the energy consumption is caused by the Heating, Ventilation, and air-conditioning (HVAC) systems. The model predictive control of HVAC is a complex task due to the dynamic property of the system and environment, such as temperature and electricity price. Deep reinforcement learning (DRL) is a model-free method that utilizes the “trial and error” mechanism to learn the optimal policy. However, the learning efficiency and learning cost are the main obstacles of the DRL method to practice. To overcome this problem, the hybrid-model-based DRL method is proposed for the HVAC control problem. Firstly, a specific MDPs is defined by considering the energy cost, temperature violation, and action violation. Then the hybrid-model-based DRL method is proposed, which utilizes both the knowledge-driven model and the data-driven model during the whole learning process. Finally, the protection mechanism and adjusting reward methods are used to further reduce the learning cost. The proposed method is tested in a simulation environment using the Australian Energy Market Operator (AEMO) electricity price data and New South Wales temperature data. Simulation results show that 1) the DRL method can reduce the energy cost while maintaining the temperature satisfactory compared to the short term MPC method; 2) the proposed method improves the learning efficiency and reduces the learning cost during the learning process compared to the model-free method.https://www.frontiersin.org/articles/10.3389/fenrg.2020.610518/fulldeep reinforcement learningmodel-based reinforcement learninghybrid modelheating, ventilation, and air-conditioning controldeep deterministic policy gradient
collection DOAJ
language English
format Article
sources DOAJ
author Huan Zhao
Junhua Zhao
Junhua Zhao
Ting Shu
Zibin Pan
spellingShingle Huan Zhao
Junhua Zhao
Junhua Zhao
Ting Shu
Zibin Pan
Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning Control
Frontiers in Energy Research
deep reinforcement learning
model-based reinforcement learning
hybrid model
heating, ventilation, and air-conditioning control
deep deterministic policy gradient
author_facet Huan Zhao
Junhua Zhao
Junhua Zhao
Ting Shu
Zibin Pan
author_sort Huan Zhao
title Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning Control
title_short Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning Control
title_full Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning Control
title_fullStr Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning Control
title_full_unstemmed Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning Control
title_sort hybrid-model-based deep reinforcement learning for heating, ventilation, and air-conditioning control
publisher Frontiers Media S.A.
series Frontiers in Energy Research
issn 2296-598X
publishDate 2021-02-01
description Buildings account for a large proportion of the total energy consumption in many countries and almost half of the energy consumption is caused by the Heating, Ventilation, and air-conditioning (HVAC) systems. The model predictive control of HVAC is a complex task due to the dynamic property of the system and environment, such as temperature and electricity price. Deep reinforcement learning (DRL) is a model-free method that utilizes the “trial and error” mechanism to learn the optimal policy. However, the learning efficiency and learning cost are the main obstacles of the DRL method to practice. To overcome this problem, the hybrid-model-based DRL method is proposed for the HVAC control problem. Firstly, a specific MDPs is defined by considering the energy cost, temperature violation, and action violation. Then the hybrid-model-based DRL method is proposed, which utilizes both the knowledge-driven model and the data-driven model during the whole learning process. Finally, the protection mechanism and adjusting reward methods are used to further reduce the learning cost. The proposed method is tested in a simulation environment using the Australian Energy Market Operator (AEMO) electricity price data and New South Wales temperature data. Simulation results show that 1) the DRL method can reduce the energy cost while maintaining the temperature satisfactory compared to the short term MPC method; 2) the proposed method improves the learning efficiency and reduces the learning cost during the learning process compared to the model-free method.
topic deep reinforcement learning
model-based reinforcement learning
hybrid model
heating, ventilation, and air-conditioning control
deep deterministic policy gradient
url https://www.frontiersin.org/articles/10.3389/fenrg.2020.610518/full
work_keys_str_mv AT huanzhao hybridmodelbaseddeepreinforcementlearningforheatingventilationandairconditioningcontrol
AT junhuazhao hybridmodelbaseddeepreinforcementlearningforheatingventilationandairconditioningcontrol
AT junhuazhao hybridmodelbaseddeepreinforcementlearningforheatingventilationandairconditioningcontrol
AT tingshu hybridmodelbaseddeepreinforcementlearningforheatingventilationandairconditioningcontrol
AT zibinpan hybridmodelbaseddeepreinforcementlearningforheatingventilationandairconditioningcontrol
_version_ 1724293084248801280