Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning Control
Buildings account for a large proportion of the total energy consumption in many countries and almost half of the energy consumption is caused by the Heating, Ventilation, and air-conditioning (HVAC) systems. The model predictive control of HVAC is a complex task due to the dynamic property of the s...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2021-02-01
|
Series: | Frontiers in Energy Research |
Subjects: | |
Online Access: | https://www.frontiersin.org/articles/10.3389/fenrg.2020.610518/full |
id |
doaj-d69015406b5c42a0875ec2dd990b1abe |
---|---|
record_format |
Article |
spelling |
doaj-d69015406b5c42a0875ec2dd990b1abe2021-02-02T16:09:22ZengFrontiers Media S.A.Frontiers in Energy Research2296-598X2021-02-01810.3389/fenrg.2020.610518610518Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning ControlHuan Zhao0Junhua Zhao1Junhua Zhao2Ting Shu3Zibin Pan4School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen, ChinaSchool of Science and Engineering, The Chinese University of Hong Kong, Shenzhen, ChinaShenzhen Research Institute of Big Data, Shenzhen, ChinaGuangdong-Hongkong-Macao Greater Bay Area Weather Research Center for Monitoring Warning and Forecasting, Shenzhen, ChinaSchool of Science and Engineering, The Chinese University of Hong Kong, Shenzhen, ChinaBuildings account for a large proportion of the total energy consumption in many countries and almost half of the energy consumption is caused by the Heating, Ventilation, and air-conditioning (HVAC) systems. The model predictive control of HVAC is a complex task due to the dynamic property of the system and environment, such as temperature and electricity price. Deep reinforcement learning (DRL) is a model-free method that utilizes the “trial and error” mechanism to learn the optimal policy. However, the learning efficiency and learning cost are the main obstacles of the DRL method to practice. To overcome this problem, the hybrid-model-based DRL method is proposed for the HVAC control problem. Firstly, a specific MDPs is defined by considering the energy cost, temperature violation, and action violation. Then the hybrid-model-based DRL method is proposed, which utilizes both the knowledge-driven model and the data-driven model during the whole learning process. Finally, the protection mechanism and adjusting reward methods are used to further reduce the learning cost. The proposed method is tested in a simulation environment using the Australian Energy Market Operator (AEMO) electricity price data and New South Wales temperature data. Simulation results show that 1) the DRL method can reduce the energy cost while maintaining the temperature satisfactory compared to the short term MPC method; 2) the proposed method improves the learning efficiency and reduces the learning cost during the learning process compared to the model-free method.https://www.frontiersin.org/articles/10.3389/fenrg.2020.610518/fulldeep reinforcement learningmodel-based reinforcement learninghybrid modelheating, ventilation, and air-conditioning controldeep deterministic policy gradient |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Huan Zhao Junhua Zhao Junhua Zhao Ting Shu Zibin Pan |
spellingShingle |
Huan Zhao Junhua Zhao Junhua Zhao Ting Shu Zibin Pan Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning Control Frontiers in Energy Research deep reinforcement learning model-based reinforcement learning hybrid model heating, ventilation, and air-conditioning control deep deterministic policy gradient |
author_facet |
Huan Zhao Junhua Zhao Junhua Zhao Ting Shu Zibin Pan |
author_sort |
Huan Zhao |
title |
Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning Control |
title_short |
Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning Control |
title_full |
Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning Control |
title_fullStr |
Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning Control |
title_full_unstemmed |
Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning Control |
title_sort |
hybrid-model-based deep reinforcement learning for heating, ventilation, and air-conditioning control |
publisher |
Frontiers Media S.A. |
series |
Frontiers in Energy Research |
issn |
2296-598X |
publishDate |
2021-02-01 |
description |
Buildings account for a large proportion of the total energy consumption in many countries and almost half of the energy consumption is caused by the Heating, Ventilation, and air-conditioning (HVAC) systems. The model predictive control of HVAC is a complex task due to the dynamic property of the system and environment, such as temperature and electricity price. Deep reinforcement learning (DRL) is a model-free method that utilizes the “trial and error” mechanism to learn the optimal policy. However, the learning efficiency and learning cost are the main obstacles of the DRL method to practice. To overcome this problem, the hybrid-model-based DRL method is proposed for the HVAC control problem. Firstly, a specific MDPs is defined by considering the energy cost, temperature violation, and action violation. Then the hybrid-model-based DRL method is proposed, which utilizes both the knowledge-driven model and the data-driven model during the whole learning process. Finally, the protection mechanism and adjusting reward methods are used to further reduce the learning cost. The proposed method is tested in a simulation environment using the Australian Energy Market Operator (AEMO) electricity price data and New South Wales temperature data. Simulation results show that 1) the DRL method can reduce the energy cost while maintaining the temperature satisfactory compared to the short term MPC method; 2) the proposed method improves the learning efficiency and reduces the learning cost during the learning process compared to the model-free method. |
topic |
deep reinforcement learning model-based reinforcement learning hybrid model heating, ventilation, and air-conditioning control deep deterministic policy gradient |
url |
https://www.frontiersin.org/articles/10.3389/fenrg.2020.610518/full |
work_keys_str_mv |
AT huanzhao hybridmodelbaseddeepreinforcementlearningforheatingventilationandairconditioningcontrol AT junhuazhao hybridmodelbaseddeepreinforcementlearningforheatingventilationandairconditioningcontrol AT junhuazhao hybridmodelbaseddeepreinforcementlearningforheatingventilationandairconditioningcontrol AT tingshu hybridmodelbaseddeepreinforcementlearningforheatingventilationandairconditioningcontrol AT zibinpan hybridmodelbaseddeepreinforcementlearningforheatingventilationandairconditioningcontrol |
_version_ |
1724293084248801280 |