Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning Control

Buildings account for a large proportion of the total energy consumption in many countries and almost half of the energy consumption is caused by the Heating, Ventilation, and air-conditioning (HVAC) systems. The model predictive control of HVAC is a complex task due to the dynamic property of the s...

Full description

Bibliographic Details
Main Authors:	Huan Zhao, Junhua Zhao, Ting Shu, Zibin Pan
Format:	Article
Language:	English
Published:	Frontiers Media S.A. 2021-02-01
Series:	Frontiers in Energy Research
Subjects:	deep reinforcement learning model-based reinforcement learning hybrid model heating, ventilation, and air-conditioning control deep deterministic policy gradient
Online Access:	https://www.frontiersin.org/articles/10.3389/fenrg.2020.610518/full

id	doaj-d69015406b5c42a0875ec2dd990b1abe
record_format	Article
spelling	doaj-d69015406b5c42a0875ec2dd990b1abe2021-02-02T16:09:22ZengFrontiers Media S.A.Frontiers in Energy Research2296-598X2021-02-01810.3389/fenrg.2020.610518610518Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning ControlHuan Zhao0Junhua Zhao1Junhua Zhao2Ting Shu3Zibin Pan4School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen, ChinaSchool of Science and Engineering, The Chinese University of Hong Kong, Shenzhen, ChinaShenzhen Research Institute of Big Data, Shenzhen, ChinaGuangdong-Hongkong-Macao Greater Bay Area Weather Research Center for Monitoring Warning and Forecasting, Shenzhen, ChinaSchool of Science and Engineering, The Chinese University of Hong Kong, Shenzhen, ChinaBuildings account for a large proportion of the total energy consumption in many countries and almost half of the energy consumption is caused by the Heating, Ventilation, and air-conditioning (HVAC) systems. The model predictive control of HVAC is a complex task due to the dynamic property of the system and environment, such as temperature and electricity price. Deep reinforcement learning (DRL) is a model-free method that utilizes the “trial and error” mechanism to learn the optimal policy. However, the learning efficiency and learning cost are the main obstacles of the DRL method to practice. To overcome this problem, the hybrid-model-based DRL method is proposed for the HVAC control problem. Firstly, a specific MDPs is defined by considering the energy cost, temperature violation, and action violation. Then the hybrid-model-based DRL method is proposed, which utilizes both the knowledge-driven model and the data-driven model during the whole learning process. Finally, the protection mechanism and adjusting reward methods are used to further reduce the learning cost. The proposed method is tested in a simulation environment using the Australian Energy Market Operator (AEMO) electricity price data and New South Wales temperature data. Simulation results show that 1) the DRL method can reduce the energy cost while maintaining the temperature satisfactory compared to the short term MPC method; 2) the proposed method improves the learning efficiency and reduces the learning cost during the learning process compared to the model-free method.https://www.frontiersin.org/articles/10.3389/fenrg.2020.610518/fulldeep reinforcement learningmodel-based reinforcement learninghybrid modelheating, ventilation, and air-conditioning controldeep deterministic policy gradient
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Huan Zhao Junhua Zhao Junhua Zhao Ting Shu Zibin Pan
spellingShingle	Huan Zhao Junhua Zhao Junhua Zhao Ting Shu Zibin Pan Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning Control Frontiers in Energy Research deep reinforcement learning model-based reinforcement learning hybrid model heating, ventilation, and air-conditioning control deep deterministic policy gradient
author_facet	Huan Zhao Junhua Zhao Junhua Zhao Ting Shu Zibin Pan
author_sort	Huan Zhao
title	Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning Control
title_short	Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning Control
title_full	Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning Control
title_fullStr	Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning Control
title_full_unstemmed	Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning Control
title_sort	hybrid-model-based deep reinforcement learning for heating, ventilation, and air-conditioning control
publisher	Frontiers Media S.A.
series	Frontiers in Energy Research
issn	2296-598X
publishDate	2021-02-01
description	Buildings account for a large proportion of the total energy consumption in many countries and almost half of the energy consumption is caused by the Heating, Ventilation, and air-conditioning (HVAC) systems. The model predictive control of HVAC is a complex task due to the dynamic property of the system and environment, such as temperature and electricity price. Deep reinforcement learning (DRL) is a model-free method that utilizes the “trial and error” mechanism to learn the optimal policy. However, the learning efficiency and learning cost are the main obstacles of the DRL method to practice. To overcome this problem, the hybrid-model-based DRL method is proposed for the HVAC control problem. Firstly, a specific MDPs is defined by considering the energy cost, temperature violation, and action violation. Then the hybrid-model-based DRL method is proposed, which utilizes both the knowledge-driven model and the data-driven model during the whole learning process. Finally, the protection mechanism and adjusting reward methods are used to further reduce the learning cost. The proposed method is tested in a simulation environment using the Australian Energy Market Operator (AEMO) electricity price data and New South Wales temperature data. Simulation results show that 1) the DRL method can reduce the energy cost while maintaining the temperature satisfactory compared to the short term MPC method; 2) the proposed method improves the learning efficiency and reduces the learning cost during the learning process compared to the model-free method.
topic	deep reinforcement learning model-based reinforcement learning hybrid model heating, ventilation, and air-conditioning control deep deterministic policy gradient
url	https://www.frontiersin.org/articles/10.3389/fenrg.2020.610518/full
work_keys_str_mv	AT huanzhao hybridmodelbaseddeepreinforcementlearningforheatingventilationandairconditioningcontrol AT junhuazhao hybridmodelbaseddeepreinforcementlearningforheatingventilationandairconditioningcontrol AT junhuazhao hybridmodelbaseddeepreinforcementlearningforheatingventilationandairconditioningcontrol AT tingshu hybridmodelbaseddeepreinforcementlearningforheatingventilationandairconditioningcontrol AT zibinpan hybridmodelbaseddeepreinforcementlearningforheatingventilationandairconditioningcontrol
_version_	1724293084248801280

Hybrid-Model-Based Deep Reinforcement Learning for Heating, Ventilation, and Air-Conditioning Control

Similar Items