A reinforcement learning algorithm for operations planning of a hydroelectric power multireservoir system

The main objective of reservoir operations planning is to determine the optimum operation policies that maximize the expected value of the system resources over the planning horizon. This control problem is challenged with different sources of uncertainty that a reservoir system planner has to de...

Full description

Bibliographic Details
Main Author: Abdalla, Alaa Eatzaz
Language:English
Published: University of British Columbia 2011
Online Access:http://hdl.handle.net/2429/30702
id ndltd-UBC-oai-circle.library.ubc.ca-2429-30702
record_format oai_dc
spelling ndltd-UBC-oai-circle.library.ubc.ca-2429-307022018-01-05T17:45:38Z A reinforcement learning algorithm for operations planning of a hydroelectric power multireservoir system Abdalla, Alaa Eatzaz The main objective of reservoir operations planning is to determine the optimum operation policies that maximize the expected value of the system resources over the planning horizon. This control problem is challenged with different sources of uncertainty that a reservoir system planner has to deal with. In the reservoir operations planning problem, there is a trade-off between the marginal value of water in storage and the electricity market price. The marginal value of water is uncertain too and is largely dependent on storage in the reservoir and storage in other reservoirs as well. The challenge here is how to deal with this large scale multireservoir problem under the encountered uncertainties. In this thesis, the use of a novel methodology to establish a good approximation of the optimal control of a large-scale hydroelectric power system applying Reinforcement Learning (RL) is presented. RL is an artificial intelligence method to machine learning that offers key advantages in handling problems that are too large to be solved by conventional dynamic programming methods. In this approach, a control agent progressively learns the optimal strategies that maximize rewards through interaction with a dynamic environment. This thesis introduces the main concepts and computational aspects of using RL for the multireservoir operations planning problem. A scenario generation-moment matching technique was adopted to generate a set of scenarios for the natural river inflows, electricity load, and market prices random variables. In this way, the statistical properties of the original distributions are preserved. The developed reinforcement learning reservoir optimization model (RLROM) was successfully applied to the BC Hydro main reservoirs on the Peace and Columbia Rivers. The model was used to: derive optimal control policies for this multireservoir system, to estimate the value of water in storage, and to establish the marginal value of water / energy. The RLROM outputs were compared to the classical method of optimizing reservoir operations, namely, stochastic dynamic programming (SDP), and the results for one and two reservoir systems were identical. The results suggests that the RL model is much more efficient at handling large scale reservoir operations problems and can give a very good approximate solution to this complex problem. Applied Science, Faculty of Civil Engineering, Department of Graduate 2011-01-19T23:27:30Z 2011-01-19T23:27:30Z 2007 Text Thesis/Dissertation http://hdl.handle.net/2429/30702 eng For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use. University of British Columbia
collection NDLTD
language English
sources NDLTD
description The main objective of reservoir operations planning is to determine the optimum operation policies that maximize the expected value of the system resources over the planning horizon. This control problem is challenged with different sources of uncertainty that a reservoir system planner has to deal with. In the reservoir operations planning problem, there is a trade-off between the marginal value of water in storage and the electricity market price. The marginal value of water is uncertain too and is largely dependent on storage in the reservoir and storage in other reservoirs as well. The challenge here is how to deal with this large scale multireservoir problem under the encountered uncertainties. In this thesis, the use of a novel methodology to establish a good approximation of the optimal control of a large-scale hydroelectric power system applying Reinforcement Learning (RL) is presented. RL is an artificial intelligence method to machine learning that offers key advantages in handling problems that are too large to be solved by conventional dynamic programming methods. In this approach, a control agent progressively learns the optimal strategies that maximize rewards through interaction with a dynamic environment. This thesis introduces the main concepts and computational aspects of using RL for the multireservoir operations planning problem. A scenario generation-moment matching technique was adopted to generate a set of scenarios for the natural river inflows, electricity load, and market prices random variables. In this way, the statistical properties of the original distributions are preserved. The developed reinforcement learning reservoir optimization model (RLROM) was successfully applied to the BC Hydro main reservoirs on the Peace and Columbia Rivers. The model was used to: derive optimal control policies for this multireservoir system, to estimate the value of water in storage, and to establish the marginal value of water / energy. The RLROM outputs were compared to the classical method of optimizing reservoir operations, namely, stochastic dynamic programming (SDP), and the results for one and two reservoir systems were identical. The results suggests that the RL model is much more efficient at handling large scale reservoir operations problems and can give a very good approximate solution to this complex problem. === Applied Science, Faculty of === Civil Engineering, Department of === Graduate
author Abdalla, Alaa Eatzaz
spellingShingle Abdalla, Alaa Eatzaz
A reinforcement learning algorithm for operations planning of a hydroelectric power multireservoir system
author_facet Abdalla, Alaa Eatzaz
author_sort Abdalla, Alaa Eatzaz
title A reinforcement learning algorithm for operations planning of a hydroelectric power multireservoir system
title_short A reinforcement learning algorithm for operations planning of a hydroelectric power multireservoir system
title_full A reinforcement learning algorithm for operations planning of a hydroelectric power multireservoir system
title_fullStr A reinforcement learning algorithm for operations planning of a hydroelectric power multireservoir system
title_full_unstemmed A reinforcement learning algorithm for operations planning of a hydroelectric power multireservoir system
title_sort reinforcement learning algorithm for operations planning of a hydroelectric power multireservoir system
publisher University of British Columbia
publishDate 2011
url http://hdl.handle.net/2429/30702
work_keys_str_mv AT abdallaalaaeatzaz areinforcementlearningalgorithmforoperationsplanningofahydroelectricpowermultireservoirsystem
AT abdallaalaaeatzaz reinforcementlearningalgorithmforoperationsplanningofahydroelectricpowermultireservoirsystem
_version_ 1718594177548156928