Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement Learning

High proportions of asynchronous motors in demand-side have pressured heavily on short-term voltage security of receiving-end power systems. To enhance short-term voltage security, this paper coordinates the optimal outputs of generation and compensation in a multi-objective dynamic optimization mod...

Full description

Bibliographic Details
Main Authors: Zhuoming Deng, Zhilin Lu, Zhifei Guo, Wenfeng Yao, Wenmeng Zhao, Baorong Zhou, Chao Hong
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9000885/
id doaj-69ec55bb5dd54eb58c606061a2d2b5cb
record_format Article
spelling doaj-69ec55bb5dd54eb58c606061a2d2b5cb2021-03-30T02:03:23ZengIEEEIEEE Access2169-35362020-01-018347703478210.1109/ACCESS.2020.29745039000885Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement LearningZhuoming Deng0https://orcid.org/0000-0001-5670-6377Zhilin Lu1https://orcid.org/0000-0002-7256-234XZhifei Guo2https://orcid.org/0000-0002-5169-6357Wenfeng Yao3https://orcid.org/0000-0002-3574-7046Wenmeng Zhao4https://orcid.org/0000-0003-3163-4340Baorong Zhou5https://orcid.org/0000-0002-1944-7633Chao Hong6https://orcid.org/0000-0003-4489-4424Electric Power Research Institute, China Southern Power Grid, Guangzhou, ChinaSchool of Electric Power Engineering, South China University of Technology, Guangzhou, ChinaElectric Power Research Institute, China Southern Power Grid, Guangzhou, ChinaElectric Power Research Institute, China Southern Power Grid, Guangzhou, ChinaElectric Power Research Institute, China Southern Power Grid, Guangzhou, ChinaElectric Power Research Institute, China Southern Power Grid, Guangzhou, ChinaElectric Power Research Institute, China Southern Power Grid, Guangzhou, ChinaHigh proportions of asynchronous motors in demand-side have pressured heavily on short-term voltage security of receiving-end power systems. To enhance short-term voltage security, this paper coordinates the optimal outputs of generation and compensation in a multi-objective dynamic optimization model. With equipment dynamics, network load flows, lower and upper limitations, and security constraints considered, this model simultaneously minimizes two objectives: the expense of control decision and the voltage deviation. The Radau collocation method is employed to handle dynamics, by transforming all differential algebraic equations into algebraic ones. Most importantly, Pareto solutions are obtained through an accelerated multi-objective reinforcement learning (AMORL) method by filtering the dominated solutions. The entire feasible region is partitioned into small independent regions, to eliminate the scope for Pareto solutions. Besides, the AMORL method redefines the state functions and introduces creative state sensitivities, which accelerate the switch from learning to applying, once the agent accumulates sufficient knowledge. Furthermore, Pareto solutions are diversified via introducing some potential solutions. Lastly, the Fuzzy decision-making methodology picks up the tradeoff solution. Case studies are implemented on a practical 748-node power grid, which validate the acceleration and efficiency of the AMORL method. The AMORL method is overall superior to conventional reinforcement learning (RL) method with more optimal non-dominated objective values, much shorter CPU time, and better convergence to accurate values. Moreover, compared with another three state-of-the-art RL methods, the AMORL method takes almost the same CPU time of several seconds, but is slightly superior to the state-of-the-art methods in terms of optimal objective values. Additionally, the calculated values of the AMORL method fit the best with the accurate values during each iteration, resulting in a good convergence.https://ieeexplore.ieee.org/document/9000885/Accelerated multi-objective reinforcement learningdynamic optimizationPareto solutionsshort-term voltage security
collection DOAJ
language English
format Article
sources DOAJ
author Zhuoming Deng
Zhilin Lu
Zhifei Guo
Wenfeng Yao
Wenmeng Zhao
Baorong Zhou
Chao Hong
spellingShingle Zhuoming Deng
Zhilin Lu
Zhifei Guo
Wenfeng Yao
Wenmeng Zhao
Baorong Zhou
Chao Hong
Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement Learning
IEEE Access
Accelerated multi-objective reinforcement learning
dynamic optimization
Pareto solutions
short-term voltage security
author_facet Zhuoming Deng
Zhilin Lu
Zhifei Guo
Wenfeng Yao
Wenmeng Zhao
Baorong Zhou
Chao Hong
author_sort Zhuoming Deng
title Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement Learning
title_short Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement Learning
title_full Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement Learning
title_fullStr Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement Learning
title_full_unstemmed Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement Learning
title_sort coordinated optimization of generation and compensation to enhance short-term voltage security of power systems using accelerated multi-objective reinforcement learning
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description High proportions of asynchronous motors in demand-side have pressured heavily on short-term voltage security of receiving-end power systems. To enhance short-term voltage security, this paper coordinates the optimal outputs of generation and compensation in a multi-objective dynamic optimization model. With equipment dynamics, network load flows, lower and upper limitations, and security constraints considered, this model simultaneously minimizes two objectives: the expense of control decision and the voltage deviation. The Radau collocation method is employed to handle dynamics, by transforming all differential algebraic equations into algebraic ones. Most importantly, Pareto solutions are obtained through an accelerated multi-objective reinforcement learning (AMORL) method by filtering the dominated solutions. The entire feasible region is partitioned into small independent regions, to eliminate the scope for Pareto solutions. Besides, the AMORL method redefines the state functions and introduces creative state sensitivities, which accelerate the switch from learning to applying, once the agent accumulates sufficient knowledge. Furthermore, Pareto solutions are diversified via introducing some potential solutions. Lastly, the Fuzzy decision-making methodology picks up the tradeoff solution. Case studies are implemented on a practical 748-node power grid, which validate the acceleration and efficiency of the AMORL method. The AMORL method is overall superior to conventional reinforcement learning (RL) method with more optimal non-dominated objective values, much shorter CPU time, and better convergence to accurate values. Moreover, compared with another three state-of-the-art RL methods, the AMORL method takes almost the same CPU time of several seconds, but is slightly superior to the state-of-the-art methods in terms of optimal objective values. Additionally, the calculated values of the AMORL method fit the best with the accurate values during each iteration, resulting in a good convergence.
topic Accelerated multi-objective reinforcement learning
dynamic optimization
Pareto solutions
short-term voltage security
url https://ieeexplore.ieee.org/document/9000885/
work_keys_str_mv AT zhuomingdeng coordinatedoptimizationofgenerationandcompensationtoenhanceshorttermvoltagesecurityofpowersystemsusingacceleratedmultiobjectivereinforcementlearning
AT zhilinlu coordinatedoptimizationofgenerationandcompensationtoenhanceshorttermvoltagesecurityofpowersystemsusingacceleratedmultiobjectivereinforcementlearning
AT zhifeiguo coordinatedoptimizationofgenerationandcompensationtoenhanceshorttermvoltagesecurityofpowersystemsusingacceleratedmultiobjectivereinforcementlearning
AT wenfengyao coordinatedoptimizationofgenerationandcompensationtoenhanceshorttermvoltagesecurityofpowersystemsusingacceleratedmultiobjectivereinforcementlearning
AT wenmengzhao coordinatedoptimizationofgenerationandcompensationtoenhanceshorttermvoltagesecurityofpowersystemsusingacceleratedmultiobjectivereinforcementlearning
AT baorongzhou coordinatedoptimizationofgenerationandcompensationtoenhanceshorttermvoltagesecurityofpowersystemsusingacceleratedmultiobjectivereinforcementlearning
AT chaohong coordinatedoptimizationofgenerationandcompensationtoenhanceshorttermvoltagesecurityofpowersystemsusingacceleratedmultiobjectivereinforcementlearning
_version_ 1724185849215582208