Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement Learning

High proportions of asynchronous motors in demand-side have pressured heavily on short-term voltage security of receiving-end power systems. To enhance short-term voltage security, this paper coordinates the optimal outputs of generation and compensation in a multi-objective dynamic optimization mod...

Full description

Bibliographic Details
Main Authors:	Zhuoming Deng, Zhilin Lu, Zhifei Guo, Wenfeng Yao, Wenmeng Zhao, Baorong Zhou, Chao Hong
Format:	Article
Language:	English
Published:	IEEE 2020-01-01
Series:	IEEE Access
Subjects:	Accelerated multi-objective reinforcement learning dynamic optimization Pareto solutions short-term voltage security
Online Access:	https://ieeexplore.ieee.org/document/9000885/

id	doaj-69ec55bb5dd54eb58c606061a2d2b5cb
record_format	Article
spelling	doaj-69ec55bb5dd54eb58c606061a2d2b5cb2021-03-30T02:03:23ZengIEEEIEEE Access2169-35362020-01-018347703478210.1109/ACCESS.2020.29745039000885Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement LearningZhuoming Deng0https://orcid.org/0000-0001-5670-6377Zhilin Lu1https://orcid.org/0000-0002-7256-234XZhifei Guo2https://orcid.org/0000-0002-5169-6357Wenfeng Yao3https://orcid.org/0000-0002-3574-7046Wenmeng Zhao4https://orcid.org/0000-0003-3163-4340Baorong Zhou5https://orcid.org/0000-0002-1944-7633Chao Hong6https://orcid.org/0000-0003-4489-4424Electric Power Research Institute, China Southern Power Grid, Guangzhou, ChinaSchool of Electric Power Engineering, South China University of Technology, Guangzhou, ChinaElectric Power Research Institute, China Southern Power Grid, Guangzhou, ChinaElectric Power Research Institute, China Southern Power Grid, Guangzhou, ChinaElectric Power Research Institute, China Southern Power Grid, Guangzhou, ChinaElectric Power Research Institute, China Southern Power Grid, Guangzhou, ChinaElectric Power Research Institute, China Southern Power Grid, Guangzhou, ChinaHigh proportions of asynchronous motors in demand-side have pressured heavily on short-term voltage security of receiving-end power systems. To enhance short-term voltage security, this paper coordinates the optimal outputs of generation and compensation in a multi-objective dynamic optimization model. With equipment dynamics, network load flows, lower and upper limitations, and security constraints considered, this model simultaneously minimizes two objectives: the expense of control decision and the voltage deviation. The Radau collocation method is employed to handle dynamics, by transforming all differential algebraic equations into algebraic ones. Most importantly, Pareto solutions are obtained through an accelerated multi-objective reinforcement learning (AMORL) method by filtering the dominated solutions. The entire feasible region is partitioned into small independent regions, to eliminate the scope for Pareto solutions. Besides, the AMORL method redefines the state functions and introduces creative state sensitivities, which accelerate the switch from learning to applying, once the agent accumulates sufficient knowledge. Furthermore, Pareto solutions are diversified via introducing some potential solutions. Lastly, the Fuzzy decision-making methodology picks up the tradeoff solution. Case studies are implemented on a practical 748-node power grid, which validate the acceleration and efficiency of the AMORL method. The AMORL method is overall superior to conventional reinforcement learning (RL) method with more optimal non-dominated objective values, much shorter CPU time, and better convergence to accurate values. Moreover, compared with another three state-of-the-art RL methods, the AMORL method takes almost the same CPU time of several seconds, but is slightly superior to the state-of-the-art methods in terms of optimal objective values. Additionally, the calculated values of the AMORL method fit the best with the accurate values during each iteration, resulting in a good convergence.https://ieeexplore.ieee.org/document/9000885/Accelerated multi-objective reinforcement learningdynamic optimizationPareto solutionsshort-term voltage security
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Zhuoming Deng Zhilin Lu Zhifei Guo Wenfeng Yao Wenmeng Zhao Baorong Zhou Chao Hong
spellingShingle	Zhuoming Deng Zhilin Lu Zhifei Guo Wenfeng Yao Wenmeng Zhao Baorong Zhou Chao Hong Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement Learning IEEE Access Accelerated multi-objective reinforcement learning dynamic optimization Pareto solutions short-term voltage security
author_facet	Zhuoming Deng Zhilin Lu Zhifei Guo Wenfeng Yao Wenmeng Zhao Baorong Zhou Chao Hong
author_sort	Zhuoming Deng
title	Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement Learning
title_short	Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement Learning
title_full	Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement Learning
title_fullStr	Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement Learning
title_full_unstemmed	Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement Learning
title_sort	coordinated optimization of generation and compensation to enhance short-term voltage security of power systems using accelerated multi-objective reinforcement learning
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2020-01-01
description	High proportions of asynchronous motors in demand-side have pressured heavily on short-term voltage security of receiving-end power systems. To enhance short-term voltage security, this paper coordinates the optimal outputs of generation and compensation in a multi-objective dynamic optimization model. With equipment dynamics, network load flows, lower and upper limitations, and security constraints considered, this model simultaneously minimizes two objectives: the expense of control decision and the voltage deviation. The Radau collocation method is employed to handle dynamics, by transforming all differential algebraic equations into algebraic ones. Most importantly, Pareto solutions are obtained through an accelerated multi-objective reinforcement learning (AMORL) method by filtering the dominated solutions. The entire feasible region is partitioned into small independent regions, to eliminate the scope for Pareto solutions. Besides, the AMORL method redefines the state functions and introduces creative state sensitivities, which accelerate the switch from learning to applying, once the agent accumulates sufficient knowledge. Furthermore, Pareto solutions are diversified via introducing some potential solutions. Lastly, the Fuzzy decision-making methodology picks up the tradeoff solution. Case studies are implemented on a practical 748-node power grid, which validate the acceleration and efficiency of the AMORL method. The AMORL method is overall superior to conventional reinforcement learning (RL) method with more optimal non-dominated objective values, much shorter CPU time, and better convergence to accurate values. Moreover, compared with another three state-of-the-art RL methods, the AMORL method takes almost the same CPU time of several seconds, but is slightly superior to the state-of-the-art methods in terms of optimal objective values. Additionally, the calculated values of the AMORL method fit the best with the accurate values during each iteration, resulting in a good convergence.
topic	Accelerated multi-objective reinforcement learning dynamic optimization Pareto solutions short-term voltage security
url	https://ieeexplore.ieee.org/document/9000885/
work_keys_str_mv	AT zhuomingdeng coordinatedoptimizationofgenerationandcompensationtoenhanceshorttermvoltagesecurityofpowersystemsusingacceleratedmultiobjectivereinforcementlearning AT zhilinlu coordinatedoptimizationofgenerationandcompensationtoenhanceshorttermvoltagesecurityofpowersystemsusingacceleratedmultiobjectivereinforcementlearning AT zhifeiguo coordinatedoptimizationofgenerationandcompensationtoenhanceshorttermvoltagesecurityofpowersystemsusingacceleratedmultiobjectivereinforcementlearning AT wenfengyao coordinatedoptimizationofgenerationandcompensationtoenhanceshorttermvoltagesecurityofpowersystemsusingacceleratedmultiobjectivereinforcementlearning AT wenmengzhao coordinatedoptimizationofgenerationandcompensationtoenhanceshorttermvoltagesecurityofpowersystemsusingacceleratedmultiobjectivereinforcementlearning AT baorongzhou coordinatedoptimizationofgenerationandcompensationtoenhanceshorttermvoltagesecurityofpowersystemsusingacceleratedmultiobjectivereinforcementlearning AT chaohong coordinatedoptimizationofgenerationandcompensationtoenhanceshorttermvoltagesecurityofpowersystemsusingacceleratedmultiobjectivereinforcementlearning
_version_	1724185849215582208

Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement Learning

Similar Items