Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement Learning
High proportions of asynchronous motors in demand-side have pressured heavily on short-term voltage security of receiving-end power systems. To enhance short-term voltage security, this paper coordinates the optimal outputs of generation and compensation in a multi-objective dynamic optimization mod...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9000885/ |
id |
doaj-69ec55bb5dd54eb58c606061a2d2b5cb |
---|---|
record_format |
Article |
spelling |
doaj-69ec55bb5dd54eb58c606061a2d2b5cb2021-03-30T02:03:23ZengIEEEIEEE Access2169-35362020-01-018347703478210.1109/ACCESS.2020.29745039000885Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement LearningZhuoming Deng0https://orcid.org/0000-0001-5670-6377Zhilin Lu1https://orcid.org/0000-0002-7256-234XZhifei Guo2https://orcid.org/0000-0002-5169-6357Wenfeng Yao3https://orcid.org/0000-0002-3574-7046Wenmeng Zhao4https://orcid.org/0000-0003-3163-4340Baorong Zhou5https://orcid.org/0000-0002-1944-7633Chao Hong6https://orcid.org/0000-0003-4489-4424Electric Power Research Institute, China Southern Power Grid, Guangzhou, ChinaSchool of Electric Power Engineering, South China University of Technology, Guangzhou, ChinaElectric Power Research Institute, China Southern Power Grid, Guangzhou, ChinaElectric Power Research Institute, China Southern Power Grid, Guangzhou, ChinaElectric Power Research Institute, China Southern Power Grid, Guangzhou, ChinaElectric Power Research Institute, China Southern Power Grid, Guangzhou, ChinaElectric Power Research Institute, China Southern Power Grid, Guangzhou, ChinaHigh proportions of asynchronous motors in demand-side have pressured heavily on short-term voltage security of receiving-end power systems. To enhance short-term voltage security, this paper coordinates the optimal outputs of generation and compensation in a multi-objective dynamic optimization model. With equipment dynamics, network load flows, lower and upper limitations, and security constraints considered, this model simultaneously minimizes two objectives: the expense of control decision and the voltage deviation. The Radau collocation method is employed to handle dynamics, by transforming all differential algebraic equations into algebraic ones. Most importantly, Pareto solutions are obtained through an accelerated multi-objective reinforcement learning (AMORL) method by filtering the dominated solutions. The entire feasible region is partitioned into small independent regions, to eliminate the scope for Pareto solutions. Besides, the AMORL method redefines the state functions and introduces creative state sensitivities, which accelerate the switch from learning to applying, once the agent accumulates sufficient knowledge. Furthermore, Pareto solutions are diversified via introducing some potential solutions. Lastly, the Fuzzy decision-making methodology picks up the tradeoff solution. Case studies are implemented on a practical 748-node power grid, which validate the acceleration and efficiency of the AMORL method. The AMORL method is overall superior to conventional reinforcement learning (RL) method with more optimal non-dominated objective values, much shorter CPU time, and better convergence to accurate values. Moreover, compared with another three state-of-the-art RL methods, the AMORL method takes almost the same CPU time of several seconds, but is slightly superior to the state-of-the-art methods in terms of optimal objective values. Additionally, the calculated values of the AMORL method fit the best with the accurate values during each iteration, resulting in a good convergence.https://ieeexplore.ieee.org/document/9000885/Accelerated multi-objective reinforcement learningdynamic optimizationPareto solutionsshort-term voltage security |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Zhuoming Deng Zhilin Lu Zhifei Guo Wenfeng Yao Wenmeng Zhao Baorong Zhou Chao Hong |
spellingShingle |
Zhuoming Deng Zhilin Lu Zhifei Guo Wenfeng Yao Wenmeng Zhao Baorong Zhou Chao Hong Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement Learning IEEE Access Accelerated multi-objective reinforcement learning dynamic optimization Pareto solutions short-term voltage security |
author_facet |
Zhuoming Deng Zhilin Lu Zhifei Guo Wenfeng Yao Wenmeng Zhao Baorong Zhou Chao Hong |
author_sort |
Zhuoming Deng |
title |
Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement Learning |
title_short |
Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement Learning |
title_full |
Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement Learning |
title_fullStr |
Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement Learning |
title_full_unstemmed |
Coordinated Optimization of Generation and Compensation to Enhance Short-Term Voltage Security of Power Systems Using Accelerated Multi-Objective Reinforcement Learning |
title_sort |
coordinated optimization of generation and compensation to enhance short-term voltage security of power systems using accelerated multi-objective reinforcement learning |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
High proportions of asynchronous motors in demand-side have pressured heavily on short-term voltage security of receiving-end power systems. To enhance short-term voltage security, this paper coordinates the optimal outputs of generation and compensation in a multi-objective dynamic optimization model. With equipment dynamics, network load flows, lower and upper limitations, and security constraints considered, this model simultaneously minimizes two objectives: the expense of control decision and the voltage deviation. The Radau collocation method is employed to handle dynamics, by transforming all differential algebraic equations into algebraic ones. Most importantly, Pareto solutions are obtained through an accelerated multi-objective reinforcement learning (AMORL) method by filtering the dominated solutions. The entire feasible region is partitioned into small independent regions, to eliminate the scope for Pareto solutions. Besides, the AMORL method redefines the state functions and introduces creative state sensitivities, which accelerate the switch from learning to applying, once the agent accumulates sufficient knowledge. Furthermore, Pareto solutions are diversified via introducing some potential solutions. Lastly, the Fuzzy decision-making methodology picks up the tradeoff solution. Case studies are implemented on a practical 748-node power grid, which validate the acceleration and efficiency of the AMORL method. The AMORL method is overall superior to conventional reinforcement learning (RL) method with more optimal non-dominated objective values, much shorter CPU time, and better convergence to accurate values. Moreover, compared with another three state-of-the-art RL methods, the AMORL method takes almost the same CPU time of several seconds, but is slightly superior to the state-of-the-art methods in terms of optimal objective values. Additionally, the calculated values of the AMORL method fit the best with the accurate values during each iteration, resulting in a good convergence. |
topic |
Accelerated multi-objective reinforcement learning dynamic optimization Pareto solutions short-term voltage security |
url |
https://ieeexplore.ieee.org/document/9000885/ |
work_keys_str_mv |
AT zhuomingdeng coordinatedoptimizationofgenerationandcompensationtoenhanceshorttermvoltagesecurityofpowersystemsusingacceleratedmultiobjectivereinforcementlearning AT zhilinlu coordinatedoptimizationofgenerationandcompensationtoenhanceshorttermvoltagesecurityofpowersystemsusingacceleratedmultiobjectivereinforcementlearning AT zhifeiguo coordinatedoptimizationofgenerationandcompensationtoenhanceshorttermvoltagesecurityofpowersystemsusingacceleratedmultiobjectivereinforcementlearning AT wenfengyao coordinatedoptimizationofgenerationandcompensationtoenhanceshorttermvoltagesecurityofpowersystemsusingacceleratedmultiobjectivereinforcementlearning AT wenmengzhao coordinatedoptimizationofgenerationandcompensationtoenhanceshorttermvoltagesecurityofpowersystemsusingacceleratedmultiobjectivereinforcementlearning AT baorongzhou coordinatedoptimizationofgenerationandcompensationtoenhanceshorttermvoltagesecurityofpowersystemsusingacceleratedmultiobjectivereinforcementlearning AT chaohong coordinatedoptimizationofgenerationandcompensationtoenhanceshorttermvoltagesecurityofpowersystemsusingacceleratedmultiobjectivereinforcementlearning |
_version_ |
1724185849215582208 |