Output Feedback H∞ Control for Linear Discrete-Time Multi-Player Systems With Multi-Source Disturbances Using Off-Policy Q-Learning

In this paper, a data-driven optimal control method based on adaptive dynamic programming and game theory is presented for solving the output feedback solutions of the H<sub>∞</sub> control problem for linear discrete-time systems with multiple players subject to multi-source...

Full description

Bibliographic Details
Main Authors:	Zhenfei Xiao, Jinna Li, Ping Li
Format:	Article
Language:	English
Published:	IEEE 2020-01-01
Series:	IEEE Access
Subjects:	Output feedback H∞ control adaptive dynamic programming game theory reinforcement learning
Online Access:	https://ieeexplore.ieee.org/document/9261350/

id	doaj-3e357ede12024911adea612ed20ecdab
record_format	Article
spelling	doaj-3e357ede12024911adea612ed20ecdab2021-03-30T03:35:00ZengIEEEIEEE Access2169-35362020-01-01820893820895110.1109/ACCESS.2020.30386749261350Output Feedback H∞ Control for Linear Discrete-Time Multi-Player Systems With Multi-Source Disturbances Using Off-Policy Q-LearningZhenfei Xiao0Jinna Li1https://orcid.org/0000-0001-9985-6308Ping Li2https://orcid.org/0000-0002-3216-6246School of Information and Control Engineering, Liaoning Shihua University, Liaoning, ChinaSchool of Information and Control Engineering, Liaoning Shihua University, Liaoning, ChinaSchool of Information and Control Engineering, Liaoning Shihua University, Liaoning, ChinaIn this paper, a data-driven optimal control method based on adaptive dynamic programming and game theory is presented for solving the output feedback solutions of the H<sub>∞</sub> control problem for linear discrete-time systems with multiple players subject to multi-source disturbances. We first transform the H<sub>∞</sub> control problem into a multi-player game problem following the theoretical solutions according to game theory. Since the system state may not be measurable, we derive the output feedback based control policies and disturbances through mathematical operations. Considering the advantages of off-policy reinforcement learning (RL) over on-policy RL, a novel off-policy game Q-learning algorithm dealing with mixed competition and cooperation among players is developed, such that the H<sub>∞</sub> control problem can be finally solved for linear multi-player systems without the knowledge of system dynamics. Moreover, rigorous proofs of algorithm convergence and unbiasedness of solutions are presented. Finally, simulation results demonstrated the effectiveness of the proposed method.https://ieeexplore.ieee.org/document/9261350/Output feedbackH∞ controladaptive dynamic programminggame theoryreinforcement learning
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Zhenfei Xiao Jinna Li Ping Li
spellingShingle	Zhenfei Xiao Jinna Li Ping Li Output Feedback H∞ Control for Linear Discrete-Time Multi-Player Systems With Multi-Source Disturbances Using Off-Policy Q-Learning IEEE Access Output feedback H∞ control adaptive dynamic programming game theory reinforcement learning
author_facet	Zhenfei Xiao Jinna Li Ping Li
author_sort	Zhenfei Xiao
title	Output Feedback H∞ Control for Linear Discrete-Time Multi-Player Systems With Multi-Source Disturbances Using Off-Policy Q-Learning
title_short	Output Feedback H∞ Control for Linear Discrete-Time Multi-Player Systems With Multi-Source Disturbances Using Off-Policy Q-Learning
title_full	Output Feedback H∞ Control for Linear Discrete-Time Multi-Player Systems With Multi-Source Disturbances Using Off-Policy Q-Learning
title_fullStr	Output Feedback H∞ Control for Linear Discrete-Time Multi-Player Systems With Multi-Source Disturbances Using Off-Policy Q-Learning
title_full_unstemmed	Output Feedback H∞ Control for Linear Discrete-Time Multi-Player Systems With Multi-Source Disturbances Using Off-Policy Q-Learning
title_sort	output feedback h∞ control for linear discrete-time multi-player systems with multi-source disturbances using off-policy q-learning
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2020-01-01
description	In this paper, a data-driven optimal control method based on adaptive dynamic programming and game theory is presented for solving the output feedback solutions of the H<sub>∞</sub> control problem for linear discrete-time systems with multiple players subject to multi-source disturbances. We first transform the H<sub>∞</sub> control problem into a multi-player game problem following the theoretical solutions according to game theory. Since the system state may not be measurable, we derive the output feedback based control policies and disturbances through mathematical operations. Considering the advantages of off-policy reinforcement learning (RL) over on-policy RL, a novel off-policy game Q-learning algorithm dealing with mixed competition and cooperation among players is developed, such that the H<sub>∞</sub> control problem can be finally solved for linear multi-player systems without the knowledge of system dynamics. Moreover, rigorous proofs of algorithm convergence and unbiasedness of solutions are presented. Finally, simulation results demonstrated the effectiveness of the proposed method.
topic	Output feedback H∞ control adaptive dynamic programming game theory reinforcement learning
url	https://ieeexplore.ieee.org/document/9261350/
work_keys_str_mv	AT zhenfeixiao outputfeedbackhx221econtrolforlineardiscretetimemultiplayersystemswithmultisourcedisturbancesusingoffpolicyqlearning AT jinnali outputfeedbackhx221econtrolforlineardiscretetimemultiplayersystemswithmultisourcedisturbancesusingoffpolicyqlearning AT pingli outputfeedbackhx221econtrolforlineardiscretetimemultiplayersystemswithmultisourcedisturbancesusingoffpolicyqlearning
_version_	1724183169864826880

Output Feedback H&#x221E; Control for Linear Discrete-Time Multi-Player Systems With Multi-Source Disturbances Using Off-Policy Q-Learning

Similar Items

Output Feedback H∞ Control for Linear Discrete-Time Multi-Player Systems With Multi-Source Disturbances Using Off-Policy Q-Learning