Optimization of Constrained Stochastic Linear-Quadratic Control on an Infinite Horizon: A Direct-Comparison Based Approach
In this paper we study the optimization of the discrete-time stochastic linear-quadratic (LQ) control problem with conic control constraints on an infinite horizon, considering multiplicative noises. Stochastic control systems can be formulated as Markov Decision Problems (MDPs) with continuous stat...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-02-01
|
Series: | Algorithms |
Subjects: | |
Online Access: | https://www.mdpi.com/1999-4893/13/2/49 |
id |
doaj-43cd66f56dee43779bdfc01df050d199 |
---|---|
record_format |
Article |
spelling |
doaj-43cd66f56dee43779bdfc01df050d1992020-11-25T01:55:07ZengMDPI AGAlgorithms1999-48932020-02-011324910.3390/a13020049a13020049Optimization of Constrained Stochastic Linear-Quadratic Control on an Infinite Horizon: A Direct-Comparison Based ApproachRuobing Xue0Xiangshen Ye1Weiping Wu2Department of Automation, Shanghai Jiaotong University, Shanghai 200240, ChinaDepartment of Automation, Shanghai Jiaotong University, Shanghai 200240, ChinaSchool of Economics and Management, Fuzhou University, Fuzhou 350108, ChinaIn this paper we study the optimization of the discrete-time stochastic linear-quadratic (LQ) control problem with conic control constraints on an infinite horizon, considering multiplicative noises. Stochastic control systems can be formulated as Markov Decision Problems (MDPs) with continuous state spaces and therefore we can apply the direct-comparison based optimization approach to solve the problem. We first derive the performance difference formula for the LQ problem by utilizing the state separation property of the system structure. Based on this, we successfully derive the optimality conditions and the stationary optimal feedback control. By introducing the optimization, we establish a general framework for infinite horizon stochastic control problems. The direct-comparison based approach is applicable to both linear and nonlinear systems. Our work provides a new perspective in LQ control problems; based on this approach, learning based algorithms can be developed without identifying all of the system parameters.https://www.mdpi.com/1999-4893/13/2/49linear-quadraticmarkov decision process(mdp)conic constraintsstochastic controldirect-comparison based approach |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ruobing Xue Xiangshen Ye Weiping Wu |
spellingShingle |
Ruobing Xue Xiangshen Ye Weiping Wu Optimization of Constrained Stochastic Linear-Quadratic Control on an Infinite Horizon: A Direct-Comparison Based Approach Algorithms linear-quadratic markov decision process(mdp) conic constraints stochastic control direct-comparison based approach |
author_facet |
Ruobing Xue Xiangshen Ye Weiping Wu |
author_sort |
Ruobing Xue |
title |
Optimization of Constrained Stochastic Linear-Quadratic Control on an Infinite Horizon: A Direct-Comparison Based Approach |
title_short |
Optimization of Constrained Stochastic Linear-Quadratic Control on an Infinite Horizon: A Direct-Comparison Based Approach |
title_full |
Optimization of Constrained Stochastic Linear-Quadratic Control on an Infinite Horizon: A Direct-Comparison Based Approach |
title_fullStr |
Optimization of Constrained Stochastic Linear-Quadratic Control on an Infinite Horizon: A Direct-Comparison Based Approach |
title_full_unstemmed |
Optimization of Constrained Stochastic Linear-Quadratic Control on an Infinite Horizon: A Direct-Comparison Based Approach |
title_sort |
optimization of constrained stochastic linear-quadratic control on an infinite horizon: a direct-comparison based approach |
publisher |
MDPI AG |
series |
Algorithms |
issn |
1999-4893 |
publishDate |
2020-02-01 |
description |
In this paper we study the optimization of the discrete-time stochastic linear-quadratic (LQ) control problem with conic control constraints on an infinite horizon, considering multiplicative noises. Stochastic control systems can be formulated as Markov Decision Problems (MDPs) with continuous state spaces and therefore we can apply the direct-comparison based optimization approach to solve the problem. We first derive the performance difference formula for the LQ problem by utilizing the state separation property of the system structure. Based on this, we successfully derive the optimality conditions and the stationary optimal feedback control. By introducing the optimization, we establish a general framework for infinite horizon stochastic control problems. The direct-comparison based approach is applicable to both linear and nonlinear systems. Our work provides a new perspective in LQ control problems; based on this approach, learning based algorithms can be developed without identifying all of the system parameters. |
topic |
linear-quadratic markov decision process(mdp) conic constraints stochastic control direct-comparison based approach |
url |
https://www.mdpi.com/1999-4893/13/2/49 |
work_keys_str_mv |
AT ruobingxue optimizationofconstrainedstochasticlinearquadraticcontrolonaninfinitehorizonadirectcomparisonbasedapproach AT xiangshenye optimizationofconstrainedstochasticlinearquadraticcontrolonaninfinitehorizonadirectcomparisonbasedapproach AT weipingwu optimizationofconstrainedstochasticlinearquadraticcontrolonaninfinitehorizonadirectcomparisonbasedapproach |
_version_ |
1724984957867130880 |