Optimization of Constrained Stochastic Linear-Quadratic Control on an Infinite Horizon: A Direct-Comparison Based Approach

In this paper we study the optimization of the discrete-time stochastic linear-quadratic (LQ) control problem with conic control constraints on an infinite horizon, considering multiplicative noises. Stochastic control systems can be formulated as Markov Decision Problems (MDPs) with continuous stat...

Full description

Bibliographic Details
Main Authors: Ruobing Xue, Xiangshen Ye, Weiping Wu
Format: Article
Language:English
Published: MDPI AG 2020-02-01
Series:Algorithms
Subjects:
Online Access:https://www.mdpi.com/1999-4893/13/2/49
id doaj-43cd66f56dee43779bdfc01df050d199
record_format Article
spelling doaj-43cd66f56dee43779bdfc01df050d1992020-11-25T01:55:07ZengMDPI AGAlgorithms1999-48932020-02-011324910.3390/a13020049a13020049Optimization of Constrained Stochastic Linear-Quadratic Control on an Infinite Horizon: A Direct-Comparison Based ApproachRuobing Xue0Xiangshen Ye1Weiping Wu2Department of Automation, Shanghai Jiaotong University, Shanghai 200240, ChinaDepartment of Automation, Shanghai Jiaotong University, Shanghai 200240, ChinaSchool of Economics and Management, Fuzhou University, Fuzhou 350108, ChinaIn this paper we study the optimization of the discrete-time stochastic linear-quadratic (LQ) control problem with conic control constraints on an infinite horizon, considering multiplicative noises. Stochastic control systems can be formulated as Markov Decision Problems (MDPs) with continuous state spaces and therefore we can apply the direct-comparison based optimization approach to solve the problem. We first derive the performance difference formula for the LQ problem by utilizing the state separation property of the system structure. Based on this, we successfully derive the optimality conditions and the stationary optimal feedback control. By introducing the optimization, we establish a general framework for infinite horizon stochastic control problems. The direct-comparison based approach is applicable to both linear and nonlinear systems. Our work provides a new perspective in LQ control problems; based on this approach, learning based algorithms can be developed without identifying all of the system parameters.https://www.mdpi.com/1999-4893/13/2/49linear-quadraticmarkov decision process(mdp)conic constraintsstochastic controldirect-comparison based approach
collection DOAJ
language English
format Article
sources DOAJ
author Ruobing Xue
Xiangshen Ye
Weiping Wu
spellingShingle Ruobing Xue
Xiangshen Ye
Weiping Wu
Optimization of Constrained Stochastic Linear-Quadratic Control on an Infinite Horizon: A Direct-Comparison Based Approach
Algorithms
linear-quadratic
markov decision process(mdp)
conic constraints
stochastic control
direct-comparison based approach
author_facet Ruobing Xue
Xiangshen Ye
Weiping Wu
author_sort Ruobing Xue
title Optimization of Constrained Stochastic Linear-Quadratic Control on an Infinite Horizon: A Direct-Comparison Based Approach
title_short Optimization of Constrained Stochastic Linear-Quadratic Control on an Infinite Horizon: A Direct-Comparison Based Approach
title_full Optimization of Constrained Stochastic Linear-Quadratic Control on an Infinite Horizon: A Direct-Comparison Based Approach
title_fullStr Optimization of Constrained Stochastic Linear-Quadratic Control on an Infinite Horizon: A Direct-Comparison Based Approach
title_full_unstemmed Optimization of Constrained Stochastic Linear-Quadratic Control on an Infinite Horizon: A Direct-Comparison Based Approach
title_sort optimization of constrained stochastic linear-quadratic control on an infinite horizon: a direct-comparison based approach
publisher MDPI AG
series Algorithms
issn 1999-4893
publishDate 2020-02-01
description In this paper we study the optimization of the discrete-time stochastic linear-quadratic (LQ) control problem with conic control constraints on an infinite horizon, considering multiplicative noises. Stochastic control systems can be formulated as Markov Decision Problems (MDPs) with continuous state spaces and therefore we can apply the direct-comparison based optimization approach to solve the problem. We first derive the performance difference formula for the LQ problem by utilizing the state separation property of the system structure. Based on this, we successfully derive the optimality conditions and the stationary optimal feedback control. By introducing the optimization, we establish a general framework for infinite horizon stochastic control problems. The direct-comparison based approach is applicable to both linear and nonlinear systems. Our work provides a new perspective in LQ control problems; based on this approach, learning based algorithms can be developed without identifying all of the system parameters.
topic linear-quadratic
markov decision process(mdp)
conic constraints
stochastic control
direct-comparison based approach
url https://www.mdpi.com/1999-4893/13/2/49
work_keys_str_mv AT ruobingxue optimizationofconstrainedstochasticlinearquadraticcontrolonaninfinitehorizonadirectcomparisonbasedapproach
AT xiangshenye optimizationofconstrainedstochasticlinearquadraticcontrolonaninfinitehorizonadirectcomparisonbasedapproach
AT weipingwu optimizationofconstrainedstochasticlinearquadraticcontrolonaninfinitehorizonadirectcomparisonbasedapproach
_version_ 1724984957867130880