CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA

Made available in DSpace on 2016-08-17T14:53:22Z (GMT). No. of bitstreams: 1 Dissertacao Allan James.pdf: 3170694 bytes, checksum: 054a9e74e81a7c2099800246d0b6c530 (MD5) Previous issue date: 2012-09-28 === Coordenação de Aperfeiçoamento de Pessoal de Nível Superior === The union of methodologies...

Full description

Bibliographic Details
Main Author:	Maciel, Allan James Ferreira
Other Authors:	Fonseca Neto, João Viana da
Format:	Others
Language:	Portuguese
Published:	Universidade Federal do Maranhão 2016
Subjects:	Programação Dinâmica Heurística Controle Multivariável Controle Ótimo Regulador Quadrático Linear Discreto Mínimos Quadrados Recursivos Controle Digital Heuristic Dynamic Programming Multivariable Control Optimal Control Discrete Linear Quadratic Regulator Recursive Least Squares Digital Control CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
Online Access:	http://tedebc.ufma.br:8080/jspui/handle/tede/494

id	ndltd-IBICT-oai-tede2-tede-494
record_format	oai_dc
spelling	ndltd-IBICT-oai-tede2-tede-4942019-01-22T00:41:43Z CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA CONVERGENCE OF ESTIMATOR RLS FOR ALGORITHMS OF HEURISTIC DYNAMIC PROGRAMMING Maciel, Allan James Ferreira Fonseca Neto, João Viana da Serra, Ginalber Luiz de Oliveira Programação Dinâmica Heurística Controle Multivariável Controle Ótimo Regulador Quadrático Linear Discreto Mínimos Quadrados Recursivos Controle Digital Heuristic Dynamic Programming Multivariable Control Optimal Control Discrete Linear Quadratic Regulator Recursive Least Squares Digital Control CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO Made available in DSpace on 2016-08-17T14:53:22Z (GMT). No. of bitstreams: 1 Dissertacao Allan James.pdf: 3170694 bytes, checksum: 054a9e74e81a7c2099800246d0b6c530 (MD5) Previous issue date: 2012-09-28 Coordenação de Aperfeiçoamento de Pessoal de Nível Superior The union of methodologies for optimal control and dynamics programming has stimulated the development of algorithms for realization of discrete control systems of the type linear quadratic regulator (DLQR). The methodology is based on reinforcement learning methods based on temporal differences and approximate dynamic programming. The proposed method combines the approach of the value function by method RLS (recursive least squares) and approximate policy iteration schemes heuristic dynamic programming (HDP). The approach is directed to the assessment of convergence of the solution DLQR and the heuristic weighting matrices 􀜳 and 􀜴 of the utility function associated with DLQR. The investigation of convergence properties related to consistency, persistent excitation and polarization of the RLS estimator is performed. The methodology involved in a project achievements online DLQR controllers and is evaluated in a fourth order multivariable dynamic system. A união das metodologias de controle ótimo e de programação dinâmica tem impulsionado o desenvolvimento de algoritmos para realizações de sistemas de controle discreto do tipo regulador linear quadrático (DLQR). A metodologia utilizada neste trabalho é fundamentada sobre métodos de aprendizagem por reforço baseados em diferenças temporais e programação dinâmica aproximada. O método proposto combina a aproximação da função valor através do método RLS (mínimos quadrados recursivos) e iteração de política aproximada em esquemas de programação dinâmica heurística (HDP). A abordagem é orientada para a avaliação da convergência da solução DLQR e para a sintonia heurística das matrizes de ponderação 􀜳 e 􀜴da função de utilidade associada ao DLQR. É realizada a investigação das propriedades de convergência relacionadas à consistência, excitação persistente e polarização do estimador RLS. A metodologia contempla realizações de projetos de forma online de controladores DLQR e é avaliada em um sistema dinâmico multivariável de quarta ordem. 2016-08-17T14:53:22Z 2013-04-03 2012-09-28 info:eu-repo/semantics/publishedVersion info:eu-repo/semantics/masterThesis MACIEL, Allan James Ferreira. CONVERGENCE OF ESTIMATOR RLS FOR ALGORITHMS OF HEURISTIC DYNAMIC PROGRAMMING. 2012. 121 f. Dissertação (Mestrado em Engenharia) - Universidade Federal do Maranhão, São Luís, 2012. http://tedebc.ufma.br:8080/jspui/handle/tede/494 por info:eu-repo/semantics/openAccess application/pdf Universidade Federal do Maranhão PROGRAMA DE PÓS-GRADUAÇÃO EM ENGENHARIA DE ELETRICIDADE/CCET UFMA BR Engenharia reponame:Biblioteca Digital de Teses e Dissertações da UFMA instname:Universidade Federal do Maranhão instacron:UFMA
collection	NDLTD
language	Portuguese
format	Others
sources	NDLTD
topic	Programação Dinâmica Heurística Controle Multivariável Controle Ótimo Regulador Quadrático Linear Discreto Mínimos Quadrados Recursivos Controle Digital Heuristic Dynamic Programming Multivariable Control Optimal Control Discrete Linear Quadratic Regulator Recursive Least Squares Digital Control CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
spellingShingle	Programação Dinâmica Heurística Controle Multivariável Controle Ótimo Regulador Quadrático Linear Discreto Mínimos Quadrados Recursivos Controle Digital Heuristic Dynamic Programming Multivariable Control Optimal Control Discrete Linear Quadratic Regulator Recursive Least Squares Digital Control CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO Maciel, Allan James Ferreira CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA
description	Made available in DSpace on 2016-08-17T14:53:22Z (GMT). No. of bitstreams: 1 Dissertacao Allan James.pdf: 3170694 bytes, checksum: 054a9e74e81a7c2099800246d0b6c530 (MD5) Previous issue date: 2012-09-28 === Coordenação de Aperfeiçoamento de Pessoal de Nível Superior === The union of methodologies for optimal control and dynamics programming has stimulated the development of algorithms for realization of discrete control systems of the type linear quadratic regulator (DLQR). The methodology is based on reinforcement learning methods based on temporal differences and approximate dynamic programming. The proposed method combines the approach of the value function by method RLS (recursive least squares) and approximate policy iteration schemes heuristic dynamic programming (HDP). The approach is directed to the assessment of convergence of the solution DLQR and the heuristic weighting matrices 􀜳 and 􀜴 of the utility function associated with DLQR. The investigation of convergence properties related to consistency, persistent excitation and polarization of the RLS estimator is performed. The methodology involved in a project achievements online DLQR controllers and is evaluated in a fourth order multivariable dynamic system. === A união das metodologias de controle ótimo e de programação dinâmica tem impulsionado o desenvolvimento de algoritmos para realizações de sistemas de controle discreto do tipo regulador linear quadrático (DLQR). A metodologia utilizada neste trabalho é fundamentada sobre métodos de aprendizagem por reforço baseados em diferenças temporais e programação dinâmica aproximada. O método proposto combina a aproximação da função valor através do método RLS (mínimos quadrados recursivos) e iteração de política aproximada em esquemas de programação dinâmica heurística (HDP). A abordagem é orientada para a avaliação da convergência da solução DLQR e para a sintonia heurística das matrizes de ponderação 􀜳 e 􀜴da função de utilidade associada ao DLQR. É realizada a investigação das propriedades de convergência relacionadas à consistência, excitação persistente e polarização do estimador RLS. A metodologia contempla realizações de projetos de forma online de controladores DLQR e é avaliada em um sistema dinâmico multivariável de quarta ordem.
author2	Fonseca Neto, João Viana da
author_facet	Fonseca Neto, João Viana da Maciel, Allan James Ferreira
author	Maciel, Allan James Ferreira
author_sort	Maciel, Allan James Ferreira
title	CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA
title_short	CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA
title_full	CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA
title_fullStr	CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA
title_full_unstemmed	CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA
title_sort	convergência do estimador rls para algoritmos de programação dinâmica heurística
publisher	Universidade Federal do Maranhão
publishDate	2016
url	http://tedebc.ufma.br:8080/jspui/handle/tede/494
work_keys_str_mv	AT macielallanjamesferreira convergenciadoestimadorrlsparaalgoritmosdeprogramacaodinamicaheuristica AT macielallanjamesferreira convergenceofestimatorrlsforalgorithmsofheuristicdynamicprogramming
_version_	1718925675666079744

CONVERGÊNCIA DO ESTIMADOR RLS PARA ALGORITMOS DE PROGRAMAÇÃO DINÂMICA HEURÍSTICA

Similar Items