Model-Free Gradient-Based Adaptive Learning Controller for an Unmanned Flexible Wing Aircraft

Classical gradient-based approximate dynamic programming approaches provide reliable and fast solution platforms for various optimal control problems. However, their dependence on accurate modeling approaches poses a major concern, where the efficiency of the proposed solutions are severely degraded...

Full description

Bibliographic Details
Main Authors:	Mohammed Abouheaf, Wail Gueaieb, Frank Lewis
Format:	Article
Language:	English
Published:	MDPI AG 2018-10-01
Series:	Robotics
Subjects:	model-free control flexible wing aircraft reinforcement learning optimal control
Online Access:	https://www.mdpi.com/2218-6581/7/4/66

id	doaj-e53a682bed4245b280aa9422a6f8f685
record_format	Article
spelling	doaj-e53a682bed4245b280aa9422a6f8f6852020-11-25T00:49:11ZengMDPI AGRobotics2218-65812018-10-01746610.3390/robotics7040066robotics7040066Model-Free Gradient-Based Adaptive Learning Controller for an Unmanned Flexible Wing AircraftMohammed Abouheaf0Wail Gueaieb1Frank Lewis2School of Electrical Engineering and Computer Science, University of Ottawa, Ottawa, ON K1N 6N5, CanadaSchool of Electrical Engineering and Computer Science, University of Ottawa, Ottawa, ON K1N 6N5, CanadaDepartment of Electrical Engineering, University of Texas at Arlington, Arlington, TX 76019, USAClassical gradient-based approximate dynamic programming approaches provide reliable and fast solution platforms for various optimal control problems. However, their dependence on accurate modeling approaches poses a major concern, where the efficiency of the proposed solutions are severely degraded in the case of uncertain dynamical environments. Herein, a novel online adaptive learning framework is introduced to solve action-dependent dual heuristic dynamic programming problems. The approach does not depend on the dynamical models of the considered systems. Instead, it employs optimization principles to produce model-free control strategies. A policy iteration process is employed to solve the underlying Hamilton⁻Jacobi⁻Bellman equation using means of adaptive critics, where a layer of separate actor-critic neural networks is employed along with gradient descent adaptation rules. A Riccati development is introduced and shown to be equivalent to solving the underlying Hamilton⁻Jacobi⁻Bellman equation. The proposed approach is applied on the challenging weight shift control problem of a flexible wing aircraft. The continuous nonlinear deformation in the aircraft’s flexible wing leads to various aerodynamic variations at different trim speeds, which makes its auto-pilot control a complicated task. Series of numerical simulations were carried out to demonstrate the effectiveness of the suggested strategy.https://www.mdpi.com/2218-6581/7/4/66model-free controlflexible wing aircraftreinforcement learningoptimal control
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Mohammed Abouheaf Wail Gueaieb Frank Lewis
spellingShingle	Mohammed Abouheaf Wail Gueaieb Frank Lewis Model-Free Gradient-Based Adaptive Learning Controller for an Unmanned Flexible Wing Aircraft Robotics model-free control flexible wing aircraft reinforcement learning optimal control
author_facet	Mohammed Abouheaf Wail Gueaieb Frank Lewis
author_sort	Mohammed Abouheaf
title	Model-Free Gradient-Based Adaptive Learning Controller for an Unmanned Flexible Wing Aircraft
title_short	Model-Free Gradient-Based Adaptive Learning Controller for an Unmanned Flexible Wing Aircraft
title_full	Model-Free Gradient-Based Adaptive Learning Controller for an Unmanned Flexible Wing Aircraft
title_fullStr	Model-Free Gradient-Based Adaptive Learning Controller for an Unmanned Flexible Wing Aircraft
title_full_unstemmed	Model-Free Gradient-Based Adaptive Learning Controller for an Unmanned Flexible Wing Aircraft
title_sort	model-free gradient-based adaptive learning controller for an unmanned flexible wing aircraft
publisher	MDPI AG
series	Robotics
issn	2218-6581
publishDate	2018-10-01
description	Classical gradient-based approximate dynamic programming approaches provide reliable and fast solution platforms for various optimal control problems. However, their dependence on accurate modeling approaches poses a major concern, where the efficiency of the proposed solutions are severely degraded in the case of uncertain dynamical environments. Herein, a novel online adaptive learning framework is introduced to solve action-dependent dual heuristic dynamic programming problems. The approach does not depend on the dynamical models of the considered systems. Instead, it employs optimization principles to produce model-free control strategies. A policy iteration process is employed to solve the underlying Hamilton⁻Jacobi⁻Bellman equation using means of adaptive critics, where a layer of separate actor-critic neural networks is employed along with gradient descent adaptation rules. A Riccati development is introduced and shown to be equivalent to solving the underlying Hamilton⁻Jacobi⁻Bellman equation. The proposed approach is applied on the challenging weight shift control problem of a flexible wing aircraft. The continuous nonlinear deformation in the aircraft’s flexible wing leads to various aerodynamic variations at different trim speeds, which makes its auto-pilot control a complicated task. Series of numerical simulations were carried out to demonstrate the effectiveness of the suggested strategy.
topic	model-free control flexible wing aircraft reinforcement learning optimal control
url	https://www.mdpi.com/2218-6581/7/4/66
work_keys_str_mv	AT mohammedabouheaf modelfreegradientbasedadaptivelearningcontrollerforanunmannedflexiblewingaircraft AT wailgueaieb modelfreegradientbasedadaptivelearningcontrollerforanunmannedflexiblewingaircraft AT franklewis modelfreegradientbasedadaptivelearningcontrollerforanunmannedflexiblewingaircraft
_version_	1725252589645201408

Model-Free Gradient-Based Adaptive Learning Controller for an Unmanned Flexible Wing Aircraft

Similar Items