Multi-object aerodynamic design optimization using deep reinforcement learning

Aerodynamic design optimization is a key aspect in aircraft design. The further evolution of advanced aircraft derivatives requires a powerful optimization toolbox. Reinforcement learning (RL) is a powerful optimization tool but has rarely been utilized in the aerodynamic design. It can potentially...

Full description

Bibliographic Details
Main Authors:	Xinyu Hui, Hui Wang, Wenqiang Li, Junqiang Bai, Fei Qin, Guoqiang He
Format:	Article
Language:	English
Published:	AIP Publishing LLC 2021-08-01
Series:	AIP Advances
Online Access:	http://dx.doi.org/10.1063/5.0058088

id	doaj-2d06a9c65b5a46279493317440c2820f
record_format	Article
spelling	doaj-2d06a9c65b5a46279493317440c2820f2021-09-03T11:18:12ZengAIP Publishing LLCAIP Advances2158-32262021-08-01118085311085311-910.1063/5.0058088Multi-object aerodynamic design optimization using deep reinforcement learningXinyu Hui0Hui Wang1Wenqiang Li2Junqiang Bai3Fei Qin4Guoqiang He5School of Aeronautics, Northwestern Polytechnical University, Xi’an 710072, ChinaSchool of Aeronautics, Northwestern Polytechnical University, Xi’an 710072, ChinaKey Laboratory of Science and Technology on Combustion, Internal Flow and Thermal-Structure, Northwestern Polytechnical University, Xi’an 710072, ChinaSchool of Aeronautics, Northwestern Polytechnical University, Xi’an 710072, ChinaKey Laboratory of Science and Technology on Combustion, Internal Flow and Thermal-Structure, Northwestern Polytechnical University, Xi’an 710072, ChinaKey Laboratory of Science and Technology on Combustion, Internal Flow and Thermal-Structure, Northwestern Polytechnical University, Xi’an 710072, ChinaAerodynamic design optimization is a key aspect in aircraft design. The further evolution of advanced aircraft derivatives requires a powerful optimization toolbox. Reinforcement learning (RL) is a powerful optimization tool but has rarely been utilized in the aerodynamic design. It can potentially obtain results similar to those of a human designer, by accumulating experience from training. In this work, a popular RL method called proximal policy optimization (PPO) is proposed to investigate multi-object aerodynamic design optimization. By observing the aerodynamic performances of different airfoils, the PPO updates a reasonable policy to generate the optimal airfoils in a single step. In a Pareto optimization problem with constraints, the PPO requires only 15% of the computational time of the non-dominated sorted genetic algorithm (II) to achieve the same accuracy. The results from testing show that the agent learns a policy that can achieve ∼4.3%–10.1% improvements of the aerodynamic performance compared with the results of baseline.http://dx.doi.org/10.1063/5.0058088
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Xinyu Hui Hui Wang Wenqiang Li Junqiang Bai Fei Qin Guoqiang He
spellingShingle	Xinyu Hui Hui Wang Wenqiang Li Junqiang Bai Fei Qin Guoqiang He Multi-object aerodynamic design optimization using deep reinforcement learning AIP Advances
author_facet	Xinyu Hui Hui Wang Wenqiang Li Junqiang Bai Fei Qin Guoqiang He
author_sort	Xinyu Hui
title	Multi-object aerodynamic design optimization using deep reinforcement learning
title_short	Multi-object aerodynamic design optimization using deep reinforcement learning
title_full	Multi-object aerodynamic design optimization using deep reinforcement learning
title_fullStr	Multi-object aerodynamic design optimization using deep reinforcement learning
title_full_unstemmed	Multi-object aerodynamic design optimization using deep reinforcement learning
title_sort	multi-object aerodynamic design optimization using deep reinforcement learning
publisher	AIP Publishing LLC
series	AIP Advances
issn	2158-3226
publishDate	2021-08-01
description	Aerodynamic design optimization is a key aspect in aircraft design. The further evolution of advanced aircraft derivatives requires a powerful optimization toolbox. Reinforcement learning (RL) is a powerful optimization tool but has rarely been utilized in the aerodynamic design. It can potentially obtain results similar to those of a human designer, by accumulating experience from training. In this work, a popular RL method called proximal policy optimization (PPO) is proposed to investigate multi-object aerodynamic design optimization. By observing the aerodynamic performances of different airfoils, the PPO updates a reasonable policy to generate the optimal airfoils in a single step. In a Pareto optimization problem with constraints, the PPO requires only 15% of the computational time of the non-dominated sorted genetic algorithm (II) to achieve the same accuracy. The results from testing show that the agent learns a policy that can achieve ∼4.3%–10.1% improvements of the aerodynamic performance compared with the results of baseline.
url	http://dx.doi.org/10.1063/5.0058088
work_keys_str_mv	AT xinyuhui multiobjectaerodynamicdesignoptimizationusingdeepreinforcementlearning AT huiwang multiobjectaerodynamicdesignoptimizationusingdeepreinforcementlearning AT wenqiangli multiobjectaerodynamicdesignoptimizationusingdeepreinforcementlearning AT junqiangbai multiobjectaerodynamicdesignoptimizationusingdeepreinforcementlearning AT feiqin multiobjectaerodynamicdesignoptimizationusingdeepreinforcementlearning AT guoqianghe multiobjectaerodynamicdesignoptimizationusingdeepreinforcementlearning
_version_	1717817416401027072

Multi-object aerodynamic design optimization using deep reinforcement learning

Similar Items