Multi-object aerodynamic design optimization using deep reinforcement learning

Aerodynamic design optimization is a key aspect in aircraft design. The further evolution of advanced aircraft derivatives requires a powerful optimization toolbox. Reinforcement learning (RL) is a powerful optimization tool but has rarely been utilized in the aerodynamic design. It can potentially...

Full description

Bibliographic Details
Main Authors: Xinyu Hui, Hui Wang, Wenqiang Li, Junqiang Bai, Fei Qin, Guoqiang He
Format: Article
Language:English
Published: AIP Publishing LLC 2021-08-01
Series:AIP Advances
Online Access:http://dx.doi.org/10.1063/5.0058088
id doaj-2d06a9c65b5a46279493317440c2820f
record_format Article
spelling doaj-2d06a9c65b5a46279493317440c2820f2021-09-03T11:18:12ZengAIP Publishing LLCAIP Advances2158-32262021-08-01118085311085311-910.1063/5.0058088Multi-object aerodynamic design optimization using deep reinforcement learningXinyu Hui0Hui Wang1Wenqiang Li2Junqiang Bai3Fei Qin4Guoqiang He5School of Aeronautics, Northwestern Polytechnical University, Xi’an 710072, ChinaSchool of Aeronautics, Northwestern Polytechnical University, Xi’an 710072, ChinaKey Laboratory of Science and Technology on Combustion, Internal Flow and Thermal-Structure, Northwestern Polytechnical University, Xi’an 710072, ChinaSchool of Aeronautics, Northwestern Polytechnical University, Xi’an 710072, ChinaKey Laboratory of Science and Technology on Combustion, Internal Flow and Thermal-Structure, Northwestern Polytechnical University, Xi’an 710072, ChinaKey Laboratory of Science and Technology on Combustion, Internal Flow and Thermal-Structure, Northwestern Polytechnical University, Xi’an 710072, ChinaAerodynamic design optimization is a key aspect in aircraft design. The further evolution of advanced aircraft derivatives requires a powerful optimization toolbox. Reinforcement learning (RL) is a powerful optimization tool but has rarely been utilized in the aerodynamic design. It can potentially obtain results similar to those of a human designer, by accumulating experience from training. In this work, a popular RL method called proximal policy optimization (PPO) is proposed to investigate multi-object aerodynamic design optimization. By observing the aerodynamic performances of different airfoils, the PPO updates a reasonable policy to generate the optimal airfoils in a single step. In a Pareto optimization problem with constraints, the PPO requires only 15% of the computational time of the non-dominated sorted genetic algorithm (II) to achieve the same accuracy. The results from testing show that the agent learns a policy that can achieve ∼4.3%–10.1% improvements of the aerodynamic performance compared with the results of baseline.http://dx.doi.org/10.1063/5.0058088
collection DOAJ
language English
format Article
sources DOAJ
author Xinyu Hui
Hui Wang
Wenqiang Li
Junqiang Bai
Fei Qin
Guoqiang He
spellingShingle Xinyu Hui
Hui Wang
Wenqiang Li
Junqiang Bai
Fei Qin
Guoqiang He
Multi-object aerodynamic design optimization using deep reinforcement learning
AIP Advances
author_facet Xinyu Hui
Hui Wang
Wenqiang Li
Junqiang Bai
Fei Qin
Guoqiang He
author_sort Xinyu Hui
title Multi-object aerodynamic design optimization using deep reinforcement learning
title_short Multi-object aerodynamic design optimization using deep reinforcement learning
title_full Multi-object aerodynamic design optimization using deep reinforcement learning
title_fullStr Multi-object aerodynamic design optimization using deep reinforcement learning
title_full_unstemmed Multi-object aerodynamic design optimization using deep reinforcement learning
title_sort multi-object aerodynamic design optimization using deep reinforcement learning
publisher AIP Publishing LLC
series AIP Advances
issn 2158-3226
publishDate 2021-08-01
description Aerodynamic design optimization is a key aspect in aircraft design. The further evolution of advanced aircraft derivatives requires a powerful optimization toolbox. Reinforcement learning (RL) is a powerful optimization tool but has rarely been utilized in the aerodynamic design. It can potentially obtain results similar to those of a human designer, by accumulating experience from training. In this work, a popular RL method called proximal policy optimization (PPO) is proposed to investigate multi-object aerodynamic design optimization. By observing the aerodynamic performances of different airfoils, the PPO updates a reasonable policy to generate the optimal airfoils in a single step. In a Pareto optimization problem with constraints, the PPO requires only 15% of the computational time of the non-dominated sorted genetic algorithm (II) to achieve the same accuracy. The results from testing show that the agent learns a policy that can achieve ∼4.3%–10.1% improvements of the aerodynamic performance compared with the results of baseline.
url http://dx.doi.org/10.1063/5.0058088
work_keys_str_mv AT xinyuhui multiobjectaerodynamicdesignoptimizationusingdeepreinforcementlearning
AT huiwang multiobjectaerodynamicdesignoptimizationusingdeepreinforcementlearning
AT wenqiangli multiobjectaerodynamicdesignoptimizationusingdeepreinforcementlearning
AT junqiangbai multiobjectaerodynamicdesignoptimizationusingdeepreinforcementlearning
AT feiqin multiobjectaerodynamicdesignoptimizationusingdeepreinforcementlearning
AT guoqianghe multiobjectaerodynamicdesignoptimizationusingdeepreinforcementlearning
_version_ 1717817416401027072