A Guidance Law for Terminal Phase Exo-Atmospheric Interception Against a Maneuvering Target using Angle-Only Measurements Optimized using Reinforcement Meta-Learning

We present a novel guidance law that uses observations consisting solely of seeker line of sight angle measurements and their rate of change. The policy is optimized using reinforcement meta-learning and demonstrated in a simulated terminal phase of a mid-course exo-atmospheric interception. Importa...

Bibliographic Details
Main Authors: Gaudet, Brian (Author), Furfaro, Roberto (Author), Linares, Richard (Author)
Format: Article
Language: English
Published: American Institute of Aeronautics and Astronautics, 2022-03-21T14:36:11Z.
Online Access: Get fulltext (https://hdl.handle.net/1721.1/137649.2)
LEADER 01586 am a22001693u 4500
001 137649.2
042 |a dc 
100 1 0 |a Gaudet, Brian  |e author 
700 1 0 |a Furfaro, Roberto  |e author 
700 1 0 |a Linares, Richard  |e author 
245 0 0 |a A Guidance Law for Terminal Phase Exo-Atmospheric Interception Against a Maneuvering Target using Angle-Only Measurements Optimized using Reinforcement Meta-Learning 
260 |b American Institute of Aeronautics and Astronautics,   |c 2022-03-21T14:36:11Z. 
856 |z Get fulltext  |u https://hdl.handle.net/1721.1/137649.2 
520 |a We present a novel guidance law that uses observations consisting solely of seeker line of sight angle measurements and their rate of change. The policy is optimized using reinforcement meta-learning and demonstrated in a simulated terminal phase of a mid-course exo-atmospheric interception. Importantly, the guidance law does not require range estimation, making it particularly suitable for passive seekers. The optimized policy maps stabilized seeker line of sight angles and their rate of change directly to commanded thrust for the missile's divert thrusters. The use of reinforcement meta-learning allows the optimized policy to adapt to target acceleration, and we demonstrate that the policy has superior performance as compared to augmented zero-effort miss guidance with perfect target acceleration knowledge. The optimized policy is computationally efficient and requires minimal memory, and should be compatible with today's flight processors. 
546 |a en 
655 7 |a Article 
773 |t AIAA Scitech 2020 Forum