Implementation of Decentralized Reinforcement Learning-Based Multi-Quadrotor Flocking

Enabling coordinated motion of multiple quadrotors is an active area of research in the field of small unmanned aerial vehicles (sUAVs). While there are many techniques found in the literature that address the problem, these studies are limited to simulation results and seldom account for wind distu...

Full description

Bibliographic Details
Main Authors:	Pramod Abichandani, Christian Speck, Donald Bucci, William Mcintyre, Deepan Lobo
Format:	Article
Language:	English
Published:	IEEE 2021-01-01
Series:	IEEE Access
Subjects:	Cooperative systems design for experiments unmanned aerial vehicles multi-agent systems motion planning supervised learning
Online Access:	https://ieeexplore.ieee.org/document/9548090/

id	doaj-17ad747cbc134e00b5da2838af116023
record_format	Article
spelling	doaj-17ad747cbc134e00b5da2838af1160232021-10-01T23:01:30ZengIEEEIEEE Access2169-35362021-01-01913249113250710.1109/ACCESS.2021.31157119548090Implementation of Decentralized Reinforcement Learning-Based Multi-Quadrotor FlockingPramod Abichandani0https://orcid.org/0000-0003-3572-4458Christian Speck1Donald Bucci2William Mcintyre3Deepan Lobo4https://orcid.org/0000-0002-7434-8013Department of Electrical and Computer Engineering, Newark College of Engineering (NCE), Robotics and Data Laboratory (RADLab), New Jersey Institute of Technology, Newark, NJ, USALockheed Martin Advanced Technology Laboratories, Cherry Hill, NJ, USALockheed Martin Advanced Technology Laboratories, Cherry Hill, NJ, USADepartment of Electrical and Computer Engineering, Newark College of Engineering (NCE), Robotics and Data Laboratory (RADLab), New Jersey Institute of Technology, Newark, NJ, USADepartment of Electrical and Computer Engineering, Newark College of Engineering (NCE), Robotics and Data Laboratory (RADLab), New Jersey Institute of Technology, Newark, NJ, USAEnabling coordinated motion of multiple quadrotors is an active area of research in the field of small unmanned aerial vehicles (sUAVs). While there are many techniques found in the literature that address the problem, these studies are limited to simulation results and seldom account for wind disturbances. This paper presents the experimental validation of a decentralized planner based on multi-objective reinforcement learning (RL) that achieves waypoint-based flocking (separation, velocity alignment, and cohesion) for multiple quadrotors in the presence of wind gusts. The planner is learned using an object-focused, greatest mass, state-action-reward-state-action (OF-GM-SARSA) approach. The Dryden wind gust model is used to simulate wind gusts during hardware-in-the-loop (HWIL) tests. The hardware and software architecture developed for the multi-quadrotor flocking controller is described in detail. HWIL and outdoor flight tests results show that the trained RL planner can generalize the flocking behaviors learned in training to the real-world flight dynamics of the DJI M100 quadrotor in windy conditions.https://ieeexplore.ieee.org/document/9548090/Cooperative systemsdesign for experimentsunmanned aerial vehiclesmulti-agent systemsmotion planningsupervised learning
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Pramod Abichandani Christian Speck Donald Bucci William Mcintyre Deepan Lobo
spellingShingle	Pramod Abichandani Christian Speck Donald Bucci William Mcintyre Deepan Lobo Implementation of Decentralized Reinforcement Learning-Based Multi-Quadrotor Flocking IEEE Access Cooperative systems design for experiments unmanned aerial vehicles multi-agent systems motion planning supervised learning
author_facet	Pramod Abichandani Christian Speck Donald Bucci William Mcintyre Deepan Lobo
author_sort	Pramod Abichandani
title	Implementation of Decentralized Reinforcement Learning-Based Multi-Quadrotor Flocking
title_short	Implementation of Decentralized Reinforcement Learning-Based Multi-Quadrotor Flocking
title_full	Implementation of Decentralized Reinforcement Learning-Based Multi-Quadrotor Flocking
title_fullStr	Implementation of Decentralized Reinforcement Learning-Based Multi-Quadrotor Flocking
title_full_unstemmed	Implementation of Decentralized Reinforcement Learning-Based Multi-Quadrotor Flocking
title_sort	implementation of decentralized reinforcement learning-based multi-quadrotor flocking
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2021-01-01
description	Enabling coordinated motion of multiple quadrotors is an active area of research in the field of small unmanned aerial vehicles (sUAVs). While there are many techniques found in the literature that address the problem, these studies are limited to simulation results and seldom account for wind disturbances. This paper presents the experimental validation of a decentralized planner based on multi-objective reinforcement learning (RL) that achieves waypoint-based flocking (separation, velocity alignment, and cohesion) for multiple quadrotors in the presence of wind gusts. The planner is learned using an object-focused, greatest mass, state-action-reward-state-action (OF-GM-SARSA) approach. The Dryden wind gust model is used to simulate wind gusts during hardware-in-the-loop (HWIL) tests. The hardware and software architecture developed for the multi-quadrotor flocking controller is described in detail. HWIL and outdoor flight tests results show that the trained RL planner can generalize the flocking behaviors learned in training to the real-world flight dynamics of the DJI M100 quadrotor in windy conditions.
topic	Cooperative systems design for experiments unmanned aerial vehicles multi-agent systems motion planning supervised learning
url	https://ieeexplore.ieee.org/document/9548090/
work_keys_str_mv	AT pramodabichandani implementationofdecentralizedreinforcementlearningbasedmultiquadrotorflocking AT christianspeck implementationofdecentralizedreinforcementlearningbasedmultiquadrotorflocking AT donaldbucci implementationofdecentralizedreinforcementlearningbasedmultiquadrotorflocking AT williammcintyre implementationofdecentralizedreinforcementlearningbasedmultiquadrotorflocking AT deepanlobo implementationofdecentralizedreinforcementlearningbasedmultiquadrotorflocking
_version_	1716860661726707712

Implementation of Decentralized Reinforcement Learning-Based Multi-Quadrotor Flocking

Similar Items