Machine learning and dynamic programming algorithms for motion planning and control

Robot motion planning is one of the central problems in robotics, and has received considerable amount of attention not only from roboticists but also from the control and artificial intelligence (AI) communities. Despite the different types of applications and physical properties of robotic systems...

Full description

Bibliographic Details
Main Author: Arslan, Oktay
Other Authors: Tsiotras, Panagiotis
Format: Others
Language:en_US
Published: Georgia Institute of Technology 2016
Subjects:
Online Access:http://hdl.handle.net/1853/54317
id ndltd-GATECH-oai-smartech.gatech.edu-1853-54317
record_format oai_dc
spelling ndltd-GATECH-oai-smartech.gatech.edu-1853-543172016-01-27T03:34:29ZMachine learning and dynamic programming algorithms for motion planning and controlArslan, OktayRobotic motion planningSampling-based algorithmsRapidly-exploring random treesDynamic programmingMachine learningClosed-loop predictionHigh-level route planningRobot motion planning is one of the central problems in robotics, and has received considerable amount of attention not only from roboticists but also from the control and artificial intelligence (AI) communities. Despite the different types of applications and physical properties of robotic systems, many high-level tasks of autonomous systems can be decomposed into subtasks which require point-to-point navigation while avoiding infeasible regions due to the obstacles in the workspace. This dissertation aims at developing a new class of sampling-based motion planning algorithms that are fast, efficient and asymptotically optimal by employing ideas from Machine Learning (ML) and Dynamic Programming (DP). First, we interpret the robot motion planning problem as a form of a machine learning problem since the underlying search space is not known a priori, and utilize random geometric graphs to compute consistent discretizations of the underlying continuous search space. Then, we integrate existing DP algorithms and ML algorithms to the framework of sampling-based algorithms for better exploitation and exploration, respectively. We introduce a novel sampling-based algorithm, called RRT#, that improves upon the well-known RRT* algorithm by leveraging value and policy iteration methods as new information is collected. The proposed algorithms yield provable guarantees on correctness, completeness and asymptotic optimality. We also develop an adaptive sampling strategy by considering exploration as a classification (or regression) problem, and use online machine learning algorithms to learn the relevant region of a query, i.e., the region that contains the optimal solution, without significant computational overhead. We then extend the application of sampling-based algorithms to a class of stochastic optimal control problems and problems with differential constraints. Specifically, we introduce the Path Integral - RRT algorithm, for solving optimal control of stochastic systems and the CL-RRT# algorithm that uses closed-loop prediction for trajectory generation for differential systems. One of the key benefits of CL-RRT# is that for many systems, given a low-level tracking controller, it is easier to handle differential constraints, so complex steering procedures are not needed, unlike most existing kinodynamic sampling-based algorithms. Implementation results of sampling-based planners for route planning of a full-scale autonomous helicopter under the Autonomous Aerial Cargo/Utility System Program (AACUS) program are provided.Georgia Institute of TechnologyTsiotras, Panagiotis2016-01-07T17:22:27Z2016-01-07T17:22:27Z2015-122015-11-16December 20152016-01-07T17:22:27ZDissertationapplication/pdfhttp://hdl.handle.net/1853/54317en_US
collection NDLTD
language en_US
format Others
sources NDLTD
topic Robotic motion planning
Sampling-based algorithms
Rapidly-exploring random trees
Dynamic programming
Machine learning
Closed-loop prediction
High-level route planning
spellingShingle Robotic motion planning
Sampling-based algorithms
Rapidly-exploring random trees
Dynamic programming
Machine learning
Closed-loop prediction
High-level route planning
Arslan, Oktay
Machine learning and dynamic programming algorithms for motion planning and control
description Robot motion planning is one of the central problems in robotics, and has received considerable amount of attention not only from roboticists but also from the control and artificial intelligence (AI) communities. Despite the different types of applications and physical properties of robotic systems, many high-level tasks of autonomous systems can be decomposed into subtasks which require point-to-point navigation while avoiding infeasible regions due to the obstacles in the workspace. This dissertation aims at developing a new class of sampling-based motion planning algorithms that are fast, efficient and asymptotically optimal by employing ideas from Machine Learning (ML) and Dynamic Programming (DP). First, we interpret the robot motion planning problem as a form of a machine learning problem since the underlying search space is not known a priori, and utilize random geometric graphs to compute consistent discretizations of the underlying continuous search space. Then, we integrate existing DP algorithms and ML algorithms to the framework of sampling-based algorithms for better exploitation and exploration, respectively. We introduce a novel sampling-based algorithm, called RRT#, that improves upon the well-known RRT* algorithm by leveraging value and policy iteration methods as new information is collected. The proposed algorithms yield provable guarantees on correctness, completeness and asymptotic optimality. We also develop an adaptive sampling strategy by considering exploration as a classification (or regression) problem, and use online machine learning algorithms to learn the relevant region of a query, i.e., the region that contains the optimal solution, without significant computational overhead. We then extend the application of sampling-based algorithms to a class of stochastic optimal control problems and problems with differential constraints. Specifically, we introduce the Path Integral - RRT algorithm, for solving optimal control of stochastic systems and the CL-RRT# algorithm that uses closed-loop prediction for trajectory generation for differential systems. One of the key benefits of CL-RRT# is that for many systems, given a low-level tracking controller, it is easier to handle differential constraints, so complex steering procedures are not needed, unlike most existing kinodynamic sampling-based algorithms. Implementation results of sampling-based planners for route planning of a full-scale autonomous helicopter under the Autonomous Aerial Cargo/Utility System Program (AACUS) program are provided.
author2 Tsiotras, Panagiotis
author_facet Tsiotras, Panagiotis
Arslan, Oktay
author Arslan, Oktay
author_sort Arslan, Oktay
title Machine learning and dynamic programming algorithms for motion planning and control
title_short Machine learning and dynamic programming algorithms for motion planning and control
title_full Machine learning and dynamic programming algorithms for motion planning and control
title_fullStr Machine learning and dynamic programming algorithms for motion planning and control
title_full_unstemmed Machine learning and dynamic programming algorithms for motion planning and control
title_sort machine learning and dynamic programming algorithms for motion planning and control
publisher Georgia Institute of Technology
publishDate 2016
url http://hdl.handle.net/1853/54317
work_keys_str_mv AT arslanoktay machinelearninganddynamicprogrammingalgorithmsformotionplanningandcontrol
_version_ 1718161743125938176