Skill Learning by Autonomous Robotic Playing Using Active Learning and Exploratory Behavior Composition

We consider the problem of autonomous acquisition of manipulation skills where problem-solving strategies are initially available only for a narrow range of situations. We propose to extend the range of solvable situations by autonomous play with the object. By applying previously-trained skills and...

Full description

Bibliographic Details
Main Authors: Simon Hangl, Vedran Dunjko, Hans J. Briegel, Justus Piater
Format: Article
Language:English
Published: Frontiers Media S.A. 2020-04-01
Series:Frontiers in Robotics and AI
Subjects:
Online Access:https://www.frontiersin.org/article/10.3389/frobt.2020.00042/full
id doaj-5a91e6e4ed3a4c8eb5b2b0ed9acd922c
record_format Article
spelling doaj-5a91e6e4ed3a4c8eb5b2b0ed9acd922c2020-11-25T01:31:37ZengFrontiers Media S.A.Frontiers in Robotics and AI2296-91442020-04-01710.3389/frobt.2020.00042467223Skill Learning by Autonomous Robotic Playing Using Active Learning and Exploratory Behavior CompositionSimon Hangl0Vedran Dunjko1Hans J. Briegel2Justus Piater3Intelligent and Interactive Systems, Department of Informatics, University of Innsbruck, Innsbruck, AustriaLIACS, Leiden University, Leiden, NetherlandsInstitute for Theoretical Physics, University of Innsbruck, Innsbruck, AustriaIntelligent and Interactive Systems, Department of Informatics, University of Innsbruck, Innsbruck, AustriaWe consider the problem of autonomous acquisition of manipulation skills where problem-solving strategies are initially available only for a narrow range of situations. We propose to extend the range of solvable situations by autonomous play with the object. By applying previously-trained skills and behaviors, the robot learns how to prepare situations for which a successful strategy is already known. The information gathered during autonomous play is additionally used to train an environment model. This model is exploited for active learning and the generation of novel preparatory behaviors compositions. We apply our approach to a wide range of different manipulation tasks, e.g., book grasping, grasping of objects of different sizes by selecting different grasping strategies, placement on shelves, and tower disassembly. We show that the composite behavior generation mechanism enables the robot to solve previously-unsolvable tasks, e.g., tower disassembly. We use success statistics gained during real-world experiments to simulate the convergence behavior of our system. Simulation experiments show that the learning speed can be improved by around 30% by using active learning.https://www.frontiersin.org/article/10.3389/frobt.2020.00042/fullactive learninghierarchical modelsskill learningreinforcement learningautonomous roboticsrobotic manipulation
collection DOAJ
language English
format Article
sources DOAJ
author Simon Hangl
Vedran Dunjko
Hans J. Briegel
Justus Piater
spellingShingle Simon Hangl
Vedran Dunjko
Hans J. Briegel
Justus Piater
Skill Learning by Autonomous Robotic Playing Using Active Learning and Exploratory Behavior Composition
Frontiers in Robotics and AI
active learning
hierarchical models
skill learning
reinforcement learning
autonomous robotics
robotic manipulation
author_facet Simon Hangl
Vedran Dunjko
Hans J. Briegel
Justus Piater
author_sort Simon Hangl
title Skill Learning by Autonomous Robotic Playing Using Active Learning and Exploratory Behavior Composition
title_short Skill Learning by Autonomous Robotic Playing Using Active Learning and Exploratory Behavior Composition
title_full Skill Learning by Autonomous Robotic Playing Using Active Learning and Exploratory Behavior Composition
title_fullStr Skill Learning by Autonomous Robotic Playing Using Active Learning and Exploratory Behavior Composition
title_full_unstemmed Skill Learning by Autonomous Robotic Playing Using Active Learning and Exploratory Behavior Composition
title_sort skill learning by autonomous robotic playing using active learning and exploratory behavior composition
publisher Frontiers Media S.A.
series Frontiers in Robotics and AI
issn 2296-9144
publishDate 2020-04-01
description We consider the problem of autonomous acquisition of manipulation skills where problem-solving strategies are initially available only for a narrow range of situations. We propose to extend the range of solvable situations by autonomous play with the object. By applying previously-trained skills and behaviors, the robot learns how to prepare situations for which a successful strategy is already known. The information gathered during autonomous play is additionally used to train an environment model. This model is exploited for active learning and the generation of novel preparatory behaviors compositions. We apply our approach to a wide range of different manipulation tasks, e.g., book grasping, grasping of objects of different sizes by selecting different grasping strategies, placement on shelves, and tower disassembly. We show that the composite behavior generation mechanism enables the robot to solve previously-unsolvable tasks, e.g., tower disassembly. We use success statistics gained during real-world experiments to simulate the convergence behavior of our system. Simulation experiments show that the learning speed can be improved by around 30% by using active learning.
topic active learning
hierarchical models
skill learning
reinforcement learning
autonomous robotics
robotic manipulation
url https://www.frontiersin.org/article/10.3389/frobt.2020.00042/full
work_keys_str_mv AT simonhangl skilllearningbyautonomousroboticplayingusingactivelearningandexploratorybehaviorcomposition
AT vedrandunjko skilllearningbyautonomousroboticplayingusingactivelearningandexploratorybehaviorcomposition
AT hansjbriegel skilllearningbyautonomousroboticplayingusingactivelearningandexploratorybehaviorcomposition
AT justuspiater skilllearningbyautonomousroboticplayingusingactivelearningandexploratorybehaviorcomposition
_version_ 1725085614050639872