Skill Learning by Autonomous Robotic Playing Using Active Learning and Exploratory Behavior Composition
We consider the problem of autonomous acquisition of manipulation skills where problem-solving strategies are initially available only for a narrow range of situations. We propose to extend the range of solvable situations by autonomous play with the object. By applying previously-trained skills and...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2020-04-01
|
Series: | Frontiers in Robotics and AI |
Subjects: | |
Online Access: | https://www.frontiersin.org/article/10.3389/frobt.2020.00042/full |
id |
doaj-5a91e6e4ed3a4c8eb5b2b0ed9acd922c |
---|---|
record_format |
Article |
spelling |
doaj-5a91e6e4ed3a4c8eb5b2b0ed9acd922c2020-11-25T01:31:37ZengFrontiers Media S.A.Frontiers in Robotics and AI2296-91442020-04-01710.3389/frobt.2020.00042467223Skill Learning by Autonomous Robotic Playing Using Active Learning and Exploratory Behavior CompositionSimon Hangl0Vedran Dunjko1Hans J. Briegel2Justus Piater3Intelligent and Interactive Systems, Department of Informatics, University of Innsbruck, Innsbruck, AustriaLIACS, Leiden University, Leiden, NetherlandsInstitute for Theoretical Physics, University of Innsbruck, Innsbruck, AustriaIntelligent and Interactive Systems, Department of Informatics, University of Innsbruck, Innsbruck, AustriaWe consider the problem of autonomous acquisition of manipulation skills where problem-solving strategies are initially available only for a narrow range of situations. We propose to extend the range of solvable situations by autonomous play with the object. By applying previously-trained skills and behaviors, the robot learns how to prepare situations for which a successful strategy is already known. The information gathered during autonomous play is additionally used to train an environment model. This model is exploited for active learning and the generation of novel preparatory behaviors compositions. We apply our approach to a wide range of different manipulation tasks, e.g., book grasping, grasping of objects of different sizes by selecting different grasping strategies, placement on shelves, and tower disassembly. We show that the composite behavior generation mechanism enables the robot to solve previously-unsolvable tasks, e.g., tower disassembly. We use success statistics gained during real-world experiments to simulate the convergence behavior of our system. Simulation experiments show that the learning speed can be improved by around 30% by using active learning.https://www.frontiersin.org/article/10.3389/frobt.2020.00042/fullactive learninghierarchical modelsskill learningreinforcement learningautonomous roboticsrobotic manipulation |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Simon Hangl Vedran Dunjko Hans J. Briegel Justus Piater |
spellingShingle |
Simon Hangl Vedran Dunjko Hans J. Briegel Justus Piater Skill Learning by Autonomous Robotic Playing Using Active Learning and Exploratory Behavior Composition Frontiers in Robotics and AI active learning hierarchical models skill learning reinforcement learning autonomous robotics robotic manipulation |
author_facet |
Simon Hangl Vedran Dunjko Hans J. Briegel Justus Piater |
author_sort |
Simon Hangl |
title |
Skill Learning by Autonomous Robotic Playing Using Active Learning and Exploratory Behavior Composition |
title_short |
Skill Learning by Autonomous Robotic Playing Using Active Learning and Exploratory Behavior Composition |
title_full |
Skill Learning by Autonomous Robotic Playing Using Active Learning and Exploratory Behavior Composition |
title_fullStr |
Skill Learning by Autonomous Robotic Playing Using Active Learning and Exploratory Behavior Composition |
title_full_unstemmed |
Skill Learning by Autonomous Robotic Playing Using Active Learning and Exploratory Behavior Composition |
title_sort |
skill learning by autonomous robotic playing using active learning and exploratory behavior composition |
publisher |
Frontiers Media S.A. |
series |
Frontiers in Robotics and AI |
issn |
2296-9144 |
publishDate |
2020-04-01 |
description |
We consider the problem of autonomous acquisition of manipulation skills where problem-solving strategies are initially available only for a narrow range of situations. We propose to extend the range of solvable situations by autonomous play with the object. By applying previously-trained skills and behaviors, the robot learns how to prepare situations for which a successful strategy is already known. The information gathered during autonomous play is additionally used to train an environment model. This model is exploited for active learning and the generation of novel preparatory behaviors compositions. We apply our approach to a wide range of different manipulation tasks, e.g., book grasping, grasping of objects of different sizes by selecting different grasping strategies, placement on shelves, and tower disassembly. We show that the composite behavior generation mechanism enables the robot to solve previously-unsolvable tasks, e.g., tower disassembly. We use success statistics gained during real-world experiments to simulate the convergence behavior of our system. Simulation experiments show that the learning speed can be improved by around 30% by using active learning. |
topic |
active learning hierarchical models skill learning reinforcement learning autonomous robotics robotic manipulation |
url |
https://www.frontiersin.org/article/10.3389/frobt.2020.00042/full |
work_keys_str_mv |
AT simonhangl skilllearningbyautonomousroboticplayingusingactivelearningandexploratorybehaviorcomposition AT vedrandunjko skilllearningbyautonomousroboticplayingusingactivelearningandexploratorybehaviorcomposition AT hansjbriegel skilllearningbyautonomousroboticplayingusingactivelearningandexploratorybehaviorcomposition AT justuspiater skilllearningbyautonomousroboticplayingusingactivelearningandexploratorybehaviorcomposition |
_version_ |
1725085614050639872 |