Machine Teaching for Human Inverse Reinforcement Learning
As robots continue to acquire useful skills, their ability to teach their expertise will provide humans the two-fold benefit of learning from robots and collaborating fluently with them. For example, robot tutors could teach handwriting to individual students and delivery robots could convey their n...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2021-06-01
|
Series: | Frontiers in Robotics and AI |
Subjects: | |
Online Access: | https://www.frontiersin.org/articles/10.3389/frobt.2021.693050/full |
id |
doaj-d96fdfcda9934484b316e0d44cb3e40d |
---|---|
record_format |
Article |
spelling |
doaj-d96fdfcda9934484b316e0d44cb3e40d2021-06-30T06:20:03ZengFrontiers Media S.A.Frontiers in Robotics and AI2296-91442021-06-01810.3389/frobt.2021.693050693050Machine Teaching for Human Inverse Reinforcement LearningMichael S. LeeHenny AdmoniReid SimmonsAs robots continue to acquire useful skills, their ability to teach their expertise will provide humans the two-fold benefit of learning from robots and collaborating fluently with them. For example, robot tutors could teach handwriting to individual students and delivery robots could convey their navigation conventions to better coordinate with nearby human workers. Because humans naturally communicate their behaviors through selective demonstrations, and comprehend others’ through reasoning that resembles inverse reinforcement learning (IRL), we propose a method of teaching humans based on demonstrations that are informative for IRL. But unlike prior work that optimizes solely for IRL, this paper incorporates various human teaching strategies (e.g. scaffolding, simplicity, pattern discovery, and testing) to better accommodate human learners. We assess our method with user studies and find that our measure of test difficulty corresponds well with human performance and confidence, and also find that favoring simplicity and pattern discovery increases human performance on difficult tests. However, we did not find a strong effect for our method of scaffolding, revealing shortcomings that indicate clear directions for future work.https://www.frontiersin.org/articles/10.3389/frobt.2021.693050/fullinverse reinforcement learninglearning from demonstrationscaffoldingpolicy summarizationmachine teaching |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Michael S. Lee Henny Admoni Reid Simmons |
spellingShingle |
Michael S. Lee Henny Admoni Reid Simmons Machine Teaching for Human Inverse Reinforcement Learning Frontiers in Robotics and AI inverse reinforcement learning learning from demonstration scaffolding policy summarization machine teaching |
author_facet |
Michael S. Lee Henny Admoni Reid Simmons |
author_sort |
Michael S. Lee |
title |
Machine Teaching for Human Inverse Reinforcement Learning |
title_short |
Machine Teaching for Human Inverse Reinforcement Learning |
title_full |
Machine Teaching for Human Inverse Reinforcement Learning |
title_fullStr |
Machine Teaching for Human Inverse Reinforcement Learning |
title_full_unstemmed |
Machine Teaching for Human Inverse Reinforcement Learning |
title_sort |
machine teaching for human inverse reinforcement learning |
publisher |
Frontiers Media S.A. |
series |
Frontiers in Robotics and AI |
issn |
2296-9144 |
publishDate |
2021-06-01 |
description |
As robots continue to acquire useful skills, their ability to teach their expertise will provide humans the two-fold benefit of learning from robots and collaborating fluently with them. For example, robot tutors could teach handwriting to individual students and delivery robots could convey their navigation conventions to better coordinate with nearby human workers. Because humans naturally communicate their behaviors through selective demonstrations, and comprehend others’ through reasoning that resembles inverse reinforcement learning (IRL), we propose a method of teaching humans based on demonstrations that are informative for IRL. But unlike prior work that optimizes solely for IRL, this paper incorporates various human teaching strategies (e.g. scaffolding, simplicity, pattern discovery, and testing) to better accommodate human learners. We assess our method with user studies and find that our measure of test difficulty corresponds well with human performance and confidence, and also find that favoring simplicity and pattern discovery increases human performance on difficult tests. However, we did not find a strong effect for our method of scaffolding, revealing shortcomings that indicate clear directions for future work. |
topic |
inverse reinforcement learning learning from demonstration scaffolding policy summarization machine teaching |
url |
https://www.frontiersin.org/articles/10.3389/frobt.2021.693050/full |
work_keys_str_mv |
AT michaelslee machineteachingforhumaninversereinforcementlearning AT hennyadmoni machineteachingforhumaninversereinforcementlearning AT reidsimmons machineteachingforhumaninversereinforcementlearning |
_version_ |
1721353285715099648 |