Machine Teaching for Human Inverse Reinforcement Learning

As robots continue to acquire useful skills, their ability to teach their expertise will provide humans the two-fold benefit of learning from robots and collaborating fluently with them. For example, robot tutors could teach handwriting to individual students and delivery robots could convey their n...

Full description

Bibliographic Details
Main Authors: Michael S. Lee, Henny Admoni, Reid Simmons
Format: Article
Language:English
Published: Frontiers Media S.A. 2021-06-01
Series:Frontiers in Robotics and AI
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/frobt.2021.693050/full
id doaj-d96fdfcda9934484b316e0d44cb3e40d
record_format Article
spelling doaj-d96fdfcda9934484b316e0d44cb3e40d2021-06-30T06:20:03ZengFrontiers Media S.A.Frontiers in Robotics and AI2296-91442021-06-01810.3389/frobt.2021.693050693050Machine Teaching for Human Inverse Reinforcement LearningMichael S. LeeHenny AdmoniReid SimmonsAs robots continue to acquire useful skills, their ability to teach their expertise will provide humans the two-fold benefit of learning from robots and collaborating fluently with them. For example, robot tutors could teach handwriting to individual students and delivery robots could convey their navigation conventions to better coordinate with nearby human workers. Because humans naturally communicate their behaviors through selective demonstrations, and comprehend others’ through reasoning that resembles inverse reinforcement learning (IRL), we propose a method of teaching humans based on demonstrations that are informative for IRL. But unlike prior work that optimizes solely for IRL, this paper incorporates various human teaching strategies (e.g. scaffolding, simplicity, pattern discovery, and testing) to better accommodate human learners. We assess our method with user studies and find that our measure of test difficulty corresponds well with human performance and confidence, and also find that favoring simplicity and pattern discovery increases human performance on difficult tests. However, we did not find a strong effect for our method of scaffolding, revealing shortcomings that indicate clear directions for future work.https://www.frontiersin.org/articles/10.3389/frobt.2021.693050/fullinverse reinforcement learninglearning from demonstrationscaffoldingpolicy summarizationmachine teaching
collection DOAJ
language English
format Article
sources DOAJ
author Michael S. Lee
Henny Admoni
Reid Simmons
spellingShingle Michael S. Lee
Henny Admoni
Reid Simmons
Machine Teaching for Human Inverse Reinforcement Learning
Frontiers in Robotics and AI
inverse reinforcement learning
learning from demonstration
scaffolding
policy summarization
machine teaching
author_facet Michael S. Lee
Henny Admoni
Reid Simmons
author_sort Michael S. Lee
title Machine Teaching for Human Inverse Reinforcement Learning
title_short Machine Teaching for Human Inverse Reinforcement Learning
title_full Machine Teaching for Human Inverse Reinforcement Learning
title_fullStr Machine Teaching for Human Inverse Reinforcement Learning
title_full_unstemmed Machine Teaching for Human Inverse Reinforcement Learning
title_sort machine teaching for human inverse reinforcement learning
publisher Frontiers Media S.A.
series Frontiers in Robotics and AI
issn 2296-9144
publishDate 2021-06-01
description As robots continue to acquire useful skills, their ability to teach their expertise will provide humans the two-fold benefit of learning from robots and collaborating fluently with them. For example, robot tutors could teach handwriting to individual students and delivery robots could convey their navigation conventions to better coordinate with nearby human workers. Because humans naturally communicate their behaviors through selective demonstrations, and comprehend others’ through reasoning that resembles inverse reinforcement learning (IRL), we propose a method of teaching humans based on demonstrations that are informative for IRL. But unlike prior work that optimizes solely for IRL, this paper incorporates various human teaching strategies (e.g. scaffolding, simplicity, pattern discovery, and testing) to better accommodate human learners. We assess our method with user studies and find that our measure of test difficulty corresponds well with human performance and confidence, and also find that favoring simplicity and pattern discovery increases human performance on difficult tests. However, we did not find a strong effect for our method of scaffolding, revealing shortcomings that indicate clear directions for future work.
topic inverse reinforcement learning
learning from demonstration
scaffolding
policy summarization
machine teaching
url https://www.frontiersin.org/articles/10.3389/frobt.2021.693050/full
work_keys_str_mv AT michaelslee machineteachingforhumaninversereinforcementlearning
AT hennyadmoni machineteachingforhumaninversereinforcementlearning
AT reidsimmons machineteachingforhumaninversereinforcementlearning
_version_ 1721353285715099648