Learning to stop: a unifying principle for legged locomotion in varying environments

Evolutionary studies have unequivocally proven the transition of living organisms from water to land. Consequently, it can be deduced that locomotion strategies must have evolved from one environment to the other. However, the mechanism by which this transition happened and its implications on bio-m...

Full description

Bibliographic Details
Main Authors: Thomas George Thuruthel, G. Picardi, F. Iida, C. Laschi, M. Calisti
Format: Article
Language:English
Published: The Royal Society 2021-04-01
Series:Royal Society Open Science
Subjects:
Online Access:https://royalsocietypublishing.org/doi/10.1098/rsos.210223
id doaj-bfb780533f4f48d58145528981769c12
record_format Article
spelling doaj-bfb780533f4f48d58145528981769c122021-06-10T08:57:26ZengThe Royal SocietyRoyal Society Open Science2054-57032021-04-018410.1098/rsos.210223Learning to stop: a unifying principle for legged locomotion in varying environmentsThomas George Thuruthel0G. Picardi1F. Iida2C. Laschi3M. Calisti4Bio-Inspired Robotics Laboratory, Department of Engineering, University of Cambridge, Cambridge, UKThe BioRobotics Institute, Scuola Superiore Sant’Anna, Pisa, ItalyBio-Inspired Robotics Laboratory, Department of Engineering, University of Cambridge, Cambridge, UKThe BioRobotics Institute, Scuola Superiore Sant’Anna, Pisa, ItalyThe BioRobotics Institute, Scuola Superiore Sant’Anna, Pisa, ItalyEvolutionary studies have unequivocally proven the transition of living organisms from water to land. Consequently, it can be deduced that locomotion strategies must have evolved from one environment to the other. However, the mechanism by which this transition happened and its implications on bio-mechanical studies and robotics research have not been explored in detail. This paper presents a unifying control strategy for locomotion in varying environments based on the principle of ‘learning to stop’. Using a common reinforcement learning framework, deep deterministic policy gradient, we show that our proposed learning strategy facilitates a fast and safe methodology for transferring learned controllers from the facile water environment to the harsh land environment. Our results not only propose a plausible mechanism for safe and quick transition of locomotion strategies from a water to land environment but also provide a novel alternative for safer and faster training of robots.https://royalsocietypublishing.org/doi/10.1098/rsos.210223locomotionmodellingreinforcement learning
collection DOAJ
language English
format Article
sources DOAJ
author Thomas George Thuruthel
G. Picardi
F. Iida
C. Laschi
M. Calisti
spellingShingle Thomas George Thuruthel
G. Picardi
F. Iida
C. Laschi
M. Calisti
Learning to stop: a unifying principle for legged locomotion in varying environments
Royal Society Open Science
locomotion
modelling
reinforcement learning
author_facet Thomas George Thuruthel
G. Picardi
F. Iida
C. Laschi
M. Calisti
author_sort Thomas George Thuruthel
title Learning to stop: a unifying principle for legged locomotion in varying environments
title_short Learning to stop: a unifying principle for legged locomotion in varying environments
title_full Learning to stop: a unifying principle for legged locomotion in varying environments
title_fullStr Learning to stop: a unifying principle for legged locomotion in varying environments
title_full_unstemmed Learning to stop: a unifying principle for legged locomotion in varying environments
title_sort learning to stop: a unifying principle for legged locomotion in varying environments
publisher The Royal Society
series Royal Society Open Science
issn 2054-5703
publishDate 2021-04-01
description Evolutionary studies have unequivocally proven the transition of living organisms from water to land. Consequently, it can be deduced that locomotion strategies must have evolved from one environment to the other. However, the mechanism by which this transition happened and its implications on bio-mechanical studies and robotics research have not been explored in detail. This paper presents a unifying control strategy for locomotion in varying environments based on the principle of ‘learning to stop’. Using a common reinforcement learning framework, deep deterministic policy gradient, we show that our proposed learning strategy facilitates a fast and safe methodology for transferring learned controllers from the facile water environment to the harsh land environment. Our results not only propose a plausible mechanism for safe and quick transition of locomotion strategies from a water to land environment but also provide a novel alternative for safer and faster training of robots.
topic locomotion
modelling
reinforcement learning
url https://royalsocietypublishing.org/doi/10.1098/rsos.210223
work_keys_str_mv AT thomasgeorgethuruthel learningtostopaunifyingprincipleforleggedlocomotioninvaryingenvironments
AT gpicardi learningtostopaunifyingprincipleforleggedlocomotioninvaryingenvironments
AT fiida learningtostopaunifyingprincipleforleggedlocomotioninvaryingenvironments
AT claschi learningtostopaunifyingprincipleforleggedlocomotioninvaryingenvironments
AT mcalisti learningtostopaunifyingprincipleforleggedlocomotioninvaryingenvironments
_version_ 1721385298733039616