Learning to stop: a unifying principle for legged locomotion in varying environments
Evolutionary studies have unequivocally proven the transition of living organisms from water to land. Consequently, it can be deduced that locomotion strategies must have evolved from one environment to the other. However, the mechanism by which this transition happened and its implications on bio-m...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
The Royal Society
2021-04-01
|
Series: | Royal Society Open Science |
Subjects: | |
Online Access: | https://royalsocietypublishing.org/doi/10.1098/rsos.210223 |
id |
doaj-bfb780533f4f48d58145528981769c12 |
---|---|
record_format |
Article |
spelling |
doaj-bfb780533f4f48d58145528981769c122021-06-10T08:57:26ZengThe Royal SocietyRoyal Society Open Science2054-57032021-04-018410.1098/rsos.210223Learning to stop: a unifying principle for legged locomotion in varying environmentsThomas George Thuruthel0G. Picardi1F. Iida2C. Laschi3M. Calisti4Bio-Inspired Robotics Laboratory, Department of Engineering, University of Cambridge, Cambridge, UKThe BioRobotics Institute, Scuola Superiore Sant’Anna, Pisa, ItalyBio-Inspired Robotics Laboratory, Department of Engineering, University of Cambridge, Cambridge, UKThe BioRobotics Institute, Scuola Superiore Sant’Anna, Pisa, ItalyThe BioRobotics Institute, Scuola Superiore Sant’Anna, Pisa, ItalyEvolutionary studies have unequivocally proven the transition of living organisms from water to land. Consequently, it can be deduced that locomotion strategies must have evolved from one environment to the other. However, the mechanism by which this transition happened and its implications on bio-mechanical studies and robotics research have not been explored in detail. This paper presents a unifying control strategy for locomotion in varying environments based on the principle of ‘learning to stop’. Using a common reinforcement learning framework, deep deterministic policy gradient, we show that our proposed learning strategy facilitates a fast and safe methodology for transferring learned controllers from the facile water environment to the harsh land environment. Our results not only propose a plausible mechanism for safe and quick transition of locomotion strategies from a water to land environment but also provide a novel alternative for safer and faster training of robots.https://royalsocietypublishing.org/doi/10.1098/rsos.210223locomotionmodellingreinforcement learning |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Thomas George Thuruthel G. Picardi F. Iida C. Laschi M. Calisti |
spellingShingle |
Thomas George Thuruthel G. Picardi F. Iida C. Laschi M. Calisti Learning to stop: a unifying principle for legged locomotion in varying environments Royal Society Open Science locomotion modelling reinforcement learning |
author_facet |
Thomas George Thuruthel G. Picardi F. Iida C. Laschi M. Calisti |
author_sort |
Thomas George Thuruthel |
title |
Learning to stop: a unifying principle for legged locomotion in varying environments |
title_short |
Learning to stop: a unifying principle for legged locomotion in varying environments |
title_full |
Learning to stop: a unifying principle for legged locomotion in varying environments |
title_fullStr |
Learning to stop: a unifying principle for legged locomotion in varying environments |
title_full_unstemmed |
Learning to stop: a unifying principle for legged locomotion in varying environments |
title_sort |
learning to stop: a unifying principle for legged locomotion in varying environments |
publisher |
The Royal Society |
series |
Royal Society Open Science |
issn |
2054-5703 |
publishDate |
2021-04-01 |
description |
Evolutionary studies have unequivocally proven the transition of living organisms from water to land. Consequently, it can be deduced that locomotion strategies must have evolved from one environment to the other. However, the mechanism by which this transition happened and its implications on bio-mechanical studies and robotics research have not been explored in detail. This paper presents a unifying control strategy for locomotion in varying environments based on the principle of ‘learning to stop’. Using a common reinforcement learning framework, deep deterministic policy gradient, we show that our proposed learning strategy facilitates a fast and safe methodology for transferring learned controllers from the facile water environment to the harsh land environment. Our results not only propose a plausible mechanism for safe and quick transition of locomotion strategies from a water to land environment but also provide a novel alternative for safer and faster training of robots. |
topic |
locomotion modelling reinforcement learning |
url |
https://royalsocietypublishing.org/doi/10.1098/rsos.210223 |
work_keys_str_mv |
AT thomasgeorgethuruthel learningtostopaunifyingprincipleforleggedlocomotioninvaryingenvironments AT gpicardi learningtostopaunifyingprincipleforleggedlocomotioninvaryingenvironments AT fiida learningtostopaunifyingprincipleforleggedlocomotioninvaryingenvironments AT claschi learningtostopaunifyingprincipleforleggedlocomotioninvaryingenvironments AT mcalisti learningtostopaunifyingprincipleforleggedlocomotioninvaryingenvironments |
_version_ |
1721385298733039616 |