Dynamic walking control of humanoid robots combining linear inverted pendulum mode with parameter optimization

To improve the robustness of biped walking, a model parameters optimization method based on policy gradient decent learning is presented. For the linear inverted pendulum mode-based model parameters optimization, firstly, select the input parameters of the inverted pendulum model and the torso attit...

Full description

Bibliographic Details
Main Authors: Chengju Liu, Jing Ning, Qijun Chen
Format: Article
Language:English
Published: SAGE Publishing 2018-01-01
Series:International Journal of Advanced Robotic Systems
Online Access:https://doi.org/10.1177/1729881417749672
id doaj-cc844a981b5949eda443aa588158f692
record_format Article
spelling doaj-cc844a981b5949eda443aa588158f6922020-11-25T03:17:35ZengSAGE PublishingInternational Journal of Advanced Robotic Systems1729-88142018-01-011510.1177/1729881417749672Dynamic walking control of humanoid robots combining linear inverted pendulum mode with parameter optimizationChengju LiuJing NingQijun ChenTo improve the robustness of biped walking, a model parameters optimization method based on policy gradient decent learning is presented. For the linear inverted pendulum mode-based model parameters optimization, firstly, select the input parameters of the inverted pendulum model and the torso attitude parameters of the robot as the correction variables and establish the correction equation. Then, using the tracking errors of center of mass (CoM) of the robot and the errors of the robot posture relative to the upright state of the body to establish the fitness function. According to the fitness function, the gain coefficients in the model parameters correction equation are optimized by using the strategy gradient learning method, and the modified gain parameters are substituted into the model parameters correction equation to obtain the correction amount. By applying the model parameters optimization strategy, the robot can quickly and in real time adjust the body posture and walking patterns under unknown disturbances, hence, the walking robustness can be enhanced. Simulation and experiments on a full-body humanoid robot NAO validate the effectiveness of the proposed method. The experiments show that the optimized model yields a more controlled, robust walk on NAO robot and on various surfaces without additional manual parameters tuning.https://doi.org/10.1177/1729881417749672
collection DOAJ
language English
format Article
sources DOAJ
author Chengju Liu
Jing Ning
Qijun Chen
spellingShingle Chengju Liu
Jing Ning
Qijun Chen
Dynamic walking control of humanoid robots combining linear inverted pendulum mode with parameter optimization
International Journal of Advanced Robotic Systems
author_facet Chengju Liu
Jing Ning
Qijun Chen
author_sort Chengju Liu
title Dynamic walking control of humanoid robots combining linear inverted pendulum mode with parameter optimization
title_short Dynamic walking control of humanoid robots combining linear inverted pendulum mode with parameter optimization
title_full Dynamic walking control of humanoid robots combining linear inverted pendulum mode with parameter optimization
title_fullStr Dynamic walking control of humanoid robots combining linear inverted pendulum mode with parameter optimization
title_full_unstemmed Dynamic walking control of humanoid robots combining linear inverted pendulum mode with parameter optimization
title_sort dynamic walking control of humanoid robots combining linear inverted pendulum mode with parameter optimization
publisher SAGE Publishing
series International Journal of Advanced Robotic Systems
issn 1729-8814
publishDate 2018-01-01
description To improve the robustness of biped walking, a model parameters optimization method based on policy gradient decent learning is presented. For the linear inverted pendulum mode-based model parameters optimization, firstly, select the input parameters of the inverted pendulum model and the torso attitude parameters of the robot as the correction variables and establish the correction equation. Then, using the tracking errors of center of mass (CoM) of the robot and the errors of the robot posture relative to the upright state of the body to establish the fitness function. According to the fitness function, the gain coefficients in the model parameters correction equation are optimized by using the strategy gradient learning method, and the modified gain parameters are substituted into the model parameters correction equation to obtain the correction amount. By applying the model parameters optimization strategy, the robot can quickly and in real time adjust the body posture and walking patterns under unknown disturbances, hence, the walking robustness can be enhanced. Simulation and experiments on a full-body humanoid robot NAO validate the effectiveness of the proposed method. The experiments show that the optimized model yields a more controlled, robust walk on NAO robot and on various surfaces without additional manual parameters tuning.
url https://doi.org/10.1177/1729881417749672
work_keys_str_mv AT chengjuliu dynamicwalkingcontrolofhumanoidrobotscombininglinearinvertedpendulummodewithparameteroptimization
AT jingning dynamicwalkingcontrolofhumanoidrobotscombininglinearinvertedpendulummodewithparameteroptimization
AT qijunchen dynamicwalkingcontrolofhumanoidrobotscombininglinearinvertedpendulummodewithparameteroptimization
_version_ 1724631302807748608