Dynamic walking control of humanoid robots combining linear inverted pendulum mode with parameter optimization
To improve the robustness of biped walking, a model parameters optimization method based on policy gradient decent learning is presented. For the linear inverted pendulum mode-based model parameters optimization, firstly, select the input parameters of the inverted pendulum model and the torso attit...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
SAGE Publishing
2018-01-01
|
Series: | International Journal of Advanced Robotic Systems |
Online Access: | https://doi.org/10.1177/1729881417749672 |
id |
doaj-cc844a981b5949eda443aa588158f692 |
---|---|
record_format |
Article |
spelling |
doaj-cc844a981b5949eda443aa588158f6922020-11-25T03:17:35ZengSAGE PublishingInternational Journal of Advanced Robotic Systems1729-88142018-01-011510.1177/1729881417749672Dynamic walking control of humanoid robots combining linear inverted pendulum mode with parameter optimizationChengju LiuJing NingQijun ChenTo improve the robustness of biped walking, a model parameters optimization method based on policy gradient decent learning is presented. For the linear inverted pendulum mode-based model parameters optimization, firstly, select the input parameters of the inverted pendulum model and the torso attitude parameters of the robot as the correction variables and establish the correction equation. Then, using the tracking errors of center of mass (CoM) of the robot and the errors of the robot posture relative to the upright state of the body to establish the fitness function. According to the fitness function, the gain coefficients in the model parameters correction equation are optimized by using the strategy gradient learning method, and the modified gain parameters are substituted into the model parameters correction equation to obtain the correction amount. By applying the model parameters optimization strategy, the robot can quickly and in real time adjust the body posture and walking patterns under unknown disturbances, hence, the walking robustness can be enhanced. Simulation and experiments on a full-body humanoid robot NAO validate the effectiveness of the proposed method. The experiments show that the optimized model yields a more controlled, robust walk on NAO robot and on various surfaces without additional manual parameters tuning.https://doi.org/10.1177/1729881417749672 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Chengju Liu Jing Ning Qijun Chen |
spellingShingle |
Chengju Liu Jing Ning Qijun Chen Dynamic walking control of humanoid robots combining linear inverted pendulum mode with parameter optimization International Journal of Advanced Robotic Systems |
author_facet |
Chengju Liu Jing Ning Qijun Chen |
author_sort |
Chengju Liu |
title |
Dynamic walking control of humanoid robots combining linear inverted pendulum
mode with parameter optimization |
title_short |
Dynamic walking control of humanoid robots combining linear inverted pendulum
mode with parameter optimization |
title_full |
Dynamic walking control of humanoid robots combining linear inverted pendulum
mode with parameter optimization |
title_fullStr |
Dynamic walking control of humanoid robots combining linear inverted pendulum
mode with parameter optimization |
title_full_unstemmed |
Dynamic walking control of humanoid robots combining linear inverted pendulum
mode with parameter optimization |
title_sort |
dynamic walking control of humanoid robots combining linear inverted pendulum
mode with parameter optimization |
publisher |
SAGE Publishing |
series |
International Journal of Advanced Robotic Systems |
issn |
1729-8814 |
publishDate |
2018-01-01 |
description |
To improve the robustness of biped walking, a model parameters optimization method based on policy gradient decent learning is presented. For the linear inverted pendulum mode-based model parameters optimization, firstly, select the input parameters of the inverted pendulum model and the torso attitude parameters of the robot as the correction variables and establish the correction equation. Then, using the tracking errors of center of mass (CoM) of the robot and the errors of the robot posture relative to the upright state of the body to establish the fitness function. According to the fitness function, the gain coefficients in the model parameters correction equation are optimized by using the strategy gradient learning method, and the modified gain parameters are substituted into the model parameters correction equation to obtain the correction amount. By applying the model parameters optimization strategy, the robot can quickly and in real time adjust the body posture and walking patterns under unknown disturbances, hence, the walking robustness can be enhanced. Simulation and experiments on a full-body humanoid robot NAO validate the effectiveness of the proposed method. The experiments show that the optimized model yields a more controlled, robust walk on NAO robot and on various surfaces without additional manual parameters tuning. |
url |
https://doi.org/10.1177/1729881417749672 |
work_keys_str_mv |
AT chengjuliu dynamicwalkingcontrolofhumanoidrobotscombininglinearinvertedpendulummodewithparameteroptimization AT jingning dynamicwalkingcontrolofhumanoidrobotscombininglinearinvertedpendulummodewithparameteroptimization AT qijunchen dynamicwalkingcontrolofhumanoidrobotscombininglinearinvertedpendulummodewithparameteroptimization |
_version_ |
1724631302807748608 |