3D Human Pose Estimation With Spatial Structure Information
Estimating 3D human poses from 2D poses is a challenging problem due to joints self-occlusion, weak generalization, and inherent ambiguity of recovering depth. Actually, there exists spatial structure dependence on human body key points which can be used to alleviate the problem of joints self-occlu...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2021-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9363893/ |
id |
doaj-89cf614a6bdd40ea8d6e3796a48bd5c8 |
---|---|
record_format |
Article |
spelling |
doaj-89cf614a6bdd40ea8d6e3796a48bd5c82021-04-05T17:37:36ZengIEEEIEEE Access2169-35362021-01-019359473595610.1109/ACCESS.2021.306242693638933D Human Pose Estimation With Spatial Structure InformationXiaoshan Huang0https://orcid.org/0000-0003-3360-855XJun Huang1https://orcid.org/0000-0003-4939-3880Zengming Tang2https://orcid.org/0000-0001-5485-1829Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai, ChinaShanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai, ChinaShanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai, ChinaEstimating 3D human poses from 2D poses is a challenging problem due to joints self-occlusion, weak generalization, and inherent ambiguity of recovering depth. Actually, there exists spatial structure dependence on human body key points which can be used to alleviate the problem of joints self-occlusion. Therefore, we represent human pose as a directed graph and propose a network implemented with graph convolution to predict 3D poses from the given 2D poses. In the digraph, we determine the connection weight of each edge according to the error distribution of joints estimation. This makes our model robust to noise. By optimizing coarse 3D estimation and adversarial learning, our algorithm can successfully improve the accuracy of estimation and relieve the ambiguity of mapping. Through testing on Human 3.6M and MPI-INF-3DHP datasets, we achieve excellent quantitative performance. More importantly, our algorithm also has a superior generalization to outdoor dataset MPII by the pre-training process.https://ieeexplore.ieee.org/document/9363893/3D human posesgraph convolutional networksadversarial learninggeometric priorsgradient vanishin-the-wild scenes |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Xiaoshan Huang Jun Huang Zengming Tang |
spellingShingle |
Xiaoshan Huang Jun Huang Zengming Tang 3D Human Pose Estimation With Spatial Structure Information IEEE Access 3D human poses graph convolutional networks adversarial learning geometric priors gradient vanish in-the-wild scenes |
author_facet |
Xiaoshan Huang Jun Huang Zengming Tang |
author_sort |
Xiaoshan Huang |
title |
3D Human Pose Estimation With Spatial Structure Information |
title_short |
3D Human Pose Estimation With Spatial Structure Information |
title_full |
3D Human Pose Estimation With Spatial Structure Information |
title_fullStr |
3D Human Pose Estimation With Spatial Structure Information |
title_full_unstemmed |
3D Human Pose Estimation With Spatial Structure Information |
title_sort |
3d human pose estimation with spatial structure information |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2021-01-01 |
description |
Estimating 3D human poses from 2D poses is a challenging problem due to joints self-occlusion, weak generalization, and inherent ambiguity of recovering depth. Actually, there exists spatial structure dependence on human body key points which can be used to alleviate the problem of joints self-occlusion. Therefore, we represent human pose as a directed graph and propose a network implemented with graph convolution to predict 3D poses from the given 2D poses. In the digraph, we determine the connection weight of each edge according to the error distribution of joints estimation. This makes our model robust to noise. By optimizing coarse 3D estimation and adversarial learning, our algorithm can successfully improve the accuracy of estimation and relieve the ambiguity of mapping. Through testing on Human 3.6M and MPI-INF-3DHP datasets, we achieve excellent quantitative performance. More importantly, our algorithm also has a superior generalization to outdoor dataset MPII by the pre-training process. |
topic |
3D human poses graph convolutional networks adversarial learning geometric priors gradient vanish in-the-wild scenes |
url |
https://ieeexplore.ieee.org/document/9363893/ |
work_keys_str_mv |
AT xiaoshanhuang 3dhumanposeestimationwithspatialstructureinformation AT junhuang 3dhumanposeestimationwithspatialstructureinformation AT zengmingtang 3dhumanposeestimationwithspatialstructureinformation |
_version_ |
1721539187012796416 |