3D Human Pose Estimation With Spatial Structure Information

Estimating 3D human poses from 2D poses is a challenging problem due to joints self-occlusion, weak generalization, and inherent ambiguity of recovering depth. Actually, there exists spatial structure dependence on human body key points which can be used to alleviate the problem of joints self-occlu...

Full description

Bibliographic Details
Main Authors: Xiaoshan Huang, Jun Huang, Zengming Tang
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9363893/
id doaj-89cf614a6bdd40ea8d6e3796a48bd5c8
record_format Article
spelling doaj-89cf614a6bdd40ea8d6e3796a48bd5c82021-04-05T17:37:36ZengIEEEIEEE Access2169-35362021-01-019359473595610.1109/ACCESS.2021.306242693638933D Human Pose Estimation With Spatial Structure InformationXiaoshan Huang0https://orcid.org/0000-0003-3360-855XJun Huang1https://orcid.org/0000-0003-4939-3880Zengming Tang2https://orcid.org/0000-0001-5485-1829Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai, ChinaShanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai, ChinaShanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai, ChinaEstimating 3D human poses from 2D poses is a challenging problem due to joints self-occlusion, weak generalization, and inherent ambiguity of recovering depth. Actually, there exists spatial structure dependence on human body key points which can be used to alleviate the problem of joints self-occlusion. Therefore, we represent human pose as a directed graph and propose a network implemented with graph convolution to predict 3D poses from the given 2D poses. In the digraph, we determine the connection weight of each edge according to the error distribution of joints estimation. This makes our model robust to noise. By optimizing coarse 3D estimation and adversarial learning, our algorithm can successfully improve the accuracy of estimation and relieve the ambiguity of mapping. Through testing on Human 3.6M and MPI-INF-3DHP datasets, we achieve excellent quantitative performance. More importantly, our algorithm also has a superior generalization to outdoor dataset MPII by the pre-training process.https://ieeexplore.ieee.org/document/9363893/3D human posesgraph convolutional networksadversarial learninggeometric priorsgradient vanishin-the-wild scenes
collection DOAJ
language English
format Article
sources DOAJ
author Xiaoshan Huang
Jun Huang
Zengming Tang
spellingShingle Xiaoshan Huang
Jun Huang
Zengming Tang
3D Human Pose Estimation With Spatial Structure Information
IEEE Access
3D human poses
graph convolutional networks
adversarial learning
geometric priors
gradient vanish
in-the-wild scenes
author_facet Xiaoshan Huang
Jun Huang
Zengming Tang
author_sort Xiaoshan Huang
title 3D Human Pose Estimation With Spatial Structure Information
title_short 3D Human Pose Estimation With Spatial Structure Information
title_full 3D Human Pose Estimation With Spatial Structure Information
title_fullStr 3D Human Pose Estimation With Spatial Structure Information
title_full_unstemmed 3D Human Pose Estimation With Spatial Structure Information
title_sort 3d human pose estimation with spatial structure information
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description Estimating 3D human poses from 2D poses is a challenging problem due to joints self-occlusion, weak generalization, and inherent ambiguity of recovering depth. Actually, there exists spatial structure dependence on human body key points which can be used to alleviate the problem of joints self-occlusion. Therefore, we represent human pose as a directed graph and propose a network implemented with graph convolution to predict 3D poses from the given 2D poses. In the digraph, we determine the connection weight of each edge according to the error distribution of joints estimation. This makes our model robust to noise. By optimizing coarse 3D estimation and adversarial learning, our algorithm can successfully improve the accuracy of estimation and relieve the ambiguity of mapping. Through testing on Human 3.6M and MPI-INF-3DHP datasets, we achieve excellent quantitative performance. More importantly, our algorithm also has a superior generalization to outdoor dataset MPII by the pre-training process.
topic 3D human poses
graph convolutional networks
adversarial learning
geometric priors
gradient vanish
in-the-wild scenes
url https://ieeexplore.ieee.org/document/9363893/
work_keys_str_mv AT xiaoshanhuang 3dhumanposeestimationwithspatialstructureinformation
AT junhuang 3dhumanposeestimationwithspatialstructureinformation
AT zengmingtang 3dhumanposeestimationwithspatialstructureinformation
_version_ 1721539187012796416