3D Human Pose Estimation With Spatial Structure Information

Estimating 3D human poses from 2D poses is a challenging problem due to joints self-occlusion, weak generalization, and inherent ambiguity of recovering depth. Actually, there exists spatial structure dependence on human body key points which can be used to alleviate the problem of joints self-occlu...

Full description

Bibliographic Details
Main Authors:	Xiaoshan Huang, Jun Huang, Zengming Tang
Format:	Article
Language:	English
Published:	IEEE 2021-01-01
Series:	IEEE Access
Subjects:	3D human poses graph convolutional networks adversarial learning geometric priors gradient vanish in-the-wild scenes
Online Access:	https://ieeexplore.ieee.org/document/9363893/

id	doaj-89cf614a6bdd40ea8d6e3796a48bd5c8
record_format	Article
spelling	doaj-89cf614a6bdd40ea8d6e3796a48bd5c82021-04-05T17:37:36ZengIEEEIEEE Access2169-35362021-01-019359473595610.1109/ACCESS.2021.306242693638933D Human Pose Estimation With Spatial Structure InformationXiaoshan Huang0https://orcid.org/0000-0003-3360-855XJun Huang1https://orcid.org/0000-0003-4939-3880Zengming Tang2https://orcid.org/0000-0001-5485-1829Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai, ChinaShanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai, ChinaShanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai, ChinaEstimating 3D human poses from 2D poses is a challenging problem due to joints self-occlusion, weak generalization, and inherent ambiguity of recovering depth. Actually, there exists spatial structure dependence on human body key points which can be used to alleviate the problem of joints self-occlusion. Therefore, we represent human pose as a directed graph and propose a network implemented with graph convolution to predict 3D poses from the given 2D poses. In the digraph, we determine the connection weight of each edge according to the error distribution of joints estimation. This makes our model robust to noise. By optimizing coarse 3D estimation and adversarial learning, our algorithm can successfully improve the accuracy of estimation and relieve the ambiguity of mapping. Through testing on Human 3.6M and MPI-INF-3DHP datasets, we achieve excellent quantitative performance. More importantly, our algorithm also has a superior generalization to outdoor dataset MPII by the pre-training process.https://ieeexplore.ieee.org/document/9363893/3D human posesgraph convolutional networksadversarial learninggeometric priorsgradient vanishin-the-wild scenes
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Xiaoshan Huang Jun Huang Zengming Tang
spellingShingle	Xiaoshan Huang Jun Huang Zengming Tang 3D Human Pose Estimation With Spatial Structure Information IEEE Access 3D human poses graph convolutional networks adversarial learning geometric priors gradient vanish in-the-wild scenes
author_facet	Xiaoshan Huang Jun Huang Zengming Tang
author_sort	Xiaoshan Huang
title	3D Human Pose Estimation With Spatial Structure Information
title_short	3D Human Pose Estimation With Spatial Structure Information
title_full	3D Human Pose Estimation With Spatial Structure Information
title_fullStr	3D Human Pose Estimation With Spatial Structure Information
title_full_unstemmed	3D Human Pose Estimation With Spatial Structure Information
title_sort	3d human pose estimation with spatial structure information
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2021-01-01
description	Estimating 3D human poses from 2D poses is a challenging problem due to joints self-occlusion, weak generalization, and inherent ambiguity of recovering depth. Actually, there exists spatial structure dependence on human body key points which can be used to alleviate the problem of joints self-occlusion. Therefore, we represent human pose as a directed graph and propose a network implemented with graph convolution to predict 3D poses from the given 2D poses. In the digraph, we determine the connection weight of each edge according to the error distribution of joints estimation. This makes our model robust to noise. By optimizing coarse 3D estimation and adversarial learning, our algorithm can successfully improve the accuracy of estimation and relieve the ambiguity of mapping. Through testing on Human 3.6M and MPI-INF-3DHP datasets, we achieve excellent quantitative performance. More importantly, our algorithm also has a superior generalization to outdoor dataset MPII by the pre-training process.
topic	3D human poses graph convolutional networks adversarial learning geometric priors gradient vanish in-the-wild scenes
url	https://ieeexplore.ieee.org/document/9363893/
work_keys_str_mv	AT xiaoshanhuang 3dhumanposeestimationwithspatialstructureinformation AT junhuang 3dhumanposeestimationwithspatialstructureinformation AT zengmingtang 3dhumanposeestimationwithspatialstructureinformation
_version_	1721539187012796416

3D Human Pose Estimation With Spatial Structure Information

Similar Items