3D Hand Pose Estimation via Graph-Based Reasoning

Hand pose estimation from a single depth image has recently received significant attention owing to its importance in many applications requiring human-computer interaction. The rapid progress of convolutional neural networks (CNNs) and technological advances in low-cost depth cameras have greatly i...

Full description

Bibliographic Details
Main Authors:	Jae-Hun Song, Suk-Ju Kang
Format:	Article
Language:	English
Published:	IEEE 2021-01-01
Series:	IEEE Access
Subjects:	3D hand pose estimation depth image graph convolutional network
Online Access:	https://ieeexplore.ieee.org/document/9361677/

id	doaj-963b1c2518884fc28461fa07d5ada81f
record_format	Article
spelling	doaj-963b1c2518884fc28461fa07d5ada81f2021-03-30T15:01:54ZengIEEEIEEE Access2169-35362021-01-019358243583310.1109/ACCESS.2021.306171693616773D Hand Pose Estimation via Graph-Based ReasoningJae-Hun Song0https://orcid.org/0000-0001-7180-2025Suk-Ju Kang1https://orcid.org/0000-0002-4809-956XDepartment of Electronic Engineering, Sogang University, Seoul, South KoreaDepartment of Electronic Engineering, Sogang University, Seoul, South KoreaHand pose estimation from a single depth image has recently received significant attention owing to its importance in many applications requiring human-computer interaction. The rapid progress of convolutional neural networks (CNNs) and technological advances in low-cost depth cameras have greatly improved the performance of the hand pose estimation method. Nevertheless, regressing joint coordinates is still a challenging task due to joint flexibility and self-occlusion. Previous hand pose estimation methods have limitations in relying on a deep and complex network structure without fully utilizing hand joint connections. A hand is an articulated object and consists of six parts that represent the palm and five fingers. The kinematic constraints can be obtained by modeling the dependency between adjacent joints. This paper proposes a novel CNN-based approach incorporating hand joint connections to features through both a global relation inference for the entire hand and local relation inference for each finger. Modeling the relations between the hand joints can alleviate critical problems for occlusion and self-similarity. We also present a hierarchical structure with six branches that independently estimate the position of the palm and five fingers by adding hand connections of each joint using graph reasoning based on graph convolutional networks. Experimental results on public hand pose datasets show that the proposed method outperforms previous state-of-the-art methods. Specifically, our method achieves the best accuracy compared to state-of-the-art methods on public datasets. In addition, the proposed method can be utilized for real-time applications with an execution speed of 103 fps in a single GPU environment.https://ieeexplore.ieee.org/document/9361677/3D hand pose estimationdepth imagegraph convolutional network
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Jae-Hun Song Suk-Ju Kang
spellingShingle	Jae-Hun Song Suk-Ju Kang 3D Hand Pose Estimation via Graph-Based Reasoning IEEE Access 3D hand pose estimation depth image graph convolutional network
author_facet	Jae-Hun Song Suk-Ju Kang
author_sort	Jae-Hun Song
title	3D Hand Pose Estimation via Graph-Based Reasoning
title_short	3D Hand Pose Estimation via Graph-Based Reasoning
title_full	3D Hand Pose Estimation via Graph-Based Reasoning
title_fullStr	3D Hand Pose Estimation via Graph-Based Reasoning
title_full_unstemmed	3D Hand Pose Estimation via Graph-Based Reasoning
title_sort	3d hand pose estimation via graph-based reasoning
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2021-01-01
description	Hand pose estimation from a single depth image has recently received significant attention owing to its importance in many applications requiring human-computer interaction. The rapid progress of convolutional neural networks (CNNs) and technological advances in low-cost depth cameras have greatly improved the performance of the hand pose estimation method. Nevertheless, regressing joint coordinates is still a challenging task due to joint flexibility and self-occlusion. Previous hand pose estimation methods have limitations in relying on a deep and complex network structure without fully utilizing hand joint connections. A hand is an articulated object and consists of six parts that represent the palm and five fingers. The kinematic constraints can be obtained by modeling the dependency between adjacent joints. This paper proposes a novel CNN-based approach incorporating hand joint connections to features through both a global relation inference for the entire hand and local relation inference for each finger. Modeling the relations between the hand joints can alleviate critical problems for occlusion and self-similarity. We also present a hierarchical structure with six branches that independently estimate the position of the palm and five fingers by adding hand connections of each joint using graph reasoning based on graph convolutional networks. Experimental results on public hand pose datasets show that the proposed method outperforms previous state-of-the-art methods. Specifically, our method achieves the best accuracy compared to state-of-the-art methods on public datasets. In addition, the proposed method can be utilized for real-time applications with an execution speed of 103 fps in a single GPU environment.
topic	3D hand pose estimation depth image graph convolutional network
url	https://ieeexplore.ieee.org/document/9361677/
work_keys_str_mv	AT jaehunsong 3dhandposeestimationviagraphbasedreasoning AT sukjukang 3dhandposeestimationviagraphbasedreasoning
_version_	1724180125051781120

3D Hand Pose Estimation via Graph-Based Reasoning

Similar Items