Synthesizing Depth Hand Images with GANs and Style Transfer for Hand Pose Estimation

Hand pose estimation is a critical technology of computer vision and human-computer interaction. Deep-learning methods require a considerable amount of tagged data. Accordingly, numerous labeled training data are required. This paper aims to generate depth hand images. Given a ground-truth 3D hand p...

Full description

Bibliographic Details
Main Authors: Wangyong He, Zhongzhao Xie, Yongbo Li, Xinmei Wang, Wendi Cai
Format: Article
Language:English
Published: MDPI AG 2019-07-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/19/13/2919
id doaj-70ec9eac5ced4a42bbe0915aa17c58c3
record_format Article
spelling doaj-70ec9eac5ced4a42bbe0915aa17c58c32020-11-25T00:27:31ZengMDPI AGSensors1424-82202019-07-011913291910.3390/s19132919s19132919Synthesizing Depth Hand Images with GANs and Style Transfer for Hand Pose EstimationWangyong He0Zhongzhao Xie1Yongbo Li2Xinmei Wang3Wendi Cai4School of Automation, China University of Geosciences, Wuhan 430074, ChinaSchool of Automation, China University of Geosciences, Wuhan 430074, ChinaSchool of Automation, China University of Geosciences, Wuhan 430074, ChinaSchool of Automation, China University of Geosciences, Wuhan 430074, ChinaSchool of Automation, China University of Geosciences, Wuhan 430074, ChinaHand pose estimation is a critical technology of computer vision and human-computer interaction. Deep-learning methods require a considerable amount of tagged data. Accordingly, numerous labeled training data are required. This paper aims to generate depth hand images. Given a ground-truth 3D hand pose, the developed method can generate depth hand images. To be specific, a ground truth can be 3D hand poses with the hand structure contained, while the synthesized image has an identical size to that of the training image and a similar visual appearance to the training set. The developed method, inspired by the progress in the generative adversarial network (GAN) and image-style transfer, helps model the latent statistical relationship between the ground-truth hand pose and the corresponding depth hand image. The images synthesized using the developed method are demonstrated to be feasible for enhancing performance. On public hand pose datasets (NYU, MSRA, ICVL), comprehensive experiments prove that the developed method outperforms the existing works.https://www.mdpi.com/1424-8220/19/13/2919hand pose estimationgenerative adversarial networksstyle transferhuman-computer interactiondepth images
collection DOAJ
language English
format Article
sources DOAJ
author Wangyong He
Zhongzhao Xie
Yongbo Li
Xinmei Wang
Wendi Cai
spellingShingle Wangyong He
Zhongzhao Xie
Yongbo Li
Xinmei Wang
Wendi Cai
Synthesizing Depth Hand Images with GANs and Style Transfer for Hand Pose Estimation
Sensors
hand pose estimation
generative adversarial networks
style transfer
human-computer interaction
depth images
author_facet Wangyong He
Zhongzhao Xie
Yongbo Li
Xinmei Wang
Wendi Cai
author_sort Wangyong He
title Synthesizing Depth Hand Images with GANs and Style Transfer for Hand Pose Estimation
title_short Synthesizing Depth Hand Images with GANs and Style Transfer for Hand Pose Estimation
title_full Synthesizing Depth Hand Images with GANs and Style Transfer for Hand Pose Estimation
title_fullStr Synthesizing Depth Hand Images with GANs and Style Transfer for Hand Pose Estimation
title_full_unstemmed Synthesizing Depth Hand Images with GANs and Style Transfer for Hand Pose Estimation
title_sort synthesizing depth hand images with gans and style transfer for hand pose estimation
publisher MDPI AG
series Sensors
issn 1424-8220
publishDate 2019-07-01
description Hand pose estimation is a critical technology of computer vision and human-computer interaction. Deep-learning methods require a considerable amount of tagged data. Accordingly, numerous labeled training data are required. This paper aims to generate depth hand images. Given a ground-truth 3D hand pose, the developed method can generate depth hand images. To be specific, a ground truth can be 3D hand poses with the hand structure contained, while the synthesized image has an identical size to that of the training image and a similar visual appearance to the training set. The developed method, inspired by the progress in the generative adversarial network (GAN) and image-style transfer, helps model the latent statistical relationship between the ground-truth hand pose and the corresponding depth hand image. The images synthesized using the developed method are demonstrated to be feasible for enhancing performance. On public hand pose datasets (NYU, MSRA, ICVL), comprehensive experiments prove that the developed method outperforms the existing works.
topic hand pose estimation
generative adversarial networks
style transfer
human-computer interaction
depth images
url https://www.mdpi.com/1424-8220/19/13/2919
work_keys_str_mv AT wangyonghe synthesizingdepthhandimageswithgansandstyletransferforhandposeestimation
AT zhongzhaoxie synthesizingdepthhandimageswithgansandstyletransferforhandposeestimation
AT yongboli synthesizingdepthhandimageswithgansandstyletransferforhandposeestimation
AT xinmeiwang synthesizingdepthhandimageswithgansandstyletransferforhandposeestimation
AT wendicai synthesizingdepthhandimageswithgansandstyletransferforhandposeestimation
_version_ 1725339441668554752