Synthesizing Depth Hand Images with GANs and Style Transfer for Hand Pose Estimation
Hand pose estimation is a critical technology of computer vision and human-computer interaction. Deep-learning methods require a considerable amount of tagged data. Accordingly, numerous labeled training data are required. This paper aims to generate depth hand images. Given a ground-truth 3D hand p...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2019-07-01
|
Series: | Sensors |
Subjects: | |
Online Access: | https://www.mdpi.com/1424-8220/19/13/2919 |
id |
doaj-70ec9eac5ced4a42bbe0915aa17c58c3 |
---|---|
record_format |
Article |
spelling |
doaj-70ec9eac5ced4a42bbe0915aa17c58c32020-11-25T00:27:31ZengMDPI AGSensors1424-82202019-07-011913291910.3390/s19132919s19132919Synthesizing Depth Hand Images with GANs and Style Transfer for Hand Pose EstimationWangyong He0Zhongzhao Xie1Yongbo Li2Xinmei Wang3Wendi Cai4School of Automation, China University of Geosciences, Wuhan 430074, ChinaSchool of Automation, China University of Geosciences, Wuhan 430074, ChinaSchool of Automation, China University of Geosciences, Wuhan 430074, ChinaSchool of Automation, China University of Geosciences, Wuhan 430074, ChinaSchool of Automation, China University of Geosciences, Wuhan 430074, ChinaHand pose estimation is a critical technology of computer vision and human-computer interaction. Deep-learning methods require a considerable amount of tagged data. Accordingly, numerous labeled training data are required. This paper aims to generate depth hand images. Given a ground-truth 3D hand pose, the developed method can generate depth hand images. To be specific, a ground truth can be 3D hand poses with the hand structure contained, while the synthesized image has an identical size to that of the training image and a similar visual appearance to the training set. The developed method, inspired by the progress in the generative adversarial network (GAN) and image-style transfer, helps model the latent statistical relationship between the ground-truth hand pose and the corresponding depth hand image. The images synthesized using the developed method are demonstrated to be feasible for enhancing performance. On public hand pose datasets (NYU, MSRA, ICVL), comprehensive experiments prove that the developed method outperforms the existing works.https://www.mdpi.com/1424-8220/19/13/2919hand pose estimationgenerative adversarial networksstyle transferhuman-computer interactiondepth images |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Wangyong He Zhongzhao Xie Yongbo Li Xinmei Wang Wendi Cai |
spellingShingle |
Wangyong He Zhongzhao Xie Yongbo Li Xinmei Wang Wendi Cai Synthesizing Depth Hand Images with GANs and Style Transfer for Hand Pose Estimation Sensors hand pose estimation generative adversarial networks style transfer human-computer interaction depth images |
author_facet |
Wangyong He Zhongzhao Xie Yongbo Li Xinmei Wang Wendi Cai |
author_sort |
Wangyong He |
title |
Synthesizing Depth Hand Images with GANs and Style Transfer for Hand Pose Estimation |
title_short |
Synthesizing Depth Hand Images with GANs and Style Transfer for Hand Pose Estimation |
title_full |
Synthesizing Depth Hand Images with GANs and Style Transfer for Hand Pose Estimation |
title_fullStr |
Synthesizing Depth Hand Images with GANs and Style Transfer for Hand Pose Estimation |
title_full_unstemmed |
Synthesizing Depth Hand Images with GANs and Style Transfer for Hand Pose Estimation |
title_sort |
synthesizing depth hand images with gans and style transfer for hand pose estimation |
publisher |
MDPI AG |
series |
Sensors |
issn |
1424-8220 |
publishDate |
2019-07-01 |
description |
Hand pose estimation is a critical technology of computer vision and human-computer interaction. Deep-learning methods require a considerable amount of tagged data. Accordingly, numerous labeled training data are required. This paper aims to generate depth hand images. Given a ground-truth 3D hand pose, the developed method can generate depth hand images. To be specific, a ground truth can be 3D hand poses with the hand structure contained, while the synthesized image has an identical size to that of the training image and a similar visual appearance to the training set. The developed method, inspired by the progress in the generative adversarial network (GAN) and image-style transfer, helps model the latent statistical relationship between the ground-truth hand pose and the corresponding depth hand image. The images synthesized using the developed method are demonstrated to be feasible for enhancing performance. On public hand pose datasets (NYU, MSRA, ICVL), comprehensive experiments prove that the developed method outperforms the existing works. |
topic |
hand pose estimation generative adversarial networks style transfer human-computer interaction depth images |
url |
https://www.mdpi.com/1424-8220/19/13/2919 |
work_keys_str_mv |
AT wangyonghe synthesizingdepthhandimageswithgansandstyletransferforhandposeestimation AT zhongzhaoxie synthesizingdepthhandimageswithgansandstyletransferforhandposeestimation AT yongboli synthesizingdepthhandimageswithgansandstyletransferforhandposeestimation AT xinmeiwang synthesizingdepthhandimageswithgansandstyletransferforhandposeestimation AT wendicai synthesizingdepthhandimageswithgansandstyletransferforhandposeestimation |
_version_ |
1725339441668554752 |