Automatic Dataset Expansion With Structured Feature Learning for Human Lying Pose Detection
In this study, we developed a framework to localize human lying poses by a camera positioned above. Our framework is motivated by the fact that detecting lying poses is fundamentally more difficult than detecting pedestrians or localizing nondeformable objects such as cars, roads, and buildings due...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8941100/ |
id |
doaj-0bce902ca64f438fadb407b4bfd9fab4 |
---|---|
record_format |
Article |
spelling |
doaj-0bce902ca64f438fadb407b4bfd9fab42021-03-30T02:48:18ZengIEEEIEEE Access2169-35362020-01-0181080109010.1109/ACCESS.2019.29621008941100Automatic Dataset Expansion With Structured Feature Learning for Human Lying Pose DetectionDaoxun Xia0https://orcid.org/0000-0001-9596-0095Lingjin Zhao1https://orcid.org/0000-0002-0598-8551Fang Guo2https://orcid.org/0000-0003-1273-2156Xi Chen3https://orcid.org/0000-0002-4206-1958School of Big Data and Computer Science, Guizhou Normal University, Guiyang, ChinaEngineering Laboratory for Applied Technology of Big Data in Education, Guizhou Normal University, Guiyang, ChinaSchool of Big Data and Computer Science, Guizhou Normal University, Guiyang, ChinaSchool of Big Data and Computer Science, Guizhou Normal University, Guiyang, ChinaIn this study, we developed a framework to localize human lying poses by a camera positioned above. Our framework is motivated by the fact that detecting lying poses is fundamentally more difficult than detecting pedestrians or localizing nondeformable objects such as cars, roads, and buildings due to the large number of poses, orientations, and scales that a human lying on the ground can take. An important problem with lying pose detection is the training dataset, which hardly accounts for each possible body configuration. As a solution, we propose a geometric expansion procedure that uses a virtual camera to increase the number of training images. We also use a Gibbs sampler to generate more training samples in the feature space on which the system can train its model. Once the training is completed, detection is performed on a multiscale and multirotational space. Because our framework accommodates a variety of object detection systems, we report the results for the Faster R-CNN, FPN, and RefineDet models. The results show that using automatic dataset expansion models systematically improves the results.https://ieeexplore.ieee.org/document/8941100/Human lying pose detectionautomatic dataset expansionperspective transformationgibbs samplingdeep learning |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Daoxun Xia Lingjin Zhao Fang Guo Xi Chen |
spellingShingle |
Daoxun Xia Lingjin Zhao Fang Guo Xi Chen Automatic Dataset Expansion With Structured Feature Learning for Human Lying Pose Detection IEEE Access Human lying pose detection automatic dataset expansion perspective transformation gibbs sampling deep learning |
author_facet |
Daoxun Xia Lingjin Zhao Fang Guo Xi Chen |
author_sort |
Daoxun Xia |
title |
Automatic Dataset Expansion With Structured Feature Learning for Human Lying Pose Detection |
title_short |
Automatic Dataset Expansion With Structured Feature Learning for Human Lying Pose Detection |
title_full |
Automatic Dataset Expansion With Structured Feature Learning for Human Lying Pose Detection |
title_fullStr |
Automatic Dataset Expansion With Structured Feature Learning for Human Lying Pose Detection |
title_full_unstemmed |
Automatic Dataset Expansion With Structured Feature Learning for Human Lying Pose Detection |
title_sort |
automatic dataset expansion with structured feature learning for human lying pose detection |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
In this study, we developed a framework to localize human lying poses by a camera positioned above. Our framework is motivated by the fact that detecting lying poses is fundamentally more difficult than detecting pedestrians or localizing nondeformable objects such as cars, roads, and buildings due to the large number of poses, orientations, and scales that a human lying on the ground can take. An important problem with lying pose detection is the training dataset, which hardly accounts for each possible body configuration. As a solution, we propose a geometric expansion procedure that uses a virtual camera to increase the number of training images. We also use a Gibbs sampler to generate more training samples in the feature space on which the system can train its model. Once the training is completed, detection is performed on a multiscale and multirotational space. Because our framework accommodates a variety of object detection systems, we report the results for the Faster R-CNN, FPN, and RefineDet models. The results show that using automatic dataset expansion models systematically improves the results. |
topic |
Human lying pose detection automatic dataset expansion perspective transformation gibbs sampling deep learning |
url |
https://ieeexplore.ieee.org/document/8941100/ |
work_keys_str_mv |
AT daoxunxia automaticdatasetexpansionwithstructuredfeaturelearningforhumanlyingposedetection AT lingjinzhao automaticdatasetexpansionwithstructuredfeaturelearningforhumanlyingposedetection AT fangguo automaticdatasetexpansionwithstructuredfeaturelearningforhumanlyingposedetection AT xichen automaticdatasetexpansionwithstructuredfeaturelearningforhumanlyingposedetection |
_version_ |
1724184502027157504 |