Automatic Dataset Expansion With Structured Feature Learning for Human Lying Pose Detection

In this study, we developed a framework to localize human lying poses by a camera positioned above. Our framework is motivated by the fact that detecting lying poses is fundamentally more difficult than detecting pedestrians or localizing nondeformable objects such as cars, roads, and buildings due...

Full description

Bibliographic Details
Main Authors: Daoxun Xia, Lingjin Zhao, Fang Guo, Xi Chen
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8941100/
id doaj-0bce902ca64f438fadb407b4bfd9fab4
record_format Article
spelling doaj-0bce902ca64f438fadb407b4bfd9fab42021-03-30T02:48:18ZengIEEEIEEE Access2169-35362020-01-0181080109010.1109/ACCESS.2019.29621008941100Automatic Dataset Expansion With Structured Feature Learning for Human Lying Pose DetectionDaoxun Xia0https://orcid.org/0000-0001-9596-0095Lingjin Zhao1https://orcid.org/0000-0002-0598-8551Fang Guo2https://orcid.org/0000-0003-1273-2156Xi Chen3https://orcid.org/0000-0002-4206-1958School of Big Data and Computer Science, Guizhou Normal University, Guiyang, ChinaEngineering Laboratory for Applied Technology of Big Data in Education, Guizhou Normal University, Guiyang, ChinaSchool of Big Data and Computer Science, Guizhou Normal University, Guiyang, ChinaSchool of Big Data and Computer Science, Guizhou Normal University, Guiyang, ChinaIn this study, we developed a framework to localize human lying poses by a camera positioned above. Our framework is motivated by the fact that detecting lying poses is fundamentally more difficult than detecting pedestrians or localizing nondeformable objects such as cars, roads, and buildings due to the large number of poses, orientations, and scales that a human lying on the ground can take. An important problem with lying pose detection is the training dataset, which hardly accounts for each possible body configuration. As a solution, we propose a geometric expansion procedure that uses a virtual camera to increase the number of training images. We also use a Gibbs sampler to generate more training samples in the feature space on which the system can train its model. Once the training is completed, detection is performed on a multiscale and multirotational space. Because our framework accommodates a variety of object detection systems, we report the results for the Faster R-CNN, FPN, and RefineDet models. The results show that using automatic dataset expansion models systematically improves the results.https://ieeexplore.ieee.org/document/8941100/Human lying pose detectionautomatic dataset expansionperspective transformationgibbs samplingdeep learning
collection DOAJ
language English
format Article
sources DOAJ
author Daoxun Xia
Lingjin Zhao
Fang Guo
Xi Chen
spellingShingle Daoxun Xia
Lingjin Zhao
Fang Guo
Xi Chen
Automatic Dataset Expansion With Structured Feature Learning for Human Lying Pose Detection
IEEE Access
Human lying pose detection
automatic dataset expansion
perspective transformation
gibbs sampling
deep learning
author_facet Daoxun Xia
Lingjin Zhao
Fang Guo
Xi Chen
author_sort Daoxun Xia
title Automatic Dataset Expansion With Structured Feature Learning for Human Lying Pose Detection
title_short Automatic Dataset Expansion With Structured Feature Learning for Human Lying Pose Detection
title_full Automatic Dataset Expansion With Structured Feature Learning for Human Lying Pose Detection
title_fullStr Automatic Dataset Expansion With Structured Feature Learning for Human Lying Pose Detection
title_full_unstemmed Automatic Dataset Expansion With Structured Feature Learning for Human Lying Pose Detection
title_sort automatic dataset expansion with structured feature learning for human lying pose detection
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description In this study, we developed a framework to localize human lying poses by a camera positioned above. Our framework is motivated by the fact that detecting lying poses is fundamentally more difficult than detecting pedestrians or localizing nondeformable objects such as cars, roads, and buildings due to the large number of poses, orientations, and scales that a human lying on the ground can take. An important problem with lying pose detection is the training dataset, which hardly accounts for each possible body configuration. As a solution, we propose a geometric expansion procedure that uses a virtual camera to increase the number of training images. We also use a Gibbs sampler to generate more training samples in the feature space on which the system can train its model. Once the training is completed, detection is performed on a multiscale and multirotational space. Because our framework accommodates a variety of object detection systems, we report the results for the Faster R-CNN, FPN, and RefineDet models. The results show that using automatic dataset expansion models systematically improves the results.
topic Human lying pose detection
automatic dataset expansion
perspective transformation
gibbs sampling
deep learning
url https://ieeexplore.ieee.org/document/8941100/
work_keys_str_mv AT daoxunxia automaticdatasetexpansionwithstructuredfeaturelearningforhumanlyingposedetection
AT lingjinzhao automaticdatasetexpansionwithstructuredfeaturelearningforhumanlyingposedetection
AT fangguo automaticdatasetexpansionwithstructuredfeaturelearningforhumanlyingposedetection
AT xichen automaticdatasetexpansionwithstructuredfeaturelearningforhumanlyingposedetection
_version_ 1724184502027157504