Recognition of Human Activities Using Depth Maps and the Viewpoint Feature Histogram Descriptor
In this paper we propose a way of using depth maps transformed into 3D point clouds to classify human activities. The activities are described as time sequences of feature vectors based on the Viewpoint Feature Histogram descriptor (VFH) computed using the Point Cloud Library. Recognition is perform...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-05-01
|
Series: | Sensors |
Subjects: | |
Online Access: | https://www.mdpi.com/1424-8220/20/10/2940 |
id |
doaj-3a71ebfa82c140baa20b441f216c8499 |
---|---|
record_format |
Article |
spelling |
doaj-3a71ebfa82c140baa20b441f216c84992020-11-25T03:03:49ZengMDPI AGSensors1424-82202020-05-01202940294010.3390/s20102940Recognition of Human Activities Using Depth Maps and the Viewpoint Feature Histogram DescriptorKamil Sidor0Marian Wysocki1Section of Informatization of the Course of Studies, Rzeszow University of Technology, al. Powstancow Warszawy, 12 35-959 Rzeszow, PolandDepartment of Computer and Control Engineering, Faculty of Electrical and Computer Engineering, Rzeszow University of Technology, W. Pola 2, 35-959 Rzeszow, PolandIn this paper we propose a way of using depth maps transformed into 3D point clouds to classify human activities. The activities are described as time sequences of feature vectors based on the Viewpoint Feature Histogram descriptor (VFH) computed using the Point Cloud Library. Recognition is performed by two types of classifiers: (i) k-NN nearest neighbors’ classifier with Dynamic Time Warping measure, (ii) bidirectional long short-term memory (BiLSTM) deep learning networks. Reduction of classification time for the k-NN by introducing a two tier model and improvement of BiLSTM-based classification via transfer learning and combining multiple networks by fuzzy integral are discussed. Our classification results obtained on two representative datasets: University of Texas at Dallas Multimodal Human Action Dataset and Mining Software Repositories Action 3D Dataset are comparable or better than the current state of the art.https://www.mdpi.com/1424-8220/20/10/2940point cloudsVFH descriptoractivity recognitiondynamic time warpingBiLSTMtransfer learning |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Kamil Sidor Marian Wysocki |
spellingShingle |
Kamil Sidor Marian Wysocki Recognition of Human Activities Using Depth Maps and the Viewpoint Feature Histogram Descriptor Sensors point clouds VFH descriptor activity recognition dynamic time warping BiLSTM transfer learning |
author_facet |
Kamil Sidor Marian Wysocki |
author_sort |
Kamil Sidor |
title |
Recognition of Human Activities Using Depth Maps and the Viewpoint Feature Histogram Descriptor |
title_short |
Recognition of Human Activities Using Depth Maps and the Viewpoint Feature Histogram Descriptor |
title_full |
Recognition of Human Activities Using Depth Maps and the Viewpoint Feature Histogram Descriptor |
title_fullStr |
Recognition of Human Activities Using Depth Maps and the Viewpoint Feature Histogram Descriptor |
title_full_unstemmed |
Recognition of Human Activities Using Depth Maps and the Viewpoint Feature Histogram Descriptor |
title_sort |
recognition of human activities using depth maps and the viewpoint feature histogram descriptor |
publisher |
MDPI AG |
series |
Sensors |
issn |
1424-8220 |
publishDate |
2020-05-01 |
description |
In this paper we propose a way of using depth maps transformed into 3D point clouds to classify human activities. The activities are described as time sequences of feature vectors based on the Viewpoint Feature Histogram descriptor (VFH) computed using the Point Cloud Library. Recognition is performed by two types of classifiers: (i) k-NN nearest neighbors’ classifier with Dynamic Time Warping measure, (ii) bidirectional long short-term memory (BiLSTM) deep learning networks. Reduction of classification time for the k-NN by introducing a two tier model and improvement of BiLSTM-based classification via transfer learning and combining multiple networks by fuzzy integral are discussed. Our classification results obtained on two representative datasets: University of Texas at Dallas Multimodal Human Action Dataset and Mining Software Repositories Action 3D Dataset are comparable or better than the current state of the art. |
topic |
point clouds VFH descriptor activity recognition dynamic time warping BiLSTM transfer learning |
url |
https://www.mdpi.com/1424-8220/20/10/2940 |
work_keys_str_mv |
AT kamilsidor recognitionofhumanactivitiesusingdepthmapsandtheviewpointfeaturehistogramdescriptor AT marianwysocki recognitionofhumanactivitiesusingdepthmapsandtheviewpointfeaturehistogramdescriptor |
_version_ |
1724684435272499200 |