Recognition of Human Activities Using Depth Maps and the Viewpoint Feature Histogram Descriptor

In this paper we propose a way of using depth maps transformed into 3D point clouds to classify human activities. The activities are described as time sequences of feature vectors based on the Viewpoint Feature Histogram descriptor (VFH) computed using the Point Cloud Library. Recognition is perform...

Full description

Bibliographic Details
Main Authors: Kamil Sidor, Marian Wysocki
Format: Article
Language:English
Published: MDPI AG 2020-05-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/20/10/2940
id doaj-3a71ebfa82c140baa20b441f216c8499
record_format Article
spelling doaj-3a71ebfa82c140baa20b441f216c84992020-11-25T03:03:49ZengMDPI AGSensors1424-82202020-05-01202940294010.3390/s20102940Recognition of Human Activities Using Depth Maps and the Viewpoint Feature Histogram DescriptorKamil Sidor0Marian Wysocki1Section of Informatization of the Course of Studies, Rzeszow University of Technology, al. Powstancow Warszawy, 12 35-959 Rzeszow, PolandDepartment of Computer and Control Engineering, Faculty of Electrical and Computer Engineering, Rzeszow University of Technology, W. Pola 2, 35-959 Rzeszow, PolandIn this paper we propose a way of using depth maps transformed into 3D point clouds to classify human activities. The activities are described as time sequences of feature vectors based on the Viewpoint Feature Histogram descriptor (VFH) computed using the Point Cloud Library. Recognition is performed by two types of classifiers: (i) k-NN nearest neighbors’ classifier with Dynamic Time Warping measure, (ii) bidirectional long short-term memory (BiLSTM) deep learning networks. Reduction of classification time for the k-NN by introducing a two tier model and improvement of BiLSTM-based classification via transfer learning and combining multiple networks by fuzzy integral are discussed. Our classification results obtained on two representative datasets: University of Texas at Dallas Multimodal Human Action Dataset and Mining Software Repositories Action 3D Dataset are comparable or better than the current state of the art.https://www.mdpi.com/1424-8220/20/10/2940point cloudsVFH descriptoractivity recognitiondynamic time warpingBiLSTMtransfer learning
collection DOAJ
language English
format Article
sources DOAJ
author Kamil Sidor
Marian Wysocki
spellingShingle Kamil Sidor
Marian Wysocki
Recognition of Human Activities Using Depth Maps and the Viewpoint Feature Histogram Descriptor
Sensors
point clouds
VFH descriptor
activity recognition
dynamic time warping
BiLSTM
transfer learning
author_facet Kamil Sidor
Marian Wysocki
author_sort Kamil Sidor
title Recognition of Human Activities Using Depth Maps and the Viewpoint Feature Histogram Descriptor
title_short Recognition of Human Activities Using Depth Maps and the Viewpoint Feature Histogram Descriptor
title_full Recognition of Human Activities Using Depth Maps and the Viewpoint Feature Histogram Descriptor
title_fullStr Recognition of Human Activities Using Depth Maps and the Viewpoint Feature Histogram Descriptor
title_full_unstemmed Recognition of Human Activities Using Depth Maps and the Viewpoint Feature Histogram Descriptor
title_sort recognition of human activities using depth maps and the viewpoint feature histogram descriptor
publisher MDPI AG
series Sensors
issn 1424-8220
publishDate 2020-05-01
description In this paper we propose a way of using depth maps transformed into 3D point clouds to classify human activities. The activities are described as time sequences of feature vectors based on the Viewpoint Feature Histogram descriptor (VFH) computed using the Point Cloud Library. Recognition is performed by two types of classifiers: (i) k-NN nearest neighbors’ classifier with Dynamic Time Warping measure, (ii) bidirectional long short-term memory (BiLSTM) deep learning networks. Reduction of classification time for the k-NN by introducing a two tier model and improvement of BiLSTM-based classification via transfer learning and combining multiple networks by fuzzy integral are discussed. Our classification results obtained on two representative datasets: University of Texas at Dallas Multimodal Human Action Dataset and Mining Software Repositories Action 3D Dataset are comparable or better than the current state of the art.
topic point clouds
VFH descriptor
activity recognition
dynamic time warping
BiLSTM
transfer learning
url https://www.mdpi.com/1424-8220/20/10/2940
work_keys_str_mv AT kamilsidor recognitionofhumanactivitiesusingdepthmapsandtheviewpointfeaturehistogramdescriptor
AT marianwysocki recognitionofhumanactivitiesusingdepthmapsandtheviewpointfeaturehistogramdescriptor
_version_ 1724684435272499200