Spatio-Temporal Scale Coded Bag-of-Words

The Bag-of-Words (BoW) framework has been widely used in action recognition tasks due to its compact and efficient feature representation. Various modifications have been made to this framework to increase its classification power. This often results in an increased complexity and reduced efficiency...

Full description

Bibliographic Details
Main Authors:	Divina Govender, Jules-Raymond Tapamo
Format:	Article
Language:	English
Published:	MDPI AG 2020-11-01
Series:	Sensors
Subjects:	action recognition Bag-of-Words computational efficiency real-time systems
Online Access:	https://www.mdpi.com/1424-8220/20/21/6380

id	doaj-19c31912e6a746e0a6bdbfd112600480
record_format	Article
spelling	doaj-19c31912e6a746e0a6bdbfd1126004802020-11-25T04:05:27ZengMDPI AGSensors1424-82202020-11-01206380638010.3390/s20216380Spatio-Temporal Scale Coded Bag-of-WordsDivina Govender0Jules-Raymond Tapamo1School of Engineering, University of KwaZulu-Natal, Durban 4041, South AfricaSchool of Engineering, University of KwaZulu-Natal, Durban 4041, South AfricaThe Bag-of-Words (BoW) framework has been widely used in action recognition tasks due to its compact and efficient feature representation. Various modifications have been made to this framework to increase its classification power. This often results in an increased complexity and reduced efficiency. Inspired by the success of image-based scale coded BoW representations, we propose a spatio-temporal scale coded BoW (SC-BoW) for video-based recognition. This involves encoding extracted multi-scale information into BoW representations by partitioning spatio-temporal features into sub-groups based on the spatial scale from which they were extracted. We evaluate SC-BoW in two experimental setups. We first present a general pipeline to perform real-time action recognition with SC-BoW. Secondly, we apply SC-BoW onto the popular Dense Trajectory feature set. Results showed SC-BoW representations to successfully improve performance by 2–7% with low added computational cost. Notably, SC-BoW on Dense Trajectories outperformed more complex deep learning approaches. Thus, scale coding is a low-cost and low-level encoding scheme that increases classification power of the standard BoW without compromising efficiency.https://www.mdpi.com/1424-8220/20/21/6380action recognitionBag-of-Wordscomputational efficiencyreal-time systems
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Divina Govender Jules-Raymond Tapamo
spellingShingle	Divina Govender Jules-Raymond Tapamo Spatio-Temporal Scale Coded Bag-of-Words Sensors action recognition Bag-of-Words computational efficiency real-time systems
author_facet	Divina Govender Jules-Raymond Tapamo
author_sort	Divina Govender
title	Spatio-Temporal Scale Coded Bag-of-Words
title_short	Spatio-Temporal Scale Coded Bag-of-Words
title_full	Spatio-Temporal Scale Coded Bag-of-Words
title_fullStr	Spatio-Temporal Scale Coded Bag-of-Words
title_full_unstemmed	Spatio-Temporal Scale Coded Bag-of-Words
title_sort	spatio-temporal scale coded bag-of-words
publisher	MDPI AG
series	Sensors
issn	1424-8220
publishDate	2020-11-01
description	The Bag-of-Words (BoW) framework has been widely used in action recognition tasks due to its compact and efficient feature representation. Various modifications have been made to this framework to increase its classification power. This often results in an increased complexity and reduced efficiency. Inspired by the success of image-based scale coded BoW representations, we propose a spatio-temporal scale coded BoW (SC-BoW) for video-based recognition. This involves encoding extracted multi-scale information into BoW representations by partitioning spatio-temporal features into sub-groups based on the spatial scale from which they were extracted. We evaluate SC-BoW in two experimental setups. We first present a general pipeline to perform real-time action recognition with SC-BoW. Secondly, we apply SC-BoW onto the popular Dense Trajectory feature set. Results showed SC-BoW representations to successfully improve performance by 2–7% with low added computational cost. Notably, SC-BoW on Dense Trajectories outperformed more complex deep learning approaches. Thus, scale coding is a low-cost and low-level encoding scheme that increases classification power of the standard BoW without compromising efficiency.
topic	action recognition Bag-of-Words computational efficiency real-time systems
url	https://www.mdpi.com/1424-8220/20/21/6380
work_keys_str_mv	AT divinagovender spatiotemporalscalecodedbagofwords AT julesraymondtapamo spatiotemporalscalecodedbagofwords
_version_	1724433867379572736

Spatio-Temporal Scale Coded Bag-of-Words

Similar Items