A Cuboid Bi-Level Log Operator for Action Classification

We propose a 3-D cuboid bi-level Laplacian-of-Gaussian (CBLoG) operator with high speed and invariant space-time scale for detecting abrupt changes of signals in videos. This 3-D CBLoG operator is applied in the detection of spatio-temporal interest points in videos. Then, we extract rich feature de...

Full description

Bibliographic Details
Main Authors: Kaibo Yao, Nong Sang, Changxin Gao
Format: Article
Language:English
Published: IEEE 2018-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8470086/
id doaj-a84f56234c414f35955a338d38bd2729
record_format Article
spelling doaj-a84f56234c414f35955a338d38bd27292021-03-29T21:14:57ZengIEEEIEEE Access2169-35362018-01-016541475415710.1109/ACCESS.2018.28717338470086A Cuboid Bi-Level Log Operator for Action ClassificationKaibo Yao0https://orcid.org/0000-0003-0234-9639Nong Sang1Changxin Gao2National Key Laboratory of Science and Technology on Multispectral Information Processing, School of Automation, Huazhong University of Science and Technology, Wuhan, ChinaNational Key Laboratory of Science and Technology on Multispectral Information Processing, School of Automation, Huazhong University of Science and Technology, Wuhan, ChinaNational Key Laboratory of Science and Technology on Multispectral Information Processing, School of Automation, Huazhong University of Science and Technology, Wuhan, ChinaWe propose a 3-D cuboid bi-level Laplacian-of-Gaussian (CBLoG) operator with high speed and invariant space-time scale for detecting abrupt changes of signals in videos. This 3-D CBLoG operator is applied in the detection of spatio-temporal interest points in videos. Then, we extract rich feature descriptors, including HOG, HOF, and MBH (motion boundary histogram) around the detected points, and use Fisher vector to encoding these descriptors and their combinations as action representation. In our experiments, we evaluated our recognition method on two standard data sets: KTH and UCF Sport action data sets. The best accuracy on the KTH data set achieves at 99.3%, and on the UCF Sports data set, it is 90.0%. Compared with the existing approaches, our method outperforms the state of the art tested on the KTH data set and achieves impressive performance on the UCF sports data set.https://ieeexplore.ieee.org/document/8470086/Cuboid bi-level LoG (CBLoG) operatorspatio-temporal interest point (STIP)Fisher vectoraction recognition
collection DOAJ
language English
format Article
sources DOAJ
author Kaibo Yao
Nong Sang
Changxin Gao
spellingShingle Kaibo Yao
Nong Sang
Changxin Gao
A Cuboid Bi-Level Log Operator for Action Classification
IEEE Access
Cuboid bi-level LoG (CBLoG) operator
spatio-temporal interest point (STIP)
Fisher vector
action recognition
author_facet Kaibo Yao
Nong Sang
Changxin Gao
author_sort Kaibo Yao
title A Cuboid Bi-Level Log Operator for Action Classification
title_short A Cuboid Bi-Level Log Operator for Action Classification
title_full A Cuboid Bi-Level Log Operator for Action Classification
title_fullStr A Cuboid Bi-Level Log Operator for Action Classification
title_full_unstemmed A Cuboid Bi-Level Log Operator for Action Classification
title_sort cuboid bi-level log operator for action classification
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2018-01-01
description We propose a 3-D cuboid bi-level Laplacian-of-Gaussian (CBLoG) operator with high speed and invariant space-time scale for detecting abrupt changes of signals in videos. This 3-D CBLoG operator is applied in the detection of spatio-temporal interest points in videos. Then, we extract rich feature descriptors, including HOG, HOF, and MBH (motion boundary histogram) around the detected points, and use Fisher vector to encoding these descriptors and their combinations as action representation. In our experiments, we evaluated our recognition method on two standard data sets: KTH and UCF Sport action data sets. The best accuracy on the KTH data set achieves at 99.3%, and on the UCF Sports data set, it is 90.0%. Compared with the existing approaches, our method outperforms the state of the art tested on the KTH data set and achieves impressive performance on the UCF sports data set.
topic Cuboid bi-level LoG (CBLoG) operator
spatio-temporal interest point (STIP)
Fisher vector
action recognition
url https://ieeexplore.ieee.org/document/8470086/
work_keys_str_mv AT kaiboyao acuboidbilevellogoperatorforactionclassification
AT nongsang acuboidbilevellogoperatorforactionclassification
AT changxingao acuboidbilevellogoperatorforactionclassification
AT kaiboyao cuboidbilevellogoperatorforactionclassification
AT nongsang cuboidbilevellogoperatorforactionclassification
AT changxingao cuboidbilevellogoperatorforactionclassification
_version_ 1724193331429244928