A Cuboid Bi-Level Log Operator for Action Classification
We propose a 3-D cuboid bi-level Laplacian-of-Gaussian (CBLoG) operator with high speed and invariant space-time scale for detecting abrupt changes of signals in videos. This 3-D CBLoG operator is applied in the detection of spatio-temporal interest points in videos. Then, we extract rich feature de...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2018-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8470086/ |
id |
doaj-a84f56234c414f35955a338d38bd2729 |
---|---|
record_format |
Article |
spelling |
doaj-a84f56234c414f35955a338d38bd27292021-03-29T21:14:57ZengIEEEIEEE Access2169-35362018-01-016541475415710.1109/ACCESS.2018.28717338470086A Cuboid Bi-Level Log Operator for Action ClassificationKaibo Yao0https://orcid.org/0000-0003-0234-9639Nong Sang1Changxin Gao2National Key Laboratory of Science and Technology on Multispectral Information Processing, School of Automation, Huazhong University of Science and Technology, Wuhan, ChinaNational Key Laboratory of Science and Technology on Multispectral Information Processing, School of Automation, Huazhong University of Science and Technology, Wuhan, ChinaNational Key Laboratory of Science and Technology on Multispectral Information Processing, School of Automation, Huazhong University of Science and Technology, Wuhan, ChinaWe propose a 3-D cuboid bi-level Laplacian-of-Gaussian (CBLoG) operator with high speed and invariant space-time scale for detecting abrupt changes of signals in videos. This 3-D CBLoG operator is applied in the detection of spatio-temporal interest points in videos. Then, we extract rich feature descriptors, including HOG, HOF, and MBH (motion boundary histogram) around the detected points, and use Fisher vector to encoding these descriptors and their combinations as action representation. In our experiments, we evaluated our recognition method on two standard data sets: KTH and UCF Sport action data sets. The best accuracy on the KTH data set achieves at 99.3%, and on the UCF Sports data set, it is 90.0%. Compared with the existing approaches, our method outperforms the state of the art tested on the KTH data set and achieves impressive performance on the UCF sports data set.https://ieeexplore.ieee.org/document/8470086/Cuboid bi-level LoG (CBLoG) operatorspatio-temporal interest point (STIP)Fisher vectoraction recognition |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Kaibo Yao Nong Sang Changxin Gao |
spellingShingle |
Kaibo Yao Nong Sang Changxin Gao A Cuboid Bi-Level Log Operator for Action Classification IEEE Access Cuboid bi-level LoG (CBLoG) operator spatio-temporal interest point (STIP) Fisher vector action recognition |
author_facet |
Kaibo Yao Nong Sang Changxin Gao |
author_sort |
Kaibo Yao |
title |
A Cuboid Bi-Level Log Operator for Action Classification |
title_short |
A Cuboid Bi-Level Log Operator for Action Classification |
title_full |
A Cuboid Bi-Level Log Operator for Action Classification |
title_fullStr |
A Cuboid Bi-Level Log Operator for Action Classification |
title_full_unstemmed |
A Cuboid Bi-Level Log Operator for Action Classification |
title_sort |
cuboid bi-level log operator for action classification |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2018-01-01 |
description |
We propose a 3-D cuboid bi-level Laplacian-of-Gaussian (CBLoG) operator with high speed and invariant space-time scale for detecting abrupt changes of signals in videos. This 3-D CBLoG operator is applied in the detection of spatio-temporal interest points in videos. Then, we extract rich feature descriptors, including HOG, HOF, and MBH (motion boundary histogram) around the detected points, and use Fisher vector to encoding these descriptors and their combinations as action representation. In our experiments, we evaluated our recognition method on two standard data sets: KTH and UCF Sport action data sets. The best accuracy on the KTH data set achieves at 99.3%, and on the UCF Sports data set, it is 90.0%. Compared with the existing approaches, our method outperforms the state of the art tested on the KTH data set and achieves impressive performance on the UCF sports data set. |
topic |
Cuboid bi-level LoG (CBLoG) operator spatio-temporal interest point (STIP) Fisher vector action recognition |
url |
https://ieeexplore.ieee.org/document/8470086/ |
work_keys_str_mv |
AT kaiboyao acuboidbilevellogoperatorforactionclassification AT nongsang acuboidbilevellogoperatorforactionclassification AT changxingao acuboidbilevellogoperatorforactionclassification AT kaiboyao cuboidbilevellogoperatorforactionclassification AT nongsang cuboidbilevellogoperatorforactionclassification AT changxingao cuboidbilevellogoperatorforactionclassification |
_version_ |
1724193331429244928 |