Active Video Hashing via Structure Information Learning for Activity Analysis
When attempting to analyze and understand large-scale video datasets, choosing training videos using active learning can significantly reduce the annotation costs associated with supervised learning without sacrificing the accuracy of classifiers. However, to further reduce the computational overhea...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9093899/ |
id |
doaj-16a4f282c2804e21a86877599401f313 |
---|---|
record_format |
Article |
spelling |
doaj-16a4f282c2804e21a86877599401f3132021-03-30T02:15:18ZengIEEEIEEE Access2169-35362020-01-018964289643710.1109/ACCESS.2020.29947839093899Active Video Hashing via Structure Information Learning for Activity AnalysisXiangdong Wang0https://orcid.org/0000-0002-4049-2666Qiuxu Wang1Hui Wang2School of Physical Education, Shandong Normal University, Jinan, ChinaSchool of Physical Education, Shandong Normal University, Jinan, ChinaSchool of Physical Education, Shandong Normal University, Jinan, ChinaWhen attempting to analyze and understand large-scale video datasets, choosing training videos using active learning can significantly reduce the annotation costs associated with supervised learning without sacrificing the accuracy of classifiers. However, to further reduce the computational overhead of exhaustive comparisons between low-level visual feature with high dimensions, we have developed a novel video hashing coding model based on an active learning framework. The model optimizes mean average precision in a straightforward way by explicitly considering the structure information in video clips when learning the optimal hash functions. The structure information considered includes the temporal consistency between successive frames and the local visual patterns shared by videos with same semantic labels. Rather than relying on the similarity between paired videos, we use a ranking-based loss function to directly optimize mean average precision. Then, combined with the active learning component, we jointly evaluate the unlabeled training videos according to the uncertainty and average precision values with an efficient algorithm based on structured SVM. Extensive experimental results over several benchmark datasets demonstrate that our approach produces significantly higher search accuracy than traditional query refinement schemes.https://ieeexplore.ieee.org/document/9093899/Active learningvideo analysis |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Xiangdong Wang Qiuxu Wang Hui Wang |
spellingShingle |
Xiangdong Wang Qiuxu Wang Hui Wang Active Video Hashing via Structure Information Learning for Activity Analysis IEEE Access Active learning video analysis |
author_facet |
Xiangdong Wang Qiuxu Wang Hui Wang |
author_sort |
Xiangdong Wang |
title |
Active Video Hashing via Structure Information Learning for Activity Analysis |
title_short |
Active Video Hashing via Structure Information Learning for Activity Analysis |
title_full |
Active Video Hashing via Structure Information Learning for Activity Analysis |
title_fullStr |
Active Video Hashing via Structure Information Learning for Activity Analysis |
title_full_unstemmed |
Active Video Hashing via Structure Information Learning for Activity Analysis |
title_sort |
active video hashing via structure information learning for activity analysis |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
When attempting to analyze and understand large-scale video datasets, choosing training videos using active learning can significantly reduce the annotation costs associated with supervised learning without sacrificing the accuracy of classifiers. However, to further reduce the computational overhead of exhaustive comparisons between low-level visual feature with high dimensions, we have developed a novel video hashing coding model based on an active learning framework. The model optimizes mean average precision in a straightforward way by explicitly considering the structure information in video clips when learning the optimal hash functions. The structure information considered includes the temporal consistency between successive frames and the local visual patterns shared by videos with same semantic labels. Rather than relying on the similarity between paired videos, we use a ranking-based loss function to directly optimize mean average precision. Then, combined with the active learning component, we jointly evaluate the unlabeled training videos according to the uncertainty and average precision values with an efficient algorithm based on structured SVM. Extensive experimental results over several benchmark datasets demonstrate that our approach produces significantly higher search accuracy than traditional query refinement schemes. |
topic |
Active learning video analysis |
url |
https://ieeexplore.ieee.org/document/9093899/ |
work_keys_str_mv |
AT xiangdongwang activevideohashingviastructureinformationlearningforactivityanalysis AT qiuxuwang activevideohashingviastructureinformationlearningforactivityanalysis AT huiwang activevideohashingviastructureinformationlearningforactivityanalysis |
_version_ |
1724185491482345472 |