Active Video Hashing via Structure Information Learning for Activity Analysis

When attempting to analyze and understand large-scale video datasets, choosing training videos using active learning can significantly reduce the annotation costs associated with supervised learning without sacrificing the accuracy of classifiers. However, to further reduce the computational overhea...

Full description

Bibliographic Details
Main Authors: Xiangdong Wang, Qiuxu Wang, Hui Wang
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9093899/
id doaj-16a4f282c2804e21a86877599401f313
record_format Article
spelling doaj-16a4f282c2804e21a86877599401f3132021-03-30T02:15:18ZengIEEEIEEE Access2169-35362020-01-018964289643710.1109/ACCESS.2020.29947839093899Active Video Hashing via Structure Information Learning for Activity AnalysisXiangdong Wang0https://orcid.org/0000-0002-4049-2666Qiuxu Wang1Hui Wang2School of Physical Education, Shandong Normal University, Jinan, ChinaSchool of Physical Education, Shandong Normal University, Jinan, ChinaSchool of Physical Education, Shandong Normal University, Jinan, ChinaWhen attempting to analyze and understand large-scale video datasets, choosing training videos using active learning can significantly reduce the annotation costs associated with supervised learning without sacrificing the accuracy of classifiers. However, to further reduce the computational overhead of exhaustive comparisons between low-level visual feature with high dimensions, we have developed a novel video hashing coding model based on an active learning framework. The model optimizes mean average precision in a straightforward way by explicitly considering the structure information in video clips when learning the optimal hash functions. The structure information considered includes the temporal consistency between successive frames and the local visual patterns shared by videos with same semantic labels. Rather than relying on the similarity between paired videos, we use a ranking-based loss function to directly optimize mean average precision. Then, combined with the active learning component, we jointly evaluate the unlabeled training videos according to the uncertainty and average precision values with an efficient algorithm based on structured SVM. Extensive experimental results over several benchmark datasets demonstrate that our approach produces significantly higher search accuracy than traditional query refinement schemes.https://ieeexplore.ieee.org/document/9093899/Active learningvideo analysis
collection DOAJ
language English
format Article
sources DOAJ
author Xiangdong Wang
Qiuxu Wang
Hui Wang
spellingShingle Xiangdong Wang
Qiuxu Wang
Hui Wang
Active Video Hashing via Structure Information Learning for Activity Analysis
IEEE Access
Active learning
video analysis
author_facet Xiangdong Wang
Qiuxu Wang
Hui Wang
author_sort Xiangdong Wang
title Active Video Hashing via Structure Information Learning for Activity Analysis
title_short Active Video Hashing via Structure Information Learning for Activity Analysis
title_full Active Video Hashing via Structure Information Learning for Activity Analysis
title_fullStr Active Video Hashing via Structure Information Learning for Activity Analysis
title_full_unstemmed Active Video Hashing via Structure Information Learning for Activity Analysis
title_sort active video hashing via structure information learning for activity analysis
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description When attempting to analyze and understand large-scale video datasets, choosing training videos using active learning can significantly reduce the annotation costs associated with supervised learning without sacrificing the accuracy of classifiers. However, to further reduce the computational overhead of exhaustive comparisons between low-level visual feature with high dimensions, we have developed a novel video hashing coding model based on an active learning framework. The model optimizes mean average precision in a straightforward way by explicitly considering the structure information in video clips when learning the optimal hash functions. The structure information considered includes the temporal consistency between successive frames and the local visual patterns shared by videos with same semantic labels. Rather than relying on the similarity between paired videos, we use a ranking-based loss function to directly optimize mean average precision. Then, combined with the active learning component, we jointly evaluate the unlabeled training videos according to the uncertainty and average precision values with an efficient algorithm based on structured SVM. Extensive experimental results over several benchmark datasets demonstrate that our approach produces significantly higher search accuracy than traditional query refinement schemes.
topic Active learning
video analysis
url https://ieeexplore.ieee.org/document/9093899/
work_keys_str_mv AT xiangdongwang activevideohashingviastructureinformationlearningforactivityanalysis
AT qiuxuwang activevideohashingviastructureinformationlearningforactivityanalysis
AT huiwang activevideohashingviastructureinformationlearningforactivityanalysis
_version_ 1724185491482345472