A Novel Collaborative Optimization Framework for Web Video Event Mining Based on the Combination of Inaccurate Visual Similarity Detection Information and Sparse Textual Information

The high speed and low latency of 5G mobile network have accelerated the speed and amount of information transmission. Web video is likely to become the main mode of news production and dissemination in the future for its richer information and more convenient dissemination, which will subvert the t...

Full description

Bibliographic Details
Main Authors:	Chengde Zhang, Dandan Jin, Xia Xiao, Gao Chen, Mei-Ling Shyu
Format:	Article
Language:	English
Published:	IEEE 2020-01-01
Series:	IEEE Access
Subjects:	Event mining web video near-duplicate keyframes (NDKs) topic detection and tracking (TDT)
Online Access:	https://ieeexplore.ieee.org/document/8951020/

id	doaj-6a36996f4fa14a90b7446d11629bf0ec
record_format	Article
spelling	doaj-6a36996f4fa14a90b7446d11629bf0ec2021-03-30T03:03:27ZengIEEEIEEE Access2169-35362020-01-018105161052710.1109/ACCESS.2020.29647148951020A Novel Collaborative Optimization Framework for Web Video Event Mining Based on the Combination of Inaccurate Visual Similarity Detection Information and Sparse Textual InformationChengde Zhang0https://orcid.org/0000-0003-2246-4976Dandan Jin1https://orcid.org/0000-0001-9185-0884Xia Xiao2https://orcid.org/0000-0001-8364-2487Gao Chen3https://orcid.org/0000-0003-3865-0896Mei-Ling Shyu4https://orcid.org/0000-0003-0902-0844School of Information and Safety Engineering, Zhongnan University of Economics and Law, Wuhan, ChinaSchool of Information and Safety Engineering, Zhongnan University of Economics and Law, Wuhan, ChinaLaboratory Management Center, Wuhan Qingchuan University, Wuhan, ChinaSchool of Electrical Engineering and Intelligentization, Dongguan University of Technology, Dongguan, ChinaDepartment of Electrical and Computer Engineering, University of Miami, Coral Gables, FL, USAThe high speed and low latency of 5G mobile network have accelerated the speed and amount of information transmission. Web video is likely to become the main mode of news production and dissemination in the future for its richer information and more convenient dissemination, which will subvert the traditional mode of event mining. Therefore, event mining based on web videos has become a new research hotspot. However, web videos are vulnerable to video editing, lighting, shooting perspective and shooting angle, and other factors, resulting in the inaccurate visual similarity detection problem. Generally speaking, effectively integrating humungous volumes of cross-model information would give a great help. However, web videos are described with few terms, and thus sparse text information becomes a challenge for cross-model information combination. To address this issue, this paper proposes a new collaborative optimization framework with the combination of inaccurate visual similarity detection information and sparse textual information. This framework is composed of three steps. After obtaining the statistics of the distribution characteristics of each word in all Near-Duplicate Keyframes (NDKs), the high-level semantic cross-correlation between NDKs is first mined with the help of textual features, forming a new set of semantic relevant NDKs with different visual expressions. Next, textual distribution features are enriched through finding more semantically related words by the new NDK set with various forms of visual expressions, solving the sparse distribution problem for each word in all NDKs. Finally, Multiple Correspondence Analysis (MCA) is used to mine the events. Experimental results with a large number of real data demonstrate that the proposed model outperforms the existing methods for web video event mining.https://ieeexplore.ieee.org/document/8951020/Event miningweb videonear-duplicate keyframes (NDKs)topic detection and tracking (TDT)
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Chengde Zhang Dandan Jin Xia Xiao Gao Chen Mei-Ling Shyu
spellingShingle	Chengde Zhang Dandan Jin Xia Xiao Gao Chen Mei-Ling Shyu A Novel Collaborative Optimization Framework for Web Video Event Mining Based on the Combination of Inaccurate Visual Similarity Detection Information and Sparse Textual Information IEEE Access Event mining web video near-duplicate keyframes (NDKs) topic detection and tracking (TDT)
author_facet	Chengde Zhang Dandan Jin Xia Xiao Gao Chen Mei-Ling Shyu
author_sort	Chengde Zhang
title	A Novel Collaborative Optimization Framework for Web Video Event Mining Based on the Combination of Inaccurate Visual Similarity Detection Information and Sparse Textual Information
title_short	A Novel Collaborative Optimization Framework for Web Video Event Mining Based on the Combination of Inaccurate Visual Similarity Detection Information and Sparse Textual Information
title_full	A Novel Collaborative Optimization Framework for Web Video Event Mining Based on the Combination of Inaccurate Visual Similarity Detection Information and Sparse Textual Information
title_fullStr	A Novel Collaborative Optimization Framework for Web Video Event Mining Based on the Combination of Inaccurate Visual Similarity Detection Information and Sparse Textual Information
title_full_unstemmed	A Novel Collaborative Optimization Framework for Web Video Event Mining Based on the Combination of Inaccurate Visual Similarity Detection Information and Sparse Textual Information
title_sort	novel collaborative optimization framework for web video event mining based on the combination of inaccurate visual similarity detection information and sparse textual information
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2020-01-01
description	The high speed and low latency of 5G mobile network have accelerated the speed and amount of information transmission. Web video is likely to become the main mode of news production and dissemination in the future for its richer information and more convenient dissemination, which will subvert the traditional mode of event mining. Therefore, event mining based on web videos has become a new research hotspot. However, web videos are vulnerable to video editing, lighting, shooting perspective and shooting angle, and other factors, resulting in the inaccurate visual similarity detection problem. Generally speaking, effectively integrating humungous volumes of cross-model information would give a great help. However, web videos are described with few terms, and thus sparse text information becomes a challenge for cross-model information combination. To address this issue, this paper proposes a new collaborative optimization framework with the combination of inaccurate visual similarity detection information and sparse textual information. This framework is composed of three steps. After obtaining the statistics of the distribution characteristics of each word in all Near-Duplicate Keyframes (NDKs), the high-level semantic cross-correlation between NDKs is first mined with the help of textual features, forming a new set of semantic relevant NDKs with different visual expressions. Next, textual distribution features are enriched through finding more semantically related words by the new NDK set with various forms of visual expressions, solving the sparse distribution problem for each word in all NDKs. Finally, Multiple Correspondence Analysis (MCA) is used to mine the events. Experimental results with a large number of real data demonstrate that the proposed model outperforms the existing methods for web video event mining.
topic	Event mining web video near-duplicate keyframes (NDKs) topic detection and tracking (TDT)
url	https://ieeexplore.ieee.org/document/8951020/
work_keys_str_mv	AT chengdezhang anovelcollaborativeoptimizationframeworkforwebvideoeventminingbasedonthecombinationofinaccuratevisualsimilaritydetectioninformationandsparsetextualinformation AT dandanjin anovelcollaborativeoptimizationframeworkforwebvideoeventminingbasedonthecombinationofinaccuratevisualsimilaritydetectioninformationandsparsetextualinformation AT xiaxiao anovelcollaborativeoptimizationframeworkforwebvideoeventminingbasedonthecombinationofinaccuratevisualsimilaritydetectioninformationandsparsetextualinformation AT gaochen anovelcollaborativeoptimizationframeworkforwebvideoeventminingbasedonthecombinationofinaccuratevisualsimilaritydetectioninformationandsparsetextualinformation AT meilingshyu anovelcollaborativeoptimizationframeworkforwebvideoeventminingbasedonthecombinationofinaccuratevisualsimilaritydetectioninformationandsparsetextualinformation AT chengdezhang novelcollaborativeoptimizationframeworkforwebvideoeventminingbasedonthecombinationofinaccuratevisualsimilaritydetectioninformationandsparsetextualinformation AT dandanjin novelcollaborativeoptimizationframeworkforwebvideoeventminingbasedonthecombinationofinaccuratevisualsimilaritydetectioninformationandsparsetextualinformation AT xiaxiao novelcollaborativeoptimizationframeworkforwebvideoeventminingbasedonthecombinationofinaccuratevisualsimilaritydetectioninformationandsparsetextualinformation AT gaochen novelcollaborativeoptimizationframeworkforwebvideoeventminingbasedonthecombinationofinaccuratevisualsimilaritydetectioninformationandsparsetextualinformation AT meilingshyu novelcollaborativeoptimizationframeworkforwebvideoeventminingbasedonthecombinationofinaccuratevisualsimilaritydetectioninformationandsparsetextualinformation
_version_	1724184091149991936

A Novel Collaborative Optimization Framework for Web Video Event Mining Based on the Combination of Inaccurate Visual Similarity Detection Information and Sparse Textual Information

Similar Items