Spatial-temporal Correlation based Event and Tactics Semantics Analyses for Sports Videos

博士 === 國立臺灣大學 === 資訊網路與多媒體研究所 === 99 === With the popularization of digital camera and recorder, the amount of multimedia video content increases rapidly. The viewer will take a lot of time to exhaustively search for a specific video clip without proper management and annotation of all his/her video...

Full description

Bibliographic Details
Main Authors:	Min-Chun Hu, 胡敏君
Other Authors:	吳家麟
Format:	Others
Language:	en_US
Published:	2011
Online Access:	http://ndltd.ncl.edu.tw/handle/67879267229328864355

id	ndltd-TW-099NTU05641010
record_format	oai_dc
spelling	ndltd-TW-099NTU056410102015-10-16T04:02:50Z http://ndltd.ncl.edu.tw/handle/67879267229328864355 Spatial-temporal Correlation based Event and Tactics Semantics Analyses for Sports Videos 基於時空關聯性之運動影片事件與戰術語義分析 Min-Chun Hu 胡敏君博士國立臺灣大學資訊網路與多媒體研究所 99 With the popularization of digital camera and recorder, the amount of multimedia video content increases rapidly. The viewer will take a lot of time to exhaustively search for a specific video clip without proper management and annotation of all his/her video content. Consequently, videos will be kept in the storage device permanently and finally lose their value. Recently, techniques of automatic video semantic analysis have been proposed for different types of videos. Among all kinds of videos, sports videos are rich in highlights which can be extracted for many applications. In this dissertation, we proposed a comprehensive framework of sports video analysis purposing to bridge the gap between low-level features and high-level semantics in a video. Combining domain knowledge and spatial-temporal correlation of media features, we extract important semantics of events and tactics from sports videos. There is a great diversity in the audiences of sports video, and we roughly categorize them into three groups, i.e., the general audiences who desire exciting highlights, the beginner who crave strategies or skills of how to play, and the professionals who dig into tactics of the opponents. The proposed framework extracts various kinds of semantics to meet their requirements correspondingly. Three main modules, including media feature extraction, explicit event/tactics detection based on high-level media features, and implicit semantic concept retrieval based on trajectory similarity, are studied in this dissertation. For the media feature extraction module, we develop several kinds of robust object detectors to generate mid/high-level media features for concept detection/retrieval. Combining representative high-level media features and the given domain knowledge, the concept detection/retrieval modules are able to acquire spatial-temporal related semantics that hardly be discovered by low-level media features. Moreover, we explore the possibility of extracting implicit events/tactics which are usually ignored by conventional event detection models. In sports videos, these implicit concepts are related to the movements of players and ball. We design an interactive interface for the user to input query trajectories, and the technique of trajectory similarity comparison is utilized to recommend the video clips having the best matched object trajectories. We realized the proposed framework on three types of sports videos, including tennis video, billiards video, and basketball video, to demonstrate the feasibility of the proposed sports video analysis framework. 吳家麟 2011 學位論文 ; thesis 115 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	博士 === 國立臺灣大學 === 資訊網路與多媒體研究所 === 99 === With the popularization of digital camera and recorder, the amount of multimedia video content increases rapidly. The viewer will take a lot of time to exhaustively search for a specific video clip without proper management and annotation of all his/her video content. Consequently, videos will be kept in the storage device permanently and finally lose their value. Recently, techniques of automatic video semantic analysis have been proposed for different types of videos. Among all kinds of videos, sports videos are rich in highlights which can be extracted for many applications. In this dissertation, we proposed a comprehensive framework of sports video analysis purposing to bridge the gap between low-level features and high-level semantics in a video. Combining domain knowledge and spatial-temporal correlation of media features, we extract important semantics of events and tactics from sports videos. There is a great diversity in the audiences of sports video, and we roughly categorize them into three groups, i.e., the general audiences who desire exciting highlights, the beginner who crave strategies or skills of how to play, and the professionals who dig into tactics of the opponents. The proposed framework extracts various kinds of semantics to meet their requirements correspondingly. Three main modules, including media feature extraction, explicit event/tactics detection based on high-level media features, and implicit semantic concept retrieval based on trajectory similarity, are studied in this dissertation. For the media feature extraction module, we develop several kinds of robust object detectors to generate mid/high-level media features for concept detection/retrieval. Combining representative high-level media features and the given domain knowledge, the concept detection/retrieval modules are able to acquire spatial-temporal related semantics that hardly be discovered by low-level media features. Moreover, we explore the possibility of extracting implicit events/tactics which are usually ignored by conventional event detection models. In sports videos, these implicit concepts are related to the movements of players and ball. We design an interactive interface for the user to input query trajectories, and the technique of trajectory similarity comparison is utilized to recommend the video clips having the best matched object trajectories. We realized the proposed framework on three types of sports videos, including tennis video, billiards video, and basketball video, to demonstrate the feasibility of the proposed sports video analysis framework.
author2	吳家麟
author_facet	吳家麟 Min-Chun Hu 胡敏君
author	Min-Chun Hu 胡敏君
spellingShingle	Min-Chun Hu 胡敏君 Spatial-temporal Correlation based Event and Tactics Semantics Analyses for Sports Videos
author_sort	Min-Chun Hu
title	Spatial-temporal Correlation based Event and Tactics Semantics Analyses for Sports Videos
title_short	Spatial-temporal Correlation based Event and Tactics Semantics Analyses for Sports Videos
title_full	Spatial-temporal Correlation based Event and Tactics Semantics Analyses for Sports Videos
title_fullStr	Spatial-temporal Correlation based Event and Tactics Semantics Analyses for Sports Videos
title_full_unstemmed	Spatial-temporal Correlation based Event and Tactics Semantics Analyses for Sports Videos
title_sort	spatial-temporal correlation based event and tactics semantics analyses for sports videos
publishDate	2011
url	http://ndltd.ncl.edu.tw/handle/67879267229328864355
work_keys_str_mv	AT minchunhu spatialtemporalcorrelationbasedeventandtacticssemanticsanalysesforsportsvideos AT húmǐnjūn spatialtemporalcorrelationbasedeventandtacticssemanticsanalysesforsportsvideos AT minchunhu jīyúshíkōngguānliánxìngzhīyùndòngyǐngpiànshìjiànyǔzhànshùyǔyìfēnxī AT húmǐnjūn jīyúshíkōngguānliánxìngzhīyùndòngyǐngpiànshìjiànyǔzhànshùyǔyìfēnxī
_version_	1718091595133222912

Spatial-temporal Correlation based Event and Tactics Semantics Analyses for Sports Videos

Similar Items