A Novel Video Annotation by Integrating Visual Features and Frequent Patterns

碩士 === 國立成功大學 === 資訊工程學系碩博士班 === 94 ===   The major purpose of multimedia data retrieval is to hunt the related multimedia data effectively and efficiently. From human perspective, unfortunately, there is usually not enough semantic support to help users get the accurate results by using only low-le...

Full description

Bibliographic Details
Main Authors: Jhih-Hong Huang, 黃志鴻
Other Authors: Shin-Mu Tseng
Format: Others
Language:zh-TW
Published: 2006
Online Access:http://ndltd.ncl.edu.tw/handle/93307351241498076911
id ndltd-TW-094NCKU5392029
record_format oai_dc
spelling ndltd-TW-094NCKU53920292016-05-30T04:22:00Z http://ndltd.ncl.edu.tw/handle/93307351241498076911 A Novel Video Annotation by Integrating Visual Features and Frequent Patterns 整合視覺特徵與頻繁項目集之視訊註釋方式 Jhih-Hong Huang 黃志鴻 碩士 國立成功大學 資訊工程學系碩博士班 94   The major purpose of multimedia data retrieval is to hunt the related multimedia data effectively and efficiently. From human perspective, unfortunately, there is usually not enough semantic support to help users get the accurate results by using only low-level visual features as done in traditional studies. Generally speaking annotation can be a solution for enhancing the accuracy of multimedia data retrieval. Video annotation has been considered as a challenging research topic since videos carry complex scenic semantics in addition to visual features. In this paper, we propose a novel method for visual features and frequent patterns exists in the video. Our proposed method consists of two main phases: 1) construction of three kinds of prediction models, namely association, sequential and statistical models from annotated videos, and 2) fusion of these models for annotating unknown videos automatically. The main advantage of the proposed method lies in that both of visual features and semantic patterns are considered simultaneously through fusion approach so as to enhance the accuracy of annotation. Empirical evaluations show that our approach is very promising in enhancing the annotation accuracy in terms of precision and recall. Shin-Mu Tseng 曾新穆 2006 學位論文 ; thesis 61 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立成功大學 === 資訊工程學系碩博士班 === 94 ===   The major purpose of multimedia data retrieval is to hunt the related multimedia data effectively and efficiently. From human perspective, unfortunately, there is usually not enough semantic support to help users get the accurate results by using only low-level visual features as done in traditional studies. Generally speaking annotation can be a solution for enhancing the accuracy of multimedia data retrieval. Video annotation has been considered as a challenging research topic since videos carry complex scenic semantics in addition to visual features. In this paper, we propose a novel method for visual features and frequent patterns exists in the video. Our proposed method consists of two main phases: 1) construction of three kinds of prediction models, namely association, sequential and statistical models from annotated videos, and 2) fusion of these models for annotating unknown videos automatically. The main advantage of the proposed method lies in that both of visual features and semantic patterns are considered simultaneously through fusion approach so as to enhance the accuracy of annotation. Empirical evaluations show that our approach is very promising in enhancing the annotation accuracy in terms of precision and recall.
author2 Shin-Mu Tseng
author_facet Shin-Mu Tseng
Jhih-Hong Huang
黃志鴻
author Jhih-Hong Huang
黃志鴻
spellingShingle Jhih-Hong Huang
黃志鴻
A Novel Video Annotation by Integrating Visual Features and Frequent Patterns
author_sort Jhih-Hong Huang
title A Novel Video Annotation by Integrating Visual Features and Frequent Patterns
title_short A Novel Video Annotation by Integrating Visual Features and Frequent Patterns
title_full A Novel Video Annotation by Integrating Visual Features and Frequent Patterns
title_fullStr A Novel Video Annotation by Integrating Visual Features and Frequent Patterns
title_full_unstemmed A Novel Video Annotation by Integrating Visual Features and Frequent Patterns
title_sort novel video annotation by integrating visual features and frequent patterns
publishDate 2006
url http://ndltd.ncl.edu.tw/handle/93307351241498076911
work_keys_str_mv AT jhihhonghuang anovelvideoannotationbyintegratingvisualfeaturesandfrequentpatterns
AT huángzhìhóng anovelvideoannotationbyintegratingvisualfeaturesandfrequentpatterns
AT jhihhonghuang zhěnghéshìjuétèzhēngyǔpínfánxiàngmùjízhīshìxùnzhùshìfāngshì
AT huángzhìhóng zhěnghéshìjuétèzhēngyǔpínfánxiàngmùjízhīshìxùnzhùshìfāngshì
AT jhihhonghuang novelvideoannotationbyintegratingvisualfeaturesandfrequentpatterns
AT huángzhìhóng novelvideoannotationbyintegratingvisualfeaturesandfrequentpatterns
_version_ 1718285702447235072