A Novel Video Annotation by Integrating Visual Features and Frequent Patterns
碩士 === 國立成功大學 === 資訊工程學系碩博士班 === 94 === The major purpose of multimedia data retrieval is to hunt the related multimedia data effectively and efficiently. From human perspective, unfortunately, there is usually not enough semantic support to help users get the accurate results by using only low-le...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2006
|
Online Access: | http://ndltd.ncl.edu.tw/handle/93307351241498076911 |
id |
ndltd-TW-094NCKU5392029 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-094NCKU53920292016-05-30T04:22:00Z http://ndltd.ncl.edu.tw/handle/93307351241498076911 A Novel Video Annotation by Integrating Visual Features and Frequent Patterns 整合視覺特徵與頻繁項目集之視訊註釋方式 Jhih-Hong Huang 黃志鴻 碩士 國立成功大學 資訊工程學系碩博士班 94 The major purpose of multimedia data retrieval is to hunt the related multimedia data effectively and efficiently. From human perspective, unfortunately, there is usually not enough semantic support to help users get the accurate results by using only low-level visual features as done in traditional studies. Generally speaking annotation can be a solution for enhancing the accuracy of multimedia data retrieval. Video annotation has been considered as a challenging research topic since videos carry complex scenic semantics in addition to visual features. In this paper, we propose a novel method for visual features and frequent patterns exists in the video. Our proposed method consists of two main phases: 1) construction of three kinds of prediction models, namely association, sequential and statistical models from annotated videos, and 2) fusion of these models for annotating unknown videos automatically. The main advantage of the proposed method lies in that both of visual features and semantic patterns are considered simultaneously through fusion approach so as to enhance the accuracy of annotation. Empirical evaluations show that our approach is very promising in enhancing the annotation accuracy in terms of precision and recall. Shin-Mu Tseng 曾新穆 2006 學位論文 ; thesis 61 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立成功大學 === 資訊工程學系碩博士班 === 94 === The major purpose of multimedia data retrieval is to hunt the related multimedia data effectively and efficiently. From human perspective, unfortunately, there is usually not enough semantic support to help users get the accurate results by using only low-level visual features as done in traditional studies. Generally speaking annotation can be a solution for enhancing the accuracy of multimedia data retrieval. Video annotation has been considered as a challenging research topic since videos carry complex scenic semantics in addition to visual features. In this paper, we propose a novel method for visual features and frequent patterns exists in the video. Our proposed method consists of two main phases: 1) construction of three kinds of prediction models, namely association, sequential and statistical models from annotated videos, and 2) fusion of these models for annotating unknown videos automatically. The main advantage of the proposed method lies in that both of visual features and semantic patterns are considered simultaneously through fusion approach so as to enhance the accuracy of annotation. Empirical evaluations show that our approach is very promising in enhancing the annotation accuracy in terms of precision and recall.
|
author2 |
Shin-Mu Tseng |
author_facet |
Shin-Mu Tseng Jhih-Hong Huang 黃志鴻 |
author |
Jhih-Hong Huang 黃志鴻 |
spellingShingle |
Jhih-Hong Huang 黃志鴻 A Novel Video Annotation by Integrating Visual Features and Frequent Patterns |
author_sort |
Jhih-Hong Huang |
title |
A Novel Video Annotation by Integrating Visual Features and Frequent Patterns |
title_short |
A Novel Video Annotation by Integrating Visual Features and Frequent Patterns |
title_full |
A Novel Video Annotation by Integrating Visual Features and Frequent Patterns |
title_fullStr |
A Novel Video Annotation by Integrating Visual Features and Frequent Patterns |
title_full_unstemmed |
A Novel Video Annotation by Integrating Visual Features and Frequent Patterns |
title_sort |
novel video annotation by integrating visual features and frequent patterns |
publishDate |
2006 |
url |
http://ndltd.ncl.edu.tw/handle/93307351241498076911 |
work_keys_str_mv |
AT jhihhonghuang anovelvideoannotationbyintegratingvisualfeaturesandfrequentpatterns AT huángzhìhóng anovelvideoannotationbyintegratingvisualfeaturesandfrequentpatterns AT jhihhonghuang zhěnghéshìjuétèzhēngyǔpínfánxiàngmùjízhīshìxùnzhùshìfāngshì AT huángzhìhóng zhěnghéshìjuétèzhēngyǔpínfánxiàngmùjízhīshìxùnzhùshìfāngshì AT jhihhonghuang novelvideoannotationbyintegratingvisualfeaturesandfrequentpatterns AT huángzhìhóng novelvideoannotationbyintegratingvisualfeaturesandfrequentpatterns |
_version_ |
1718285702447235072 |