Using Slides Homographic Characteristics for Speech Video Segmentation
碩士 === 國立臺灣師範大學 === 資訊工程學系 === 102 === Matching slides with video data frame is a method to provide users a quick way skim over the whole video by any given slide content, and will also help people quickly to jump to any point in the video, which may improve the user expereinces. But manually add ma...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2014
|
Online Access: | http://ndltd.ncl.edu.tw/handle/86245128242167222302 |
id |
ndltd-TW-102NTNU5392019 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-102NTNU53920192016-07-02T04:20:53Z http://ndltd.ncl.edu.tw/handle/86245128242167222302 Using Slides Homographic Characteristics for Speech Video Segmentation 以投影片單應性映射之相關特徵進行演講影片分析研究 You Sheng, Lin 林祐生 碩士 國立臺灣師範大學 資訊工程學系 102 Matching slides with video data frame is a method to provide users a quick way skim over the whole video by any given slide content, and will also help people quickly to jump to any point in the video, which may improve the user expereinces. But manually add mark to each time stamp in the video is time wasting. In this research , we develop an automatic process to achieve this. By given slides file and video file input, the proposed method will output segmented results. First, we use a heuristic method to eliminate duplicated and similar frames in recorded speech video. Then applying matching process based on SIFT. Then the matched candidates would be filtered by nearest neighbor ranking, which is suggested by D.G. Lowe. After we got matched candidates, a non-slide-frame detection will prune frames without slides displayed. Before output, we refine the recognition results with context scoring machanisims. The applying to a voting schema to improve the results of frame-slides pairs, and were achieved about 96% coverages of slide-frame switches. Greg C. Lee 李忠謀 2014 學位論文 ; thesis 71 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺灣師範大學 === 資訊工程學系 === 102 === Matching slides with video data frame is a method to provide users a quick way skim over the whole video by any given slide content, and will also help people quickly to jump to any point in the video, which may improve the user expereinces. But manually add mark to each time stamp in the video is time wasting. In this research , we develop an automatic process to achieve this. By given slides file and video file input, the proposed method will output segmented results.
First, we use a heuristic method to eliminate duplicated and similar frames in recorded speech video. Then applying matching process based on SIFT. Then the matched candidates would be filtered by nearest neighbor ranking, which is suggested by D.G. Lowe. After we got matched candidates, a non-slide-frame detection will prune frames without slides displayed. Before output, we refine the recognition results with context scoring machanisims. The applying to a voting schema to improve the results of frame-slides pairs, and were achieved about 96% coverages of slide-frame switches.
|
author2 |
Greg C. Lee |
author_facet |
Greg C. Lee You Sheng, Lin 林祐生 |
author |
You Sheng, Lin 林祐生 |
spellingShingle |
You Sheng, Lin 林祐生 Using Slides Homographic Characteristics for Speech Video Segmentation |
author_sort |
You Sheng, Lin |
title |
Using Slides Homographic Characteristics for Speech Video Segmentation |
title_short |
Using Slides Homographic Characteristics for Speech Video Segmentation |
title_full |
Using Slides Homographic Characteristics for Speech Video Segmentation |
title_fullStr |
Using Slides Homographic Characteristics for Speech Video Segmentation |
title_full_unstemmed |
Using Slides Homographic Characteristics for Speech Video Segmentation |
title_sort |
using slides homographic characteristics for speech video segmentation |
publishDate |
2014 |
url |
http://ndltd.ncl.edu.tw/handle/86245128242167222302 |
work_keys_str_mv |
AT youshenglin usingslideshomographiccharacteristicsforspeechvideosegmentation AT línyòushēng usingslideshomographiccharacteristicsforspeechvideosegmentation AT youshenglin yǐtóuyǐngpiàndānyīngxìngyìngshèzhīxiāngguāntèzhēngjìnxíngyǎnjiǎngyǐngpiànfēnxīyánjiū AT línyòushēng yǐtóuyǐngpiàndānyīngxìngyìngshèzhīxiāngguāntèzhēngjìnxíngyǎnjiǎngyǐngpiànfēnxīyánjiū |
_version_ |
1718332403132399616 |