Using Slides Homographic Characteristics for Speech Video Segmentation

Master's thesis === National Taiwan Normal University === Department of Computer Science and Information Engineering === 102 === Matching slides to video frames gives users a quick way to skim a whole video by slide content and to jump to any point in it, which may improve the user experience. But manually add ma...


Bibliographic Details
Main Authors: You Sheng, Lin, 林祐生
Other Authors: Greg C. Lee
Format: Others
Language: zh-TW
Published: 2014
Online Access: http://ndltd.ncl.edu.tw/handle/86245128242167222302
id ndltd-TW-102NTNU5392019
record_format oai_dc
spelling ndltd-TW-102NTNU53920192016-07-02T04:20:53Z http://ndltd.ncl.edu.tw/handle/86245128242167222302 Using Slides Homographic Characteristics for Speech Video Segmentation 以投影片單應性映射之相關特徵進行演講影片分析研究 You Sheng, Lin 林祐生 碩士 國立臺灣師範大學 資訊工程學系 102 Greg C. Lee 李忠謀 2014 學位論文 ; thesis 71 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description Master's thesis === National Taiwan Normal University === Department of Computer Science and Information Engineering === 102 === Matching slides to video frames gives users a quick way to skim a whole video by slide content and to jump to any point in it, which may improve the user experience. Manually marking each time stamp in the video, however, is time-consuming. In this research, we develop an automatic process to do this: given a slide file and a video file as input, the proposed method outputs the segmented results. First, a heuristic method eliminates duplicate and similar frames from the recorded speech video. Next, a SIFT-based matching process is applied, and the matched candidates are filtered by the nearest-neighbor ranking suggested by D. G. Lowe. Once the matched candidates are obtained, non-slide-frame detection prunes frames in which no slide is displayed. Before output, the recognition results are refined with context-scoring mechanisms. Finally, a voting scheme is applied to improve the frame-slide pairs, achieving about 96% coverage of slide-frame switches.
author2 Greg C. Lee
author_facet Greg C. Lee
You Sheng, Lin
林祐生
author You Sheng, Lin
林祐生
spellingShingle You Sheng, Lin
林祐生
Using Slides Homographic Characteristics for Speech Video Segmentation
author_sort You Sheng, Lin
title Using Slides Homographic Characteristics for Speech Video Segmentation
title_short Using Slides Homographic Characteristics for Speech Video Segmentation
title_full Using Slides Homographic Characteristics for Speech Video Segmentation
title_fullStr Using Slides Homographic Characteristics for Speech Video Segmentation
title_full_unstemmed Using Slides Homographic Characteristics for Speech Video Segmentation
title_sort using slides homographic characteristics for speech video segmentation
publishDate 2014
url http://ndltd.ncl.edu.tw/handle/86245128242167222302
work_keys_str_mv AT youshenglin usingslideshomographiccharacteristicsforspeechvideosegmentation
AT línyòushēng usingslideshomographiccharacteristicsforspeechvideosegmentation
AT youshenglin yǐtóuyǐngpiàndānyīngxìngyìngshèzhīxiāngguāntèzhēngjìnxíngyǎnjiǎngyǐngpiànfēnxīyánjiū
AT línyòushēng yǐtóuyǐngpiàndānyīngxìngyìngshèzhīxiāngguāntèzhēngjìnxíngyǎnjiǎngyǐngpiànfēnxīyánjiū
_version_ 1718332403132399616
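
Illustrative note on the abstract: the description above names three steps that lend themselves to a small sketch: nearest-neighbor filtering of descriptor matches, voting over frame-slide pairs, and segmentation into slide runs. The Python below is a minimal sketch under stated assumptions, not the thesis's implementation. Plain tuples stand in for SIFT descriptors, the 0.8 threshold is the distance-ratio value from Lowe's paper rather than a value confirmed by this record, and the sliding-window majority vote, its window size, and the function names `ratio_test_matches`, `smooth_labels`, and `segment` are all hypothetical.

```python
from collections import Counter
from math import dist


def ratio_test_matches(frame_desc, slide_desc, ratio=0.8):
    """Return (frame_index, slide_index) pairs that pass the ratio test.

    A frame descriptor is matched to its nearest slide descriptor only when
    the nearest distance is below `ratio` times the second-nearest distance.
    The default of 0.8 is an assumption taken from Lowe's paper.
    """
    matches = []
    for i, d in enumerate(frame_desc):
        # Distances from this frame descriptor to every slide descriptor.
        ranked = sorted((dist(d, s), j) for j, s in enumerate(slide_desc))
        if len(ranked) < 2:
            continue  # the test needs a second-nearest neighbor
        (best, j), (second, _) = ranked[0], ranked[1]
        if best < ratio * second:
            matches.append((i, j))
    return matches


def smooth_labels(labels, window=5):
    """Replace each frame's slide label by the majority label in a window.

    This stands in for the voting step; the window size is an assumption.
    """
    half = window // 2
    smoothed = []
    for k in range(len(labels)):
        neighborhood = labels[max(0, k - half):k + half + 1]
        smoothed.append(Counter(neighborhood).most_common(1)[0][0])
    return smoothed


def segment(labels):
    """Collapse per-frame labels into (first_frame, last_frame, slide) runs."""
    segments = []
    for k, label in enumerate(labels):
        if segments and segments[-1][2] == label:
            # Same slide as the previous frame: extend the current run.
            segments[-1] = (segments[-1][0], k, label)
        else:
            segments.append((k, k, label))
    return segments
```

In this sketch an ambiguous descriptor, one whose two closest candidates are about equally distant, is dropped instead of being forced onto a slide, and an isolated mislabeled frame is outvoted by its neighbors before the runs are cut. In practice the descriptors would come from a SIFT implementation and the labels from per-frame match counts.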