Using Slides Homographic Characteristics for Speech Video Segmentation

碩士 === 國立臺灣師範大學 === 資訊工程學系 === 102 === Matching slides with video data frame is a method to provide users a quick way skim over the whole video by any given slide content, and will also help people quickly to jump to any point in the video, which may improve the user expereinces. But manually add ma...

Full description

Bibliographic Details
Main Authors:	You Sheng, Lin, 林祐生
Other Authors:	Greg C. Lee
Format:	Others
Language:	zh-TW
Published:	2014
Online Access:	http://ndltd.ncl.edu.tw/handle/86245128242167222302

id	ndltd-TW-102NTNU5392019
record_format	oai_dc
spelling	ndltd-TW-102NTNU53920192016-07-02T04:20:53Z http://ndltd.ncl.edu.tw/handle/86245128242167222302 Using Slides Homographic Characteristics for Speech Video Segmentation 以投影片單應性映射之相關特徵進行演講影片分析研究 You Sheng, Lin 林祐生碩士國立臺灣師範大學資訊工程學系 102 Matching slides with video data frame is a method to provide users a quick way skim over the whole video by any given slide content, and will also help people quickly to jump to any point in the video, which may improve the user expereinces. But manually add mark to each time stamp in the video is time wasting. In this research , we develop an automatic process to achieve this. By given slides file and video file input, the proposed method will output segmented results. First, we use a heuristic method to eliminate duplicated and similar frames in recorded speech video. Then applying matching process based on SIFT. Then the matched candidates would be filtered by nearest neighbor ranking, which is suggested by D.G. Lowe. After we got matched candidates, a non-slide-frame detection will prune frames without slides displayed. Before output, we refine the recognition results with context scoring machanisims. The applying to a voting schema to improve the results of frame-slides pairs, and were achieved about 96% coverages of slide-frame switches. Greg C. Lee 李忠謀 2014 學位論文 ; thesis 71 zh-TW
collection	NDLTD
language	zh-TW
format	Others
sources	NDLTD
description	碩士 === 國立臺灣師範大學 === 資訊工程學系 === 102 === Matching slides with video data frame is a method to provide users a quick way skim over the whole video by any given slide content, and will also help people quickly to jump to any point in the video, which may improve the user expereinces. But manually add mark to each time stamp in the video is time wasting. In this research , we develop an automatic process to achieve this. By given slides file and video file input, the proposed method will output segmented results. First, we use a heuristic method to eliminate duplicated and similar frames in recorded speech video. Then applying matching process based on SIFT. Then the matched candidates would be filtered by nearest neighbor ranking, which is suggested by D.G. Lowe. After we got matched candidates, a non-slide-frame detection will prune frames without slides displayed. Before output, we refine the recognition results with context scoring machanisims. The applying to a voting schema to improve the results of frame-slides pairs, and were achieved about 96% coverages of slide-frame switches.
author2	Greg C. Lee
author_facet	Greg C. Lee You Sheng, Lin 林祐生
author	You Sheng, Lin 林祐生
spellingShingle	You Sheng, Lin 林祐生 Using Slides Homographic Characteristics for Speech Video Segmentation
author_sort	You Sheng, Lin
title	Using Slides Homographic Characteristics for Speech Video Segmentation
title_short	Using Slides Homographic Characteristics for Speech Video Segmentation
title_full	Using Slides Homographic Characteristics for Speech Video Segmentation
title_fullStr	Using Slides Homographic Characteristics for Speech Video Segmentation
title_full_unstemmed	Using Slides Homographic Characteristics for Speech Video Segmentation
title_sort	using slides homographic characteristics for speech video segmentation
publishDate	2014
url	http://ndltd.ncl.edu.tw/handle/86245128242167222302
work_keys_str_mv	AT youshenglin usingslideshomographiccharacteristicsforspeechvideosegmentation AT línyòushēng usingslideshomographiccharacteristicsforspeechvideosegmentation AT youshenglin yǐtóuyǐngpiàndānyīngxìngyìngshèzhīxiāngguāntèzhēngjìnxíngyǎnjiǎngyǐngpiànfēnxīyánjiū AT línyòushēng yǐtóuyǐngpiàndānyīngxìngyìngshèzhīxiāngguāntèzhēngjìnxíngyǎnjiǎngyǐngpiànfēnxīyánjiū
_version_	1718332403132399616

Using Slides Homographic Characteristics for Speech Video Segmentation

Similar Items