Text Extraction for Lecture Videos with Complicated Background

碩士 === 元智大學 === 資訊工程學系 === 99 === In terms of streaming media and internet are used more and more frequently, the era of e-Learning emerges. In the e-learning system, learners can access lecture videos no matter when and where. Thus, it is imperative to provide an effective method to retrieve lectur...

Full description

Bibliographic Details
Main Authors:	Chi-Yuang Shaio, 蕭琪元
Other Authors:	陳淑媛
Format:	Others
Language:	en_US
Published:	2011
Online Access:	http://ndltd.ncl.edu.tw/handle/62867350530367525320

id	ndltd-TW-099YZU05392009
record_format	oai_dc
spelling	ndltd-TW-099YZU053920092016-04-13T04:16:58Z http://ndltd.ncl.edu.tw/handle/62867350530367525320 Text Extraction for Lecture Videos with Complicated Background 複雜背景教學視訊串列之文字擷取 Chi-Yuang Shaio 蕭琪元碩士元智大學資訊工程學系 99 In terms of streaming media and internet are used more and more frequently, the era of e-Learning emerges. In the e-learning system, learners can access lecture videos no matter when and where. Thus, it is imperative to provide an effective method to retrieve lecture videos conveniently and friendly. In the thesis, text extraction for lecture videos with complicated background is proposed to facilitate lecture video retrieval using textual keywords. Since background of lecture videos may be rather complicated and fancy, in particular may have textual characteristics, foreground segmentation method is designed to extract texts region. On the other hand, since the resolution of lecture videos is generally low, how to enhance the quality of texts to facilitate the consequent text recognition is the other issue in this thesis. First, temporal analysis of lecture videos is performed to detect slide transitions. The frames corresponding to those frames between slide transitions are then merged into a key frame to represent the slide. The consequent process can then be applied to the key frame only so as to reduce computing time. Second, local features are extracted from block partition of slide-like key frame, based on which background model are generated followed by foreground extraction. Finally, for each text region extracted from foregrounds, quality improvement and adaptive binarization are employed to facilitate consequent optical character recognition. The recognition accuracy rate is used to evaluate the performance of the proposed method and to compare with existing methods. Various experiments prove that the effectiveness and feasibility of our method. 陳淑媛 2011 學位論文 ; thesis 47 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	碩士 === 元智大學 === 資訊工程學系 === 99 === In terms of streaming media and internet are used more and more frequently, the era of e-Learning emerges. In the e-learning system, learners can access lecture videos no matter when and where. Thus, it is imperative to provide an effective method to retrieve lecture videos conveniently and friendly. In the thesis, text extraction for lecture videos with complicated background is proposed to facilitate lecture video retrieval using textual keywords. Since background of lecture videos may be rather complicated and fancy, in particular may have textual characteristics, foreground segmentation method is designed to extract texts region. On the other hand, since the resolution of lecture videos is generally low, how to enhance the quality of texts to facilitate the consequent text recognition is the other issue in this thesis. First, temporal analysis of lecture videos is performed to detect slide transitions. The frames corresponding to those frames between slide transitions are then merged into a key frame to represent the slide. The consequent process can then be applied to the key frame only so as to reduce computing time. Second, local features are extracted from block partition of slide-like key frame, based on which background model are generated followed by foreground extraction. Finally, for each text region extracted from foregrounds, quality improvement and adaptive binarization are employed to facilitate consequent optical character recognition. The recognition accuracy rate is used to evaluate the performance of the proposed method and to compare with existing methods. Various experiments prove that the effectiveness and feasibility of our method.
author2	陳淑媛
author_facet	陳淑媛 Chi-Yuang Shaio 蕭琪元
author	Chi-Yuang Shaio 蕭琪元
spellingShingle	Chi-Yuang Shaio 蕭琪元 Text Extraction for Lecture Videos with Complicated Background
author_sort	Chi-Yuang Shaio
title	Text Extraction for Lecture Videos with Complicated Background
title_short	Text Extraction for Lecture Videos with Complicated Background
title_full	Text Extraction for Lecture Videos with Complicated Background
title_fullStr	Text Extraction for Lecture Videos with Complicated Background
title_full_unstemmed	Text Extraction for Lecture Videos with Complicated Background
title_sort	text extraction for lecture videos with complicated background
publishDate	2011
url	http://ndltd.ncl.edu.tw/handle/62867350530367525320
work_keys_str_mv	AT chiyuangshaio textextractionforlecturevideoswithcomplicatedbackground AT xiāoqíyuán textextractionforlecturevideoswithcomplicatedbackground AT chiyuangshaio fùzábèijǐngjiàoxuéshìxùnchuànlièzhīwénzìxiéqǔ AT xiāoqíyuán fùzábèijǐngjiàoxuéshìxùnchuànlièzhīwénzìxiéqǔ
_version_	1718222537749430272

Text Extraction for Lecture Videos with Complicated Background

Similar Items