Text Extraction for Lecture Videos with Complicated Background

碩士 === 元智大學 === 資訊工程學系 === 99 === In terms of streaming media and internet are used more and more frequently, the era of e-Learning emerges. In the e-learning system, learners can access lecture videos no matter when and where. Thus, it is imperative to provide an effective method to retrieve lectur...

Full description

Bibliographic Details
Main Authors: Chi-Yuang Shaio, 蕭琪元
Other Authors: 陳淑媛
Format: Others
Language:en_US
Published: 2011
Online Access:http://ndltd.ncl.edu.tw/handle/62867350530367525320
id ndltd-TW-099YZU05392009
record_format oai_dc
spelling ndltd-TW-099YZU053920092016-04-13T04:16:58Z http://ndltd.ncl.edu.tw/handle/62867350530367525320 Text Extraction for Lecture Videos with Complicated Background 複雜背景教學視訊串列之文字擷取 Chi-Yuang Shaio 蕭琪元 碩士 元智大學 資訊工程學系 99 In terms of streaming media and internet are used more and more frequently, the era of e-Learning emerges. In the e-learning system, learners can access lecture videos no matter when and where. Thus, it is imperative to provide an effective method to retrieve lecture videos conveniently and friendly. In the thesis, text extraction for lecture videos with complicated background is proposed to facilitate lecture video retrieval using textual keywords. Since background of lecture videos may be rather complicated and fancy, in particular may have textual characteristics, foreground segmentation method is designed to extract texts region. On the other hand, since the resolution of lecture videos is generally low, how to enhance the quality of texts to facilitate the consequent text recognition is the other issue in this thesis. First, temporal analysis of lecture videos is performed to detect slide transitions. The frames corresponding to those frames between slide transitions are then merged into a key frame to represent the slide. The consequent process can then be applied to the key frame only so as to reduce computing time. Second, local features are extracted from block partition of slide-like key frame, based on which background model are generated followed by foreground extraction. Finally, for each text region extracted from foregrounds, quality improvement and adaptive binarization are employed to facilitate consequent optical character recognition. The recognition accuracy rate is used to evaluate the performance of the proposed method and to compare with existing methods. Various experiments prove that the effectiveness and feasibility of our method. 陳淑媛 2011 學位論文 ; thesis 47 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 元智大學 === 資訊工程學系 === 99 === In terms of streaming media and internet are used more and more frequently, the era of e-Learning emerges. In the e-learning system, learners can access lecture videos no matter when and where. Thus, it is imperative to provide an effective method to retrieve lecture videos conveniently and friendly. In the thesis, text extraction for lecture videos with complicated background is proposed to facilitate lecture video retrieval using textual keywords. Since background of lecture videos may be rather complicated and fancy, in particular may have textual characteristics, foreground segmentation method is designed to extract texts region. On the other hand, since the resolution of lecture videos is generally low, how to enhance the quality of texts to facilitate the consequent text recognition is the other issue in this thesis. First, temporal analysis of lecture videos is performed to detect slide transitions. The frames corresponding to those frames between slide transitions are then merged into a key frame to represent the slide. The consequent process can then be applied to the key frame only so as to reduce computing time. Second, local features are extracted from block partition of slide-like key frame, based on which background model are generated followed by foreground extraction. Finally, for each text region extracted from foregrounds, quality improvement and adaptive binarization are employed to facilitate consequent optical character recognition. The recognition accuracy rate is used to evaluate the performance of the proposed method and to compare with existing methods. Various experiments prove that the effectiveness and feasibility of our method.
author2 陳淑媛
author_facet 陳淑媛
Chi-Yuang Shaio
蕭琪元
author Chi-Yuang Shaio
蕭琪元
spellingShingle Chi-Yuang Shaio
蕭琪元
Text Extraction for Lecture Videos with Complicated Background
author_sort Chi-Yuang Shaio
title Text Extraction for Lecture Videos with Complicated Background
title_short Text Extraction for Lecture Videos with Complicated Background
title_full Text Extraction for Lecture Videos with Complicated Background
title_fullStr Text Extraction for Lecture Videos with Complicated Background
title_full_unstemmed Text Extraction for Lecture Videos with Complicated Background
title_sort text extraction for lecture videos with complicated background
publishDate 2011
url http://ndltd.ncl.edu.tw/handle/62867350530367525320
work_keys_str_mv AT chiyuangshaio textextractionforlecturevideoswithcomplicatedbackground
AT xiāoqíyuán textextractionforlecturevideoswithcomplicatedbackground
AT chiyuangshaio fùzábèijǐngjiàoxuéshìxùnchuànlièzhīwénzìxiéqǔ
AT xiāoqíyuán fùzábèijǐngjiàoxuéshìxùnchuànlièzhīwénzìxiéqǔ
_version_ 1718222537749430272