Text Extraction for Lecture Videos with Complicated Background
碩士 === 元智大學 === 資訊工程學系 === 99 === In terms of streaming media and internet are used more and more frequently, the era of e-Learning emerges. In the e-learning system, learners can access lecture videos no matter when and where. Thus, it is imperative to provide an effective method to retrieve lectur...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2011
|
Online Access: | http://ndltd.ncl.edu.tw/handle/62867350530367525320 |
id |
ndltd-TW-099YZU05392009 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-099YZU053920092016-04-13T04:16:58Z http://ndltd.ncl.edu.tw/handle/62867350530367525320 Text Extraction for Lecture Videos with Complicated Background 複雜背景教學視訊串列之文字擷取 Chi-Yuang Shaio 蕭琪元 碩士 元智大學 資訊工程學系 99 In terms of streaming media and internet are used more and more frequently, the era of e-Learning emerges. In the e-learning system, learners can access lecture videos no matter when and where. Thus, it is imperative to provide an effective method to retrieve lecture videos conveniently and friendly. In the thesis, text extraction for lecture videos with complicated background is proposed to facilitate lecture video retrieval using textual keywords. Since background of lecture videos may be rather complicated and fancy, in particular may have textual characteristics, foreground segmentation method is designed to extract texts region. On the other hand, since the resolution of lecture videos is generally low, how to enhance the quality of texts to facilitate the consequent text recognition is the other issue in this thesis. First, temporal analysis of lecture videos is performed to detect slide transitions. The frames corresponding to those frames between slide transitions are then merged into a key frame to represent the slide. The consequent process can then be applied to the key frame only so as to reduce computing time. Second, local features are extracted from block partition of slide-like key frame, based on which background model are generated followed by foreground extraction. Finally, for each text region extracted from foregrounds, quality improvement and adaptive binarization are employed to facilitate consequent optical character recognition. The recognition accuracy rate is used to evaluate the performance of the proposed method and to compare with existing methods. Various experiments prove that the effectiveness and feasibility of our method. 陳淑媛 2011 學位論文 ; thesis 47 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 元智大學 === 資訊工程學系 === 99 === In terms of streaming media and internet are used more and more frequently, the era of e-Learning emerges. In the e-learning system, learners can access lecture videos no matter when and where. Thus, it is imperative to provide an effective method to retrieve lecture videos conveniently and friendly. In the thesis, text extraction for lecture videos with complicated background is proposed to facilitate lecture video retrieval using textual keywords.
Since background of lecture videos may be rather complicated and fancy, in particular may have textual characteristics, foreground segmentation method is designed to extract texts region. On the other hand, since the resolution of lecture videos is generally low, how to enhance the quality of texts to facilitate the consequent text recognition is the other issue in this thesis.
First, temporal analysis of lecture videos is performed to detect slide transitions. The frames corresponding to those frames between slide transitions are then merged into a key frame to represent the slide. The consequent process can then be applied to the key frame only so as to reduce computing time. Second, local features are extracted from block partition of slide-like key frame, based on which background model are generated followed by foreground extraction. Finally, for each text region extracted from foregrounds, quality improvement and adaptive binarization are employed to facilitate consequent optical character recognition. The recognition accuracy rate is used to evaluate the performance of the proposed method and to compare with existing methods. Various experiments prove that the effectiveness and feasibility of our method.
|
author2 |
陳淑媛 |
author_facet |
陳淑媛 Chi-Yuang Shaio 蕭琪元 |
author |
Chi-Yuang Shaio 蕭琪元 |
spellingShingle |
Chi-Yuang Shaio 蕭琪元 Text Extraction for Lecture Videos with Complicated Background |
author_sort |
Chi-Yuang Shaio |
title |
Text Extraction for Lecture Videos with Complicated Background |
title_short |
Text Extraction for Lecture Videos with Complicated Background |
title_full |
Text Extraction for Lecture Videos with Complicated Background |
title_fullStr |
Text Extraction for Lecture Videos with Complicated Background |
title_full_unstemmed |
Text Extraction for Lecture Videos with Complicated Background |
title_sort |
text extraction for lecture videos with complicated background |
publishDate |
2011 |
url |
http://ndltd.ncl.edu.tw/handle/62867350530367525320 |
work_keys_str_mv |
AT chiyuangshaio textextractionforlecturevideoswithcomplicatedbackground AT xiāoqíyuán textextractionforlecturevideoswithcomplicatedbackground AT chiyuangshaio fùzábèijǐngjiàoxuéshìxùnchuànlièzhīwénzìxiéqǔ AT xiāoqíyuán fùzábèijǐngjiàoxuéshìxùnchuànlièzhīwénzìxiéqǔ |
_version_ |
1718222537749430272 |