AUTOMATIC EXTRACTION OF TEXT IN IMAGES
Master's === Tatung University === Department of Electrical Engineering === 92 === Text detection and extraction has been an active research topic in recent years. It has many practical applications, including multimedia systems, digital libraries, and geographical information systems. In this thesis, we present a system, based on some generic featur...
Main Authors: | Chih-Hao Chen, 陳志豪 |
---|---|
Other Authors: | Jia-Ching Cheng, 鄭嘉慶 |
Format: | Others |
Language: | en_US |
Published: | 2004 |
Online Access: | http://ndltd.ncl.edu.tw/handle/51693396047378099777 |
id |
ndltd-TW-092TTU00442037 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-092TTU00442037 2016-06-15T04:17:09Z http://ndltd.ncl.edu.tw/handle/51693396047378099777 AUTOMATIC EXTRACTION OF TEXT IN IMAGES 影像中的文字之自動粹取 Chih-Hao Chen 陳志豪 碩士 大同大學 電機工程學系(所) 92 Text detection and extraction has been an active research topic in recent years. It has many practical applications, including multimedia systems, digital libraries, and geographical information systems. In this thesis, we present a system that detects and extracts overlay text in digital images, based on generic features of text: homogeneity, physical constraints, and dominant stroke orientations. First, we use the dominant stroke orientations to devise an edge detector suited to detecting edges along these orientations. Next, physical constraints are used to design an edge enhancer capable of extracting dense clusters containing text edges. Physical constraints are then further imposed to devise a text region detector consisting of (i) cluster parsing and regrouping, which organize text into suitable units, (ii) skew rectification, which aligns the text region horizontally, and (iii) text region verification, which filters out false alarms from non-text regions. Finally, a specialized filter based on the dominant stroke orientations and an optimal thresholding based on the property of homogeneity are adopted to extract text embedded in complex backgrounds. We have applied the proposed system to detect and extract text of different languages in images. Experimental results show that it is robust to variations in contrast, font size, font color, and background complexity. Jia-Ching Cheng 鄭嘉慶 2004 學位論文 ; thesis 41 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others |
sources |
NDLTD |
description |
Master's === Tatung University === Department of Electrical Engineering === 92 === Text detection and extraction has been an active research topic in recent years. It has many practical applications, including multimedia systems, digital libraries, and geographical information systems.
In this thesis, we present a system that detects and extracts overlay text in digital images, based on generic features of text: homogeneity, physical constraints, and dominant stroke orientations. First, we use the dominant stroke orientations to devise an edge detector suited to detecting edges along these orientations. Next, physical constraints are used to design an edge enhancer capable of extracting dense clusters containing text edges. Physical constraints are then further imposed to devise a text region detector consisting of (i) cluster parsing and regrouping, which organize text into suitable units, (ii) skew rectification, which aligns the text region horizontally, and (iii) text region verification, which filters out false alarms from non-text regions. Finally, a specialized filter based on the dominant stroke orientations and an optimal thresholding based on the property of homogeneity are adopted to extract text embedded in complex backgrounds.
We have applied the proposed system to detect and extract text of different languages in images. Experimental results show that it is robust to variations in contrast, font size, font color, and background complexity.
|
author2 |
Jia-Ching Cheng |
author_facet |
Jia-Ching Cheng Chih-Hao Chen 陳志豪 |
author |
Chih-Hao Chen 陳志豪 |
spellingShingle |
Chih-Hao Chen 陳志豪 AUTOMATIC EXTRACTION OF TEXT IN IMAGES |
author_sort |
Chih-Hao Chen |
title |
AUTOMATIC EXTRACTION OF TEXT IN IMAGES |
title_short |
AUTOMATIC EXTRACTION OF TEXT IN IMAGES |
title_full |
AUTOMATIC EXTRACTION OF TEXT IN IMAGES |
title_fullStr |
AUTOMATIC EXTRACTION OF TEXT IN IMAGES |
title_full_unstemmed |
AUTOMATIC EXTRACTION OF TEXT IN IMAGES |
title_sort |
automatic extraction of text in images |
publishDate |
2004 |
url |
http://ndltd.ncl.edu.tw/handle/51693396047378099777 |
work_keys_str_mv |
AT chihhaochen automaticextractionoftextinimages AT chénzhìháo automaticextractionoftextinimages AT chihhaochen yǐngxiàngzhōngdewénzìzhīzìdòngcuìqǔ AT chénzhìháo yǐngxiàngzhōngdewénzìzhīzìdòngcuìqǔ |
_version_ |
1718305422041939968 |
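
Illustrative sketch (not part of the catalog record): the abstract names an edge detector tuned to dominant stroke orientations and an "optimal thresholding" step, but the record gives neither the kernels nor the thresholding criterion. The code below is a minimal sketch under two assumptions that the record does not confirm: Prewitt-style 3x3 kernels for four orientations, and Otsu's between-class-variance criterion for the threshold. The names `KERNELS`, `directional_edges`, and `otsu_threshold` are hypothetical.

```python
from typing import List, Sequence

Image = List[List[int]]  # grayscale image as rows of integers in 0..255

# Assumed Prewitt-style kernels for four stroke orientations.
# The thesis abstract does not list its actual kernels.
KERNELS = {
    "horizontal": [[-1, -1, -1], [0, 0, 0], [1, 1, 1]],
    "vertical": [[-1, 0, 1], [-1, 0, 1], [-1, 0, 1]],
    "diag_45": [[0, 1, 1], [-1, 0, 1], [-1, -1, 0]],
    "diag_135": [[1, 1, 0], [1, 0, -1], [0, -1, -1]],
}


def directional_edges(img: Image) -> Image:
    """For each interior pixel, keep the largest absolute kernel response.

    Border pixels are left at 0 because the 3x3 window does not fit there.
    """
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            best = 0
            for kernel in KERNELS.values():
                response = sum(
                    kernel[j][i] * img[y - 1 + j][x - 1 + i]
                    for j in range(3)
                    for i in range(3)
                )
                best = max(best, abs(response))
            out[y][x] = best
    return out


def otsu_threshold(pixels: Sequence[int], levels: int = 256) -> int:
    """Return the level t that maximizes between-class variance (Otsu).

    Pixels <= t form one class and pixels > t the other. Using Otsu here is
    an assumption; the abstract only says "optimal thresholding".
    """
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0    # number of pixels with value <= t
    sum0 = 0  # sum of those pixel values
    for t in range(levels):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue
        w1 = total - w0
        if w1 == 0:
            break
        mean0 = sum0 / w0
        mean1 = (sum_all - sum0) / w1
        between_var = w0 * w1 * (mean0 - mean1) ** 2
        if between_var > best_var:
            best_t, best_var = t, between_var
    return best_t
```

In a pipeline like the one the abstract describes, the edge map would feed the clustering and region-detection stages, and the threshold would be computed per verified text region rather than over the whole image; both of those placements are likewise assumptions.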