AUTOMATIC EXTRACTION OF TEXT IN IMAGES

碩士 === 大同大學 === 電機工程學系(所) === 92 === Text detection and extraction is an active research topic in recent years. It has many practical applications including multimedia systems, digital libraries, and geographical information systems. In this thesis, we present a system, based on some generic featur...

Full description

Bibliographic Details
Main Authors: Chih-Hao Chen, 陳志豪
Other Authors: Jia-Ching Cheng
Format: Others
Language:en_US
Published: 2004
Online Access:http://ndltd.ncl.edu.tw/handle/51693396047378099777
id ndltd-TW-092TTU00442037
record_format oai_dc
spelling ndltd-TW-092TTU004420372016-06-15T04:17:09Z http://ndltd.ncl.edu.tw/handle/51693396047378099777 AUTOMATIC EXTRACTION OF TEXT IN IMAGES 影像中的文字之自動粹取 Chih-Hao Chen 陳志豪 碩士 大同大學 電機工程學系(所) 92 Text detection and extraction is an active research topic in recent years. It has many practical applications including multimedia systems, digital libraries, and geographical information systems. In this thesis, we present a system, based on some generic features of text including homogeneity, physical constraints, and dominant stroke orientations, to detect and extract overlay text in digital images. First, we adopt the characteristic of dominant stroke orientations to devise an edge detector suitable for detecting edges along these dominant orientations. Next, physical constraints are induced to design an edge enhancer capable of extracting dense clusters containing text edges. Then physical constraints are further imposed to devise an text region detector consisting of (i) cluster parsing and regrouping which organize text in suitable units, (ii) a skew rectification which aligns the text region horizontally , and (iii) a text region verification which filters out false alarms of non-text regions. Finally, a specialized filter based on the features of dominant stroke orientations and an optimal thresholding based on the property of homogeneity are adopted to extract text embedded in complex backgrounds. We have applied the proposed system to detect and extract text of different languages in images. Experimental results show that it is robust for contrast, font-size, font-color, and background complexity Jia-Ching Cheng 鄭嘉慶 2004 學位論文 ; thesis 41 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 大同大學 === 電機工程學系(所) === 92 === Text detection and extraction is an active research topic in recent years. It has many practical applications including multimedia systems, digital libraries, and geographical information systems. In this thesis, we present a system, based on some generic features of text including homogeneity, physical constraints, and dominant stroke orientations, to detect and extract overlay text in digital images. First, we adopt the characteristic of dominant stroke orientations to devise an edge detector suitable for detecting edges along these dominant orientations. Next, physical constraints are induced to design an edge enhancer capable of extracting dense clusters containing text edges. Then physical constraints are further imposed to devise an text region detector consisting of (i) cluster parsing and regrouping which organize text in suitable units, (ii) a skew rectification which aligns the text region horizontally , and (iii) a text region verification which filters out false alarms of non-text regions. Finally, a specialized filter based on the features of dominant stroke orientations and an optimal thresholding based on the property of homogeneity are adopted to extract text embedded in complex backgrounds. We have applied the proposed system to detect and extract text of different languages in images. Experimental results show that it is robust for contrast, font-size, font-color, and background complexity
author2 Jia-Ching Cheng
author_facet Jia-Ching Cheng
Chih-Hao Chen
陳志豪
author Chih-Hao Chen
陳志豪
spellingShingle Chih-Hao Chen
陳志豪
AUTOMATIC EXTRACTION OF TEXT IN IMAGES
author_sort Chih-Hao Chen
title AUTOMATIC EXTRACTION OF TEXT IN IMAGES
title_short AUTOMATIC EXTRACTION OF TEXT IN IMAGES
title_full AUTOMATIC EXTRACTION OF TEXT IN IMAGES
title_fullStr AUTOMATIC EXTRACTION OF TEXT IN IMAGES
title_full_unstemmed AUTOMATIC EXTRACTION OF TEXT IN IMAGES
title_sort automatic extraction of text in images
publishDate 2004
url http://ndltd.ncl.edu.tw/handle/51693396047378099777
work_keys_str_mv AT chihhaochen automaticextractionoftextinimages
AT chénzhìháo automaticextractionoftextinimages
AT chihhaochen yǐngxiàngzhōngdewénzìzhīzìdòngcuìqǔ
AT chénzhìháo yǐngxiàngzhōngdewénzìzhīzìdòngcuìqǔ
_version_ 1718305422041939968