A digital image preprocessor for optical character recognition

Optical character recognition requires that data acquired from a camera device first be reconditioned into a suitable form. This thesis presents a design study of a preprocessor intended to accomplish this task under the constraints of real-time, autonomous operation as part of a reading machine for...

Full description

Bibliographic Details
Main Author: Lunscher, Wolfram H.H.J.
Language:English
Published: 2010
Online Access:http://hdl.handle.net/2429/24077
id ndltd-UBC-oai-circle.library.ubc.ca-2429-24077
record_format oai_dc
spelling ndltd-UBC-oai-circle.library.ubc.ca-2429-240772018-01-05T17:42:27Z A digital image preprocessor for optical character recognition Lunscher, Wolfram H.H.J. Optical character recognition requires that data acquired from a camera device first be reconditioned into a suitable form. This thesis presents a design study of a preprocessor intended to accomplish this task under the constraints of real-time, autonomous operation as part of a reading machine for the blind. The preprocessor contains three components: a filter to reduce the influence of noise and enhance the text seen; a binarization stage to identify the character and background pixels by a single bit; and a segmention system which isolates individual characters for delivery to the recognizer in proper causal order. Filtering of the acquired video information is performed with the Laplacian of a Gaussian, V²g, edge detection operator developed by D. Marr. This filter is shown to locate the character edges and to control the amount of detail seen optimally. Developed from models of the human visual system, this filter promises that the preprocessor could attain a human text resolution capability. Binarization is also reduced to a simple thresholding of the filter's output. To achieve optimal enhancement of print structures of known dimensions a filter design strategy is presented incorporating two periodic edge models. Since the filter must be digitized, the design method is further extended to include an analysis of the effects of sampling the continuous filter and quantizing its coefficients for a direct-form finite impulse response implementation. To validate the claim that this filter's performance is superior to other edge detection methods, a chapter is devoted to quantitative evaluation of V²g filtered test images. The results are compared to others published and found superior, as well as remarkably noise immune. Segmentation of the binarized text is accomplished with a technique adapted from the binary-image description method of C.T. Zahn. The text images are reduced to a hybrid chain-code-and-coordinate description of all internal and external boundaries concurrent with the incoming raster-acquired data. No more than two image scan lines need be stored. All closed borders within the text are detected immediately as they occur by monitoring the local changes in the image's Euler number during a scan, and verifying global closure by following pointers through a linked-list data structure. A simulation of an implementation of the segmentation system, operating on fifty V²g filtered test images, indicated that the real-time performance objective can be achieved through a combined serial and parallel architecture. Applied Science, Faculty of Electrical and Computer Engineering, Department of Graduate 2010-04-22T22:51:06Z 2010-04-22T22:51:06Z 1983 Text Thesis/Dissertation http://hdl.handle.net/2429/24077 eng For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use.
collection NDLTD
language English
sources NDLTD
description Optical character recognition requires that data acquired from a camera device first be reconditioned into a suitable form. This thesis presents a design study of a preprocessor intended to accomplish this task under the constraints of real-time, autonomous operation as part of a reading machine for the blind. The preprocessor contains three components: a filter to reduce the influence of noise and enhance the text seen; a binarization stage to identify the character and background pixels by a single bit; and a segmention system which isolates individual characters for delivery to the recognizer in proper causal order. Filtering of the acquired video information is performed with the Laplacian of a Gaussian, V²g, edge detection operator developed by D. Marr. This filter is shown to locate the character edges and to control the amount of detail seen optimally. Developed from models of the human visual system, this filter promises that the preprocessor could attain a human text resolution capability. Binarization is also reduced to a simple thresholding of the filter's output. To achieve optimal enhancement of print structures of known dimensions a filter design strategy is presented incorporating two periodic edge models. Since the filter must be digitized, the design method is further extended to include an analysis of the effects of sampling the continuous filter and quantizing its coefficients for a direct-form finite impulse response implementation. To validate the claim that this filter's performance is superior to other edge detection methods, a chapter is devoted to quantitative evaluation of V²g filtered test images. The results are compared to others published and found superior, as well as remarkably noise immune. Segmentation of the binarized text is accomplished with a technique adapted from the binary-image description method of C.T. Zahn. The text images are reduced to a hybrid chain-code-and-coordinate description of all internal and external boundaries concurrent with the incoming raster-acquired data. No more than two image scan lines need be stored. All closed borders within the text are detected immediately as they occur by monitoring the local changes in the image's Euler number during a scan, and verifying global closure by following pointers through a linked-list data structure. A simulation of an implementation of the segmentation system, operating on fifty V²g filtered test images, indicated that the real-time performance objective can be achieved through a combined serial and parallel architecture. === Applied Science, Faculty of === Electrical and Computer Engineering, Department of === Graduate
author Lunscher, Wolfram H.H.J.
spellingShingle Lunscher, Wolfram H.H.J.
A digital image preprocessor for optical character recognition
author_facet Lunscher, Wolfram H.H.J.
author_sort Lunscher, Wolfram H.H.J.
title A digital image preprocessor for optical character recognition
title_short A digital image preprocessor for optical character recognition
title_full A digital image preprocessor for optical character recognition
title_fullStr A digital image preprocessor for optical character recognition
title_full_unstemmed A digital image preprocessor for optical character recognition
title_sort digital image preprocessor for optical character recognition
publishDate 2010
url http://hdl.handle.net/2429/24077
work_keys_str_mv AT lunscherwolframhhj adigitalimagepreprocessorforopticalcharacterrecognition
AT lunscherwolframhhj digitalimagepreprocessorforopticalcharacterrecognition
_version_ 1718592435166117888