Summary: | 碩士 === 國立交通大學 === 資訊工程系 === 88 === The goal of this thesis is to propose a general Chinese document processing systems which consists of three modules: preprocessing, recognition kernel, and postprocessing. In the preprocessing module, input images probably have small skew angles. These skew angles will affect the performance of character segmentation and character recognition. A skew angle detection method is used and a modified rotate transform is proposed to rotate document images. In our system, sentences and characters must be extracted for recognition engines. For this purpose, document images must be segmented into text blocks, text lines, and character images. After we detect the punctuation marks in the character images, we construct sentences from character images.
In the recognition module, we use two recognition engines to recognize the character images. Contour directional features and crossing count features are selected for kernel 1 and Oka''s cellular features and peripheral background area features are selected for kernel 2. The weights of these kernels and features are related to the relative stroke widths of character images which provide measurements about character image quality. When we construct recognition engines, the features are trained from a character image database selecting from document images. To provide more robust training features to increase the recognition rate, bad features instead of bad images are removed in the feature training process.
In the post-processing module, a simplified language model is used. The model includes word selection bound setting, matching order establishing, fast word matching, and most-confident word selection. By using this model, the processing can be speed-up.
The experiments performed on more than 40 articles images show the system we propose here is very effective and efficient.
|