Schema labelling applied to hand-printed Chinese character recognition
Hand-printed Chinese character recognition presents an interesting problem for Artificial Intelligence research. Input data in the form of arrays of pixel values cannot be directly mapped to unique character identifications because of the complexity of the characters. Thus, intermediate data structu...
Main Author: | |
---|---|
Language: | English |
Published: |
University of British Columbia
2010
|
Subjects: | |
Online Access: | http://hdl.handle.net/2429/26175 |
id |
ndltd-UBC-oai-circle.library.ubc.ca-2429-26175 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-UBC-oai-circle.library.ubc.ca-2429-261752018-01-05T17:43:30Z Schema labelling applied to hand-printed Chinese character recognition Bult, Timothy Paul Optical pattern recognition Chinese characters Hand-printed Chinese character recognition presents an interesting problem for Artificial Intelligence research. Input data in the form of arrays of pixel values cannot be directly mapped to unique character identifications because of the complexity of the characters. Thus, intermediate data structures are necessary, which in turn lead to a need to represent knowledge of the characters' composition. Building the intermediate constructs for these hand-printed characters necessarily involves choices among ambiguities, the set of which is so large that an efficient search algorithm becomes central to the recognition process. Schema labelling is a theory of how knowledge should be organized for recognition tasks in which composition structure is inherent in the domain, the composition entails ambiguity, and the ambiguity generates large search spaces. This thesis describes an implementation of an enhanced version of schema labelling for Chinese characters. The specific problems addressed by the enhancements, with some success, are (i) the segmentation of real images into objects usable by the schema system, (ii) the definition of schemas which adequately describe the generic composition of hand-printed Chinese characters, as well as common variations or vagaries, and (iii) the inclusion of sufficient "control knowledge" to prevent combinatorial explosion of the backtracking recognition process. Test characters for recognition systems can be classified along several dimensions. On the spectrum from type-set, through hand-printed, to hand-written forms, our system was tested on restricted hand-print, at a level somewhat more difficult than is normally attempted. On the spectrum of input types, from grey-scale pixel input through on-line stroke representations, our system was fully tested only at the high end, with complete synthetic strokes. We obtained a success rate of 57%, 12 out of the 21 characters tested. The principal success of the work is that characters of the complexity tested could be recognized at all, and in the impact schema labelling techniques had on that recognition. Science, Faculty of Computer Science, Department of Graduate 2010-07-07T19:55:28Z 2010-07-07T19:55:28Z 1987 Text Thesis/Dissertation http://hdl.handle.net/2429/26175 eng For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use. University of British Columbia |
collection |
NDLTD |
language |
English |
sources |
NDLTD |
topic |
Optical pattern recognition Chinese characters |
spellingShingle |
Optical pattern recognition Chinese characters Bult, Timothy Paul Schema labelling applied to hand-printed Chinese character recognition |
description |
Hand-printed Chinese character recognition presents an interesting problem for Artificial Intelligence research. Input data in the form of arrays of pixel values cannot be directly mapped to unique character identifications because of the complexity of the characters. Thus, intermediate data structures are necessary, which in turn lead to a need to represent knowledge of the characters' composition. Building the intermediate constructs for these hand-printed characters necessarily involves choices among ambiguities, the set of which is so large that an efficient search algorithm becomes central to the recognition process.
Schema labelling is a theory of how knowledge should be organized for recognition tasks in which composition structure is inherent in the domain, the composition entails ambiguity, and the ambiguity generates large search spaces. This thesis describes an implementation of an enhanced version of schema labelling for Chinese characters. The specific problems addressed by the enhancements, with some success, are (i) the segmentation of real images into objects usable by the schema system, (ii) the definition of schemas which adequately describe the generic composition of hand-printed Chinese characters, as well as common variations or vagaries, and (iii) the inclusion of sufficient "control knowledge" to prevent combinatorial explosion of the backtracking recognition process.
Test characters for recognition systems can be classified along several dimensions. On the spectrum from type-set, through hand-printed, to hand-written forms, our system was tested on restricted hand-print, at a level somewhat more difficult than is normally attempted. On the spectrum of input types, from grey-scale pixel input through on-line stroke representations, our system was fully tested only at the high end, with complete synthetic strokes. We obtained a success rate of 57%, 12 out of the 21 characters tested. The principal success of the work is that characters of the complexity tested could be recognized at all, and in the impact schema labelling techniques had on that recognition. === Science, Faculty of === Computer Science, Department of === Graduate |
author |
Bult, Timothy Paul |
author_facet |
Bult, Timothy Paul |
author_sort |
Bult, Timothy Paul |
title |
Schema labelling applied to hand-printed Chinese character recognition |
title_short |
Schema labelling applied to hand-printed Chinese character recognition |
title_full |
Schema labelling applied to hand-printed Chinese character recognition |
title_fullStr |
Schema labelling applied to hand-printed Chinese character recognition |
title_full_unstemmed |
Schema labelling applied to hand-printed Chinese character recognition |
title_sort |
schema labelling applied to hand-printed chinese character recognition |
publisher |
University of British Columbia |
publishDate |
2010 |
url |
http://hdl.handle.net/2429/26175 |
work_keys_str_mv |
AT bulttimothypaul schemalabellingappliedtohandprintedchinesecharacterrecognition |
_version_ |
1718593019737800704 |