Recognizing Cursive Typewritten Text Using Segmentation-Free System

Feature extraction plays an important role in text recognition as it aims to capture essential characteristics of the text image. Feature extraction algorithms widely range between robust and hard to extract features and noise sensitive and easy to extract features. Among those feature types are sta...

Full description

Bibliographic Details
Main Author: Mohammad S. Khorsheed
Format: Article
Language:English
Published: Hindawi Limited 2015-01-01
Series:The Scientific World Journal
Online Access:http://dx.doi.org/10.1155/2015/818432
id doaj-e8651102f450437da5e1670ee44f689d
record_format Article
spelling doaj-e8651102f450437da5e1670ee44f689d2020-11-24T22:15:14ZengHindawi LimitedThe Scientific World Journal2356-61401537-744X2015-01-01201510.1155/2015/818432818432Recognizing Cursive Typewritten Text Using Segmentation-Free SystemMohammad S. Khorsheed0National Center for Robotics and Intelligent Systems, King Abdulaziz City for Science & Technology, P.O. Box 6086, Riyadh 11442, Saudi ArabiaFeature extraction plays an important role in text recognition as it aims to capture essential characteristics of the text image. Feature extraction algorithms widely range between robust and hard to extract features and noise sensitive and easy to extract features. Among those feature types are statistical features which are derived from the statistical distribution of the image pixels. This paper presents a novel method for feature extraction where simple statistical features are extracted from a one-pixel wide window that slides across the text line. The feature set is clustered in the feature space using vector quantization. The feature vector sequence is then injected to a classification engine for training and recognition purposes. The recognition system is applied to a data corpus which includes cursive Arabic text of more than 600 A4-size sheets typewritten in multiple computer-generated fonts. The system performance is compared to a previously published system from the literature with a similar engine but a different feature set.http://dx.doi.org/10.1155/2015/818432
collection DOAJ
language English
format Article
sources DOAJ
author Mohammad S. Khorsheed
spellingShingle Mohammad S. Khorsheed
Recognizing Cursive Typewritten Text Using Segmentation-Free System
The Scientific World Journal
author_facet Mohammad S. Khorsheed
author_sort Mohammad S. Khorsheed
title Recognizing Cursive Typewritten Text Using Segmentation-Free System
title_short Recognizing Cursive Typewritten Text Using Segmentation-Free System
title_full Recognizing Cursive Typewritten Text Using Segmentation-Free System
title_fullStr Recognizing Cursive Typewritten Text Using Segmentation-Free System
title_full_unstemmed Recognizing Cursive Typewritten Text Using Segmentation-Free System
title_sort recognizing cursive typewritten text using segmentation-free system
publisher Hindawi Limited
series The Scientific World Journal
issn 2356-6140
1537-744X
publishDate 2015-01-01
description Feature extraction plays an important role in text recognition as it aims to capture essential characteristics of the text image. Feature extraction algorithms widely range between robust and hard to extract features and noise sensitive and easy to extract features. Among those feature types are statistical features which are derived from the statistical distribution of the image pixels. This paper presents a novel method for feature extraction where simple statistical features are extracted from a one-pixel wide window that slides across the text line. The feature set is clustered in the feature space using vector quantization. The feature vector sequence is then injected to a classification engine for training and recognition purposes. The recognition system is applied to a data corpus which includes cursive Arabic text of more than 600 A4-size sheets typewritten in multiple computer-generated fonts. The system performance is compared to a previously published system from the literature with a similar engine but a different feature set.
url http://dx.doi.org/10.1155/2015/818432
work_keys_str_mv AT mohammadskhorsheed recognizingcursivetypewrittentextusingsegmentationfreesystem
_version_ 1725795274851352576