Handwritten Indic Script Recognition Based on the Dempster–Shafer Theory of Evidence

In a multilingual country like India, script recognition is an important pre-processing footstep necessary for feeding any document to an optical character recognition (OCR) engine, which is, in general, script specific. The present work evaluates the performance of an ensemble of two MLP (multi-lay...

Full description

Bibliographic Details
Main Authors: Mukhopadhyay Anirban, Singh Pawan Kumar, Sarkar Ram, Nasipuri Mita
Format: Article
Language:English
Published: De Gruyter 2018-02-01
Series:Journal of Intelligent Systems
Subjects:
Online Access:https://doi.org/10.1515/jisys-2017-0431
id doaj-0aa1ec3141ec4bf29b534219067d0e85
record_format Article
spelling doaj-0aa1ec3141ec4bf29b534219067d0e852021-09-06T19:40:38ZengDe GruyterJournal of Intelligent Systems0334-18602191-026X2018-02-0129126428210.1515/jisys-2017-0431Handwritten Indic Script Recognition Based on the Dempster–Shafer Theory of EvidenceMukhopadhyay Anirban0Singh Pawan Kumar1Sarkar Ram2Nasipuri Mita3Department of Computer Science and Engineering, Jadavpur University, 188 Raja S.C. Mullick Road, Kolkata-700032, West Bengal, IndiaDepartment of Computer Science and Engineering, Jadavpur University, 188 Raja S.C. Mullick Road, Kolkata-700032, West Bengal, IndiaDepartment of Computer Science and Engineering, Jadavpur University, 188 Raja S.C. Mullick Road, Kolkata-700032, West Bengal, IndiaDepartment of Computer Science and Engineering, Jadavpur University, 188 Raja S.C. Mullick Road, Kolkata-700032, West Bengal, IndiaIn a multilingual country like India, script recognition is an important pre-processing footstep necessary for feeding any document to an optical character recognition (OCR) engine, which is, in general, script specific. The present work evaluates the performance of an ensemble of two MLP (multi-layer perceptron) classifiers, each trained on different feature sets. Here, two complementary sets of features, namely, gray-level co-occurrence matrix (GLCM) and Gabor wavelets transform coefficients are extracted from each of the handwritten text-line and word images written in 12 official scripts used in Indian subcontinent, which are then fed into an individual classifier. In order to improve the overall recognition rate, a powerful combination approach based on the Dempster–Shafer (DS) theory is finally employed to fuse the decisions of two MLP classifiers. The performance of the combined decision is compared with those of the individual classifiers, and it is noted that a significant improvement in recognition accuracy (about 4% for text-line data and 6% for word level data) has been achieved by the proposed methodology.https://doi.org/10.1515/jisys-2017-0431handwritten script recognitiondempster–shafer theoryindic scriptsgray-level co-occurrence matrixgabor wavelet transform
collection DOAJ
language English
format Article
sources DOAJ
author Mukhopadhyay Anirban
Singh Pawan Kumar
Sarkar Ram
Nasipuri Mita
spellingShingle Mukhopadhyay Anirban
Singh Pawan Kumar
Sarkar Ram
Nasipuri Mita
Handwritten Indic Script Recognition Based on the Dempster–Shafer Theory of Evidence
Journal of Intelligent Systems
handwritten script recognition
dempster–shafer theory
indic scripts
gray-level co-occurrence matrix
gabor wavelet transform
author_facet Mukhopadhyay Anirban
Singh Pawan Kumar
Sarkar Ram
Nasipuri Mita
author_sort Mukhopadhyay Anirban
title Handwritten Indic Script Recognition Based on the Dempster–Shafer Theory of Evidence
title_short Handwritten Indic Script Recognition Based on the Dempster–Shafer Theory of Evidence
title_full Handwritten Indic Script Recognition Based on the Dempster–Shafer Theory of Evidence
title_fullStr Handwritten Indic Script Recognition Based on the Dempster–Shafer Theory of Evidence
title_full_unstemmed Handwritten Indic Script Recognition Based on the Dempster–Shafer Theory of Evidence
title_sort handwritten indic script recognition based on the dempster–shafer theory of evidence
publisher De Gruyter
series Journal of Intelligent Systems
issn 0334-1860
2191-026X
publishDate 2018-02-01
description In a multilingual country like India, script recognition is an important pre-processing footstep necessary for feeding any document to an optical character recognition (OCR) engine, which is, in general, script specific. The present work evaluates the performance of an ensemble of two MLP (multi-layer perceptron) classifiers, each trained on different feature sets. Here, two complementary sets of features, namely, gray-level co-occurrence matrix (GLCM) and Gabor wavelets transform coefficients are extracted from each of the handwritten text-line and word images written in 12 official scripts used in Indian subcontinent, which are then fed into an individual classifier. In order to improve the overall recognition rate, a powerful combination approach based on the Dempster–Shafer (DS) theory is finally employed to fuse the decisions of two MLP classifiers. The performance of the combined decision is compared with those of the individual classifiers, and it is noted that a significant improvement in recognition accuracy (about 4% for text-line data and 6% for word level data) has been achieved by the proposed methodology.
topic handwritten script recognition
dempster–shafer theory
indic scripts
gray-level co-occurrence matrix
gabor wavelet transform
url https://doi.org/10.1515/jisys-2017-0431
work_keys_str_mv AT mukhopadhyayanirban handwrittenindicscriptrecognitionbasedonthedempstershafertheoryofevidence
AT singhpawankumar handwrittenindicscriptrecognitionbasedonthedempstershafertheoryofevidence
AT sarkarram handwrittenindicscriptrecognitionbasedonthedempstershafertheoryofevidence
AT nasipurimita handwrittenindicscriptrecognitionbasedonthedempstershafertheoryofevidence
_version_ 1717767977103785984