An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation Algorithms

The paper introduces a testing framework for the evaluation and validation of text line segmentation algorithms. Text line segmentation represents the key action for correct optical character recognition. Many of the tests for the evaluation of text line segmentation algorithms deal with text databa...

Full description

Bibliographic Details
Main Authors: Zoran N. Milivojevic, Dragan R. Milivojevic, Darko Brodic
Format: Article
Language:English
Published: MDPI AG 2011-09-01
Series:Sensors
Subjects:
Online Access:http://www.mdpi.com/1424-8220/11/9/8782/
id doaj-8d18853f15434e119bb33c222a7823ec
record_format Article
spelling doaj-8d18853f15434e119bb33c222a7823ec2020-11-24T21:50:59ZengMDPI AGSensors1424-82202011-09-011198782881210.3390/s110908782An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation AlgorithmsZoran N. MilivojevicDragan R. MilivojevicDarko BrodicThe paper introduces a testing framework for the evaluation and validation of text line segmentation algorithms. Text line segmentation represents the key action for correct optical character recognition. Many of the tests for the evaluation of text line segmentation algorithms deal with text databases as reference templates. Because of the mismatch, the reliable testing framework is required. Hence, a new approach to a comprehensive experimental framework for the evaluation of text line segmentation algorithms is proposed. It consists of synthetic multi-like text samples and real handwritten text as well. Although the tests are mutually independent, the results are cross-linked. The proposed method can be used for different types of scripts and languages. Furthermore, two different procedures for the evaluation of algorithm efficiency based on the obtained error type classification are proposed. The first is based on the segmentation line error description, while the second one incorporates well-known signal detection theory. Each of them has different capabilities and convenience, but they can be used as supplements to make the evaluation process efficient. Overall the proposed procedure based on the segmentation line error description has some advantages, characterized by five measures that describe measurement procedures.http://www.mdpi.com/1424-8220/11/9/8782/document image processingtext line segmentationalgorithmsexperiments frameworktestingsignal detection theory
collection DOAJ
language English
format Article
sources DOAJ
author Zoran N. Milivojevic
Dragan R. Milivojevic
Darko Brodic
spellingShingle Zoran N. Milivojevic
Dragan R. Milivojevic
Darko Brodic
An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation Algorithms
Sensors
document image processing
text line segmentation
algorithms
experiments framework
testing
signal detection theory
author_facet Zoran N. Milivojevic
Dragan R. Milivojevic
Darko Brodic
author_sort Zoran N. Milivojevic
title An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation Algorithms
title_short An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation Algorithms
title_full An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation Algorithms
title_fullStr An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation Algorithms
title_full_unstemmed An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation Algorithms
title_sort approach to a comprehensive test framework for analysis and evaluation of text line segmentation algorithms
publisher MDPI AG
series Sensors
issn 1424-8220
publishDate 2011-09-01
description The paper introduces a testing framework for the evaluation and validation of text line segmentation algorithms. Text line segmentation represents the key action for correct optical character recognition. Many of the tests for the evaluation of text line segmentation algorithms deal with text databases as reference templates. Because of the mismatch, the reliable testing framework is required. Hence, a new approach to a comprehensive experimental framework for the evaluation of text line segmentation algorithms is proposed. It consists of synthetic multi-like text samples and real handwritten text as well. Although the tests are mutually independent, the results are cross-linked. The proposed method can be used for different types of scripts and languages. Furthermore, two different procedures for the evaluation of algorithm efficiency based on the obtained error type classification are proposed. The first is based on the segmentation line error description, while the second one incorporates well-known signal detection theory. Each of them has different capabilities and convenience, but they can be used as supplements to make the evaluation process efficient. Overall the proposed procedure based on the segmentation line error description has some advantages, characterized by five measures that describe measurement procedures.
topic document image processing
text line segmentation
algorithms
experiments framework
testing
signal detection theory
url http://www.mdpi.com/1424-8220/11/9/8782/
work_keys_str_mv AT zorannmilivojevic anapproachtoacomprehensivetestframeworkforanalysisandevaluationoftextlinesegmentationalgorithms
AT draganrmilivojevic anapproachtoacomprehensivetestframeworkforanalysisandevaluationoftextlinesegmentationalgorithms
AT darkobrodic anapproachtoacomprehensivetestframeworkforanalysisandevaluationoftextlinesegmentationalgorithms
AT zorannmilivojevic approachtoacomprehensivetestframeworkforanalysisandevaluationoftextlinesegmentationalgorithms
AT draganrmilivojevic approachtoacomprehensivetestframeworkforanalysisandevaluationoftextlinesegmentationalgorithms
AT darkobrodic approachtoacomprehensivetestframeworkforanalysisandevaluationoftextlinesegmentationalgorithms
_version_ 1725881169841487872