An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation Algorithms
The paper introduces a testing framework for the evaluation and validation of text line segmentation algorithms. Text line segmentation represents the key action for correct optical character recognition. Many of the tests for the evaluation of text line segmentation algorithms deal with text databa...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2011-09-01
|
Series: | Sensors |
Subjects: | |
Online Access: | http://www.mdpi.com/1424-8220/11/9/8782/ |
id |
doaj-8d18853f15434e119bb33c222a7823ec |
---|---|
record_format |
Article |
spelling |
doaj-8d18853f15434e119bb33c222a7823ec2020-11-24T21:50:59ZengMDPI AGSensors1424-82202011-09-011198782881210.3390/s110908782An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation AlgorithmsZoran N. MilivojevicDragan R. MilivojevicDarko BrodicThe paper introduces a testing framework for the evaluation and validation of text line segmentation algorithms. Text line segmentation represents the key action for correct optical character recognition. Many of the tests for the evaluation of text line segmentation algorithms deal with text databases as reference templates. Because of the mismatch, the reliable testing framework is required. Hence, a new approach to a comprehensive experimental framework for the evaluation of text line segmentation algorithms is proposed. It consists of synthetic multi-like text samples and real handwritten text as well. Although the tests are mutually independent, the results are cross-linked. The proposed method can be used for different types of scripts and languages. Furthermore, two different procedures for the evaluation of algorithm efficiency based on the obtained error type classification are proposed. The first is based on the segmentation line error description, while the second one incorporates well-known signal detection theory. Each of them has different capabilities and convenience, but they can be used as supplements to make the evaluation process efficient. Overall the proposed procedure based on the segmentation line error description has some advantages, characterized by five measures that describe measurement procedures.http://www.mdpi.com/1424-8220/11/9/8782/document image processingtext line segmentationalgorithmsexperiments frameworktestingsignal detection theory |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Zoran N. Milivojevic Dragan R. Milivojevic Darko Brodic |
spellingShingle |
Zoran N. Milivojevic Dragan R. Milivojevic Darko Brodic An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation Algorithms Sensors document image processing text line segmentation algorithms experiments framework testing signal detection theory |
author_facet |
Zoran N. Milivojevic Dragan R. Milivojevic Darko Brodic |
author_sort |
Zoran N. Milivojevic |
title |
An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation Algorithms |
title_short |
An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation Algorithms |
title_full |
An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation Algorithms |
title_fullStr |
An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation Algorithms |
title_full_unstemmed |
An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation Algorithms |
title_sort |
approach to a comprehensive test framework for analysis and evaluation of text line segmentation algorithms |
publisher |
MDPI AG |
series |
Sensors |
issn |
1424-8220 |
publishDate |
2011-09-01 |
description |
The paper introduces a testing framework for the evaluation and validation of text line segmentation algorithms. Text line segmentation represents the key action for correct optical character recognition. Many of the tests for the evaluation of text line segmentation algorithms deal with text databases as reference templates. Because of the mismatch, the reliable testing framework is required. Hence, a new approach to a comprehensive experimental framework for the evaluation of text line segmentation algorithms is proposed. It consists of synthetic multi-like text samples and real handwritten text as well. Although the tests are mutually independent, the results are cross-linked. The proposed method can be used for different types of scripts and languages. Furthermore, two different procedures for the evaluation of algorithm efficiency based on the obtained error type classification are proposed. The first is based on the segmentation line error description, while the second one incorporates well-known signal detection theory. Each of them has different capabilities and convenience, but they can be used as supplements to make the evaluation process efficient. Overall the proposed procedure based on the segmentation line error description has some advantages, characterized by five measures that describe measurement procedures. |
topic |
document image processing text line segmentation algorithms experiments framework testing signal detection theory |
url |
http://www.mdpi.com/1424-8220/11/9/8782/ |
work_keys_str_mv |
AT zorannmilivojevic anapproachtoacomprehensivetestframeworkforanalysisandevaluationoftextlinesegmentationalgorithms AT draganrmilivojevic anapproachtoacomprehensivetestframeworkforanalysisandevaluationoftextlinesegmentationalgorithms AT darkobrodic anapproachtoacomprehensivetestframeworkforanalysisandevaluationoftextlinesegmentationalgorithms AT zorannmilivojevic approachtoacomprehensivetestframeworkforanalysisandevaluationoftextlinesegmentationalgorithms AT draganrmilivojevic approachtoacomprehensivetestframeworkforanalysisandevaluationoftextlinesegmentationalgorithms AT darkobrodic approachtoacomprehensivetestframeworkforanalysisandevaluationoftextlinesegmentationalgorithms |
_version_ |
1725881169841487872 |