Fast and Accurate Ground Truth Generation for Skew-Tolerance Evaluation of Page Segmentation Algorithms

<p/> <p>Many image segmentation algorithms are known, but often there is an inherent obstacle in the unbiased evaluation of segmentation quality: the absence or lack of a common objective representation for segmentation results. Such a representation, known as the ground truth, is a desc...

Full description

Bibliographic Details
Main Authors: Okun Oleg, Pietik&#228;inen Matti
Format: Article
Language:English
Published: SpringerOpen 2006-01-01
Series:EURASIP Journal on Advances in Signal Processing
Online Access:http://dx.doi.org/10.1155/ASP/2006/12093
id doaj-bdc83cab8f3b42e398b680fa4b51e775
record_format Article
spelling doaj-bdc83cab8f3b42e398b680fa4b51e7752020-11-25T01:03:49ZengSpringerOpenEURASIP Journal on Advances in Signal Processing1687-61721687-61802006-01-0120061012093Fast and Accurate Ground Truth Generation for Skew-Tolerance Evaluation of Page Segmentation AlgorithmsOkun OlegPietik&#228;inen Matti<p/> <p>Many image segmentation algorithms are known, but often there is an inherent obstacle in the unbiased evaluation of segmentation quality: the absence or lack of a common objective representation for segmentation results. Such a representation, known as the ground truth, is a description of what one should obtain as the result of ideal segmentation, independently of the segmentation algorithm used. The creation of ground truth is a laborious process and therefore any degree of automation is always welcome. Document image analysis is one of the areas where ground truths are employed. In this paper, we describe an automated tool called GROTTO intended to generate ground truths for skewed document images, which can be used for the performance evaluation of page segmentation algorithms. Some of these algorithms are claimed to be insensitive to skew (tilt of text lines). However, this fact is usually supported only by a visual comparison of what one obtains and what one should obtain since ground truths are mostly available for upright images, that is, those without skew. As a result, the evaluation is both subjective; that is, prone to errors, and tedious. Our tool allows users to quickly and easily produce many sufficiently accurate ground truths that can be employed in practice and therefore it facilitates automatic performance evaluation. The main idea is to utilize the ground truths available for upright images and the concept of the representative square [9] in order to produce the ground truths for skewed images. The usefulness of our tool is demonstrated through a number of experiments with real-document images of complex layout.</p> http://dx.doi.org/10.1155/ASP/2006/12093
collection DOAJ
language English
format Article
sources DOAJ
author Okun Oleg
Pietik&#228;inen Matti
spellingShingle Okun Oleg
Pietik&#228;inen Matti
Fast and Accurate Ground Truth Generation for Skew-Tolerance Evaluation of Page Segmentation Algorithms
EURASIP Journal on Advances in Signal Processing
author_facet Okun Oleg
Pietik&#228;inen Matti
author_sort Okun Oleg
title Fast and Accurate Ground Truth Generation for Skew-Tolerance Evaluation of Page Segmentation Algorithms
title_short Fast and Accurate Ground Truth Generation for Skew-Tolerance Evaluation of Page Segmentation Algorithms
title_full Fast and Accurate Ground Truth Generation for Skew-Tolerance Evaluation of Page Segmentation Algorithms
title_fullStr Fast and Accurate Ground Truth Generation for Skew-Tolerance Evaluation of Page Segmentation Algorithms
title_full_unstemmed Fast and Accurate Ground Truth Generation for Skew-Tolerance Evaluation of Page Segmentation Algorithms
title_sort fast and accurate ground truth generation for skew-tolerance evaluation of page segmentation algorithms
publisher SpringerOpen
series EURASIP Journal on Advances in Signal Processing
issn 1687-6172
1687-6180
publishDate 2006-01-01
description <p/> <p>Many image segmentation algorithms are known, but often there is an inherent obstacle in the unbiased evaluation of segmentation quality: the absence or lack of a common objective representation for segmentation results. Such a representation, known as the ground truth, is a description of what one should obtain as the result of ideal segmentation, independently of the segmentation algorithm used. The creation of ground truth is a laborious process and therefore any degree of automation is always welcome. Document image analysis is one of the areas where ground truths are employed. In this paper, we describe an automated tool called GROTTO intended to generate ground truths for skewed document images, which can be used for the performance evaluation of page segmentation algorithms. Some of these algorithms are claimed to be insensitive to skew (tilt of text lines). However, this fact is usually supported only by a visual comparison of what one obtains and what one should obtain since ground truths are mostly available for upright images, that is, those without skew. As a result, the evaluation is both subjective; that is, prone to errors, and tedious. Our tool allows users to quickly and easily produce many sufficiently accurate ground truths that can be employed in practice and therefore it facilitates automatic performance evaluation. The main idea is to utilize the ground truths available for upright images and the concept of the representative square [9] in order to produce the ground truths for skewed images. The usefulness of our tool is demonstrated through a number of experiments with real-document images of complex layout.</p>
url http://dx.doi.org/10.1155/ASP/2006/12093
work_keys_str_mv AT okunoleg fastandaccurategroundtruthgenerationforskewtoleranceevaluationofpagesegmentationalgorithms
AT pietik228inenmatti fastandaccurategroundtruthgenerationforskewtoleranceevaluationofpagesegmentationalgorithms
_version_ 1725199332165025792