Context Dependent Thresholding and Filter Selection for Optical Character Recognition

Thresholding algorithms and filters are of great importance when utilizing OCR to extract information from text documents such as invoices. Invoice documents vary greatly and since the performance of image processing methods when applied to those documents will vary accordingly, selecting appropriat...

Full description

Bibliographic Details
Main Author: Kieri, Andreas
Format: Others
Language:English
Published: Uppsala universitet, Institutionen för informationsteknologi 2012
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-197460
id ndltd-UPSALLA1-oai-DiVA.org-uu-197460
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-uu-1974602013-05-07T04:04:31ZContext Dependent Thresholding and Filter Selection for Optical Character RecognitionengKieri, AndreasUppsala universitet, Institutionen för informationsteknologi2012digital image analysisimage thresholdingimage filteringmachine learningThresholding algorithms and filters are of great importance when utilizing OCR to extract information from text documents such as invoices. Invoice documents vary greatly and since the performance of image processing methods when applied to those documents will vary accordingly, selecting appropriate methods is critical if a high recognition rate is to be obtained. This paper aims to determine if a document recognition system that automatically selects optimal processing methods, based on the characteristics of input images, will yield a higher recognition rate than what can be achieved by a manual choice. Such a recognition system, including a learning framework for selecting optimal thresholding algorithms and filters, was developed and evaluated. It was established that an automatic selection will ensure a high recognition rate when applied to a set of arbitrary invoice images by successfully adapting and avoiding the methods that yield poor recognition rates. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-197460UPTEC F, 1401-5757 ; 12 036application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic digital image analysis
image thresholding
image filtering
machine learning
spellingShingle digital image analysis
image thresholding
image filtering
machine learning
Kieri, Andreas
Context Dependent Thresholding and Filter Selection for Optical Character Recognition
description Thresholding algorithms and filters are of great importance when utilizing OCR to extract information from text documents such as invoices. Invoice documents vary greatly and since the performance of image processing methods when applied to those documents will vary accordingly, selecting appropriate methods is critical if a high recognition rate is to be obtained. This paper aims to determine if a document recognition system that automatically selects optimal processing methods, based on the characteristics of input images, will yield a higher recognition rate than what can be achieved by a manual choice. Such a recognition system, including a learning framework for selecting optimal thresholding algorithms and filters, was developed and evaluated. It was established that an automatic selection will ensure a high recognition rate when applied to a set of arbitrary invoice images by successfully adapting and avoiding the methods that yield poor recognition rates.
author Kieri, Andreas
author_facet Kieri, Andreas
author_sort Kieri, Andreas
title Context Dependent Thresholding and Filter Selection for Optical Character Recognition
title_short Context Dependent Thresholding and Filter Selection for Optical Character Recognition
title_full Context Dependent Thresholding and Filter Selection for Optical Character Recognition
title_fullStr Context Dependent Thresholding and Filter Selection for Optical Character Recognition
title_full_unstemmed Context Dependent Thresholding and Filter Selection for Optical Character Recognition
title_sort context dependent thresholding and filter selection for optical character recognition
publisher Uppsala universitet, Institutionen för informationsteknologi
publishDate 2012
url http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-197460
work_keys_str_mv AT kieriandreas contextdependentthresholdingandfilterselectionforopticalcharacterrecognition
_version_ 1716585428550680576