Number Recognition of Real-world Images in the Forest Industry : a study of segmentation and recognition of numbers on images of logs with color-stamped numbers

Analytics such as machine learning are of big interest in many types of industries. Optical character recognition is essentially a solved problem, whereas number recognition on real-world images which can be one form of machine learning are a more challenging obstacle. The purpose of this study was...

Full description

Bibliographic Details
Main Author: Munter, Johan
Format: Others
Language:English
Published: Mittuniversitetet, Institutionen för informationssystem och –teknologi 2020
Subjects:
Online Access:http://urn.kb.se/resolve?urn=urn:nbn:se:miun:diva-39365
id ndltd-UPSALLA1-oai-DiVA.org-miun-39365
record_format oai_dc
spelling ndltd-UPSALLA1-oai-DiVA.org-miun-393652020-06-27T03:32:56ZNumber Recognition of Real-world Images in the Forest Industry : a study of segmentation and recognition of numbers on images of logs with color-stamped numbersengMunter, JohanMittuniversitetet, Institutionen för informationssystem och –teknologi2020Number recognitionpreprocessingmachine learningforest industryComputer SystemsDatorsystemAnalytics such as machine learning are of big interest in many types of industries. Optical character recognition is essentially a solved problem, whereas number recognition on real-world images which can be one form of machine learning are a more challenging obstacle. The purpose of this study was to implement a system that can detect and read numbers on given dataset originating from the forest industry being images of color-stamped logs. This study evaluated accuracy of segmentation and number recognition on images of color-stamped logs when using a pre-trained model of the street view house numbers dataset. The general approach of preprocessing was based on car number plate segmentation because of the similar problem of identifying an object to then locate individual digits. Color segmentation were the biggest asset for the preprocessing because of the distinct red color of digits compared to the rest of the image. The accuracy of number recognition was significantly lower when using the pre-trained model on color-stamped logs being 26% in comparison to street view house numbers with 95% but could still reach over 80% per digit accuracy rate for some image classes when excluding accuracy of segmentation. The highest segmentation accuracy among classes was 93% and the lowest was 32%. From the results it was concluded that unclear digits on images lessened the number recognition accuracy the most. There are much to consider for future work, but the most obvious and impactful change would be to train a more accurate model by basing it on the dataset of color-stamped logs. Student thesisinfo:eu-repo/semantics/bachelorThesistexthttp://urn.kb.se/resolve?urn=urn:nbn:se:miun:diva-39365Local DT-V20-A2-010application/pdfinfo:eu-repo/semantics/openAccess
collection NDLTD
language English
format Others
sources NDLTD
topic Number recognition
preprocessing
machine learning
forest industry
Computer Systems
Datorsystem
spellingShingle Number recognition
preprocessing
machine learning
forest industry
Computer Systems
Datorsystem
Munter, Johan
Number Recognition of Real-world Images in the Forest Industry : a study of segmentation and recognition of numbers on images of logs with color-stamped numbers
description Analytics such as machine learning are of big interest in many types of industries. Optical character recognition is essentially a solved problem, whereas number recognition on real-world images which can be one form of machine learning are a more challenging obstacle. The purpose of this study was to implement a system that can detect and read numbers on given dataset originating from the forest industry being images of color-stamped logs. This study evaluated accuracy of segmentation and number recognition on images of color-stamped logs when using a pre-trained model of the street view house numbers dataset. The general approach of preprocessing was based on car number plate segmentation because of the similar problem of identifying an object to then locate individual digits. Color segmentation were the biggest asset for the preprocessing because of the distinct red color of digits compared to the rest of the image. The accuracy of number recognition was significantly lower when using the pre-trained model on color-stamped logs being 26% in comparison to street view house numbers with 95% but could still reach over 80% per digit accuracy rate for some image classes when excluding accuracy of segmentation. The highest segmentation accuracy among classes was 93% and the lowest was 32%. From the results it was concluded that unclear digits on images lessened the number recognition accuracy the most. There are much to consider for future work, but the most obvious and impactful change would be to train a more accurate model by basing it on the dataset of color-stamped logs.
author Munter, Johan
author_facet Munter, Johan
author_sort Munter, Johan
title Number Recognition of Real-world Images in the Forest Industry : a study of segmentation and recognition of numbers on images of logs with color-stamped numbers
title_short Number Recognition of Real-world Images in the Forest Industry : a study of segmentation and recognition of numbers on images of logs with color-stamped numbers
title_full Number Recognition of Real-world Images in the Forest Industry : a study of segmentation and recognition of numbers on images of logs with color-stamped numbers
title_fullStr Number Recognition of Real-world Images in the Forest Industry : a study of segmentation and recognition of numbers on images of logs with color-stamped numbers
title_full_unstemmed Number Recognition of Real-world Images in the Forest Industry : a study of segmentation and recognition of numbers on images of logs with color-stamped numbers
title_sort number recognition of real-world images in the forest industry : a study of segmentation and recognition of numbers on images of logs with color-stamped numbers
publisher Mittuniversitetet, Institutionen för informationssystem och –teknologi
publishDate 2020
url http://urn.kb.se/resolve?urn=urn:nbn:se:miun:diva-39365
work_keys_str_mv AT munterjohan numberrecognitionofrealworldimagesintheforestindustryastudyofsegmentationandrecognitionofnumbersonimagesoflogswithcolorstampednumbers
_version_ 1719323979523555328