An Industrial AI Vision System based on Deep Learning : A Case Study of Industrial Text Localization and Recognition

碩士 === 國立中央大學 === 資訊工程學系 === 107 === The application of text detection and recognition on optical images is quite extensive. For example, recognition of production date, product part number and drug number, etc... To recognize the text on an image, one has to first detect the bounding box of the tex...

Full description

Bibliographic Details
Main Authors: Yang, Kai-Lin, 楊凱霖
Other Authors: Yung-Hui Li
Format: Others
Language:zh-TW
Published: 2019
Online Access:http://ndltd.ncl.edu.tw/handle/m8j4z2
id ndltd-TW-107NCU05392120
record_format oai_dc
spelling ndltd-TW-107NCU053921202019-10-22T05:28:14Z http://ndltd.ncl.edu.tw/handle/m8j4z2 An Industrial AI Vision System based on Deep Learning : A Case Study of Industrial Text Localization and Recognition 基於深度學習之工業用智慧型機器視覺系統:以文字定位與辨識為例 Yang, Kai-Lin 楊凱霖 碩士 國立中央大學 資訊工程學系 107 The application of text detection and recognition on optical images is quite extensive. For example, recognition of production date, product part number and drug number, etc... To recognize the text on an image, one has to first detect the bounding box of the text, and then perform the text recognition for the localized image. However, in order to get a very accurate and robust results under deep learning method, huge amount of data is indispensable for the training of the network model. In addition, before training and testing a deep learning model, it is important to preprocess the image, such as image cropping, scaling and rotating… etc. Data augmentation, which is an approach to increase the number of images, is also important. However, image preprocessing is a very time-consuming and tedious work. In this research, transfer learning is applied to achieve the goal of deep learning training using a small amount of data and get a model with a good accuracy and robustness. In addition to the large amount of data and time required in pre-training a model, the subsequent retrained model can achieve an accuracy higher than 95% in a small amount of text image data. Yung-Hui Li 栗永徽 2019 學位論文 ; thesis 51 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立中央大學 === 資訊工程學系 === 107 === The application of text detection and recognition on optical images is quite extensive. For example, recognition of production date, product part number and drug number, etc... To recognize the text on an image, one has to first detect the bounding box of the text, and then perform the text recognition for the localized image. However, in order to get a very accurate and robust results under deep learning method, huge amount of data is indispensable for the training of the network model. In addition, before training and testing a deep learning model, it is important to preprocess the image, such as image cropping, scaling and rotating… etc. Data augmentation, which is an approach to increase the number of images, is also important. However, image preprocessing is a very time-consuming and tedious work. In this research, transfer learning is applied to achieve the goal of deep learning training using a small amount of data and get a model with a good accuracy and robustness. In addition to the large amount of data and time required in pre-training a model, the subsequent retrained model can achieve an accuracy higher than 95% in a small amount of text image data.
author2 Yung-Hui Li
author_facet Yung-Hui Li
Yang, Kai-Lin
楊凱霖
author Yang, Kai-Lin
楊凱霖
spellingShingle Yang, Kai-Lin
楊凱霖
An Industrial AI Vision System based on Deep Learning : A Case Study of Industrial Text Localization and Recognition
author_sort Yang, Kai-Lin
title An Industrial AI Vision System based on Deep Learning : A Case Study of Industrial Text Localization and Recognition
title_short An Industrial AI Vision System based on Deep Learning : A Case Study of Industrial Text Localization and Recognition
title_full An Industrial AI Vision System based on Deep Learning : A Case Study of Industrial Text Localization and Recognition
title_fullStr An Industrial AI Vision System based on Deep Learning : A Case Study of Industrial Text Localization and Recognition
title_full_unstemmed An Industrial AI Vision System based on Deep Learning : A Case Study of Industrial Text Localization and Recognition
title_sort industrial ai vision system based on deep learning : a case study of industrial text localization and recognition
publishDate 2019
url http://ndltd.ncl.edu.tw/handle/m8j4z2
work_keys_str_mv AT yangkailin anindustrialaivisionsystembasedondeeplearningacasestudyofindustrialtextlocalizationandrecognition
AT yángkǎilín anindustrialaivisionsystembasedondeeplearningacasestudyofindustrialtextlocalizationandrecognition
AT yangkailin jīyúshēndùxuéxízhīgōngyèyòngzhìhuìxíngjīqìshìjuéxìtǒngyǐwénzìdìngwèiyǔbiànshíwèilì
AT yángkǎilín jīyúshēndùxuéxízhīgōngyèyòngzhìhuìxíngjīqìshìjuéxìtǒngyǐwénzìdìngwèiyǔbiànshíwèilì
AT yangkailin industrialaivisionsystembasedondeeplearningacasestudyofindustrialtextlocalizationandrecognition
AT yángkǎilín industrialaivisionsystembasedondeeplearningacasestudyofindustrialtextlocalizationandrecognition
_version_ 1719274236468527104