An Algorithm for Natural Images Text Recognition Using Four Direction Features

Irregular text has widespread applications in multiple areas. Different from regular text, irregular text is difficult to recognize because of its various shapes and distorted patterns. In this paper, we develop a multidirectional convolutional neural network (MCN) to extract four direction features...

Full description

Bibliographic Details
Main Authors: Min Zhang, Yujin Yan, Hai Wang, Wei Zhao
Format: Article
Language:English
Published: MDPI AG 2019-08-01
Series:Electronics
Subjects:
Online Access:https://www.mdpi.com/2079-9292/8/9/971
id doaj-1a1fa6f641d04bfba4b580e1e70d7cbc
record_format Article
spelling doaj-1a1fa6f641d04bfba4b580e1e70d7cbc2020-11-25T01:39:51ZengMDPI AGElectronics2079-92922019-08-018997110.3390/electronics8090971electronics8090971An Algorithm for Natural Images Text Recognition Using Four Direction FeaturesMin Zhang0Yujin Yan1Hai Wang2Wei Zhao3School of Aerospace Science and Technology, Xidian University, Xi’an 710071, ChinaKey Laboratory of Electronic Equipment Structure Design, Ministry of Education, Xidian University, Xi’an 710071, ChinaSchool of Aerospace Science and Technology, Xidian University, Xi’an 710071, ChinaKey Laboratory of Electronic Equipment Structure Design, Ministry of Education, Xidian University, Xi’an 710071, ChinaIrregular text has widespread applications in multiple areas. Different from regular text, irregular text is difficult to recognize because of its various shapes and distorted patterns. In this paper, we develop a multidirectional convolutional neural network (MCN) to extract four direction features to fully describe the textual information. Meanwhile, the character placement possibility is extracted as the weight of the four direction features. Based on these works, we propose the encoder to fuse the four direction features for the generation of feature code to predict the character sequence. The whole network is end-to-end trainable due to using images and word-level labels. The experiments on standard benchmarks, including the IIIT-5K, SVT, CUTE80, and ICDAR datasets, demonstrate the superiority of the proposed method on both regular and irregular datasets. The developed method shows an increase of 1.2% in the CUTE80 dataset and 1.5% in the SVT dataset, and has fewer parameters than most existing methods.https://www.mdpi.com/2079-9292/8/9/971text recognitionconvolutional neural networklong short-term memory
collection DOAJ
language English
format Article
sources DOAJ
author Min Zhang
Yujin Yan
Hai Wang
Wei Zhao
spellingShingle Min Zhang
Yujin Yan
Hai Wang
Wei Zhao
An Algorithm for Natural Images Text Recognition Using Four Direction Features
Electronics
text recognition
convolutional neural network
long short-term memory
author_facet Min Zhang
Yujin Yan
Hai Wang
Wei Zhao
author_sort Min Zhang
title An Algorithm for Natural Images Text Recognition Using Four Direction Features
title_short An Algorithm for Natural Images Text Recognition Using Four Direction Features
title_full An Algorithm for Natural Images Text Recognition Using Four Direction Features
title_fullStr An Algorithm for Natural Images Text Recognition Using Four Direction Features
title_full_unstemmed An Algorithm for Natural Images Text Recognition Using Four Direction Features
title_sort algorithm for natural images text recognition using four direction features
publisher MDPI AG
series Electronics
issn 2079-9292
publishDate 2019-08-01
description Irregular text has widespread applications in multiple areas. Different from regular text, irregular text is difficult to recognize because of its various shapes and distorted patterns. In this paper, we develop a multidirectional convolutional neural network (MCN) to extract four direction features to fully describe the textual information. Meanwhile, the character placement possibility is extracted as the weight of the four direction features. Based on these works, we propose the encoder to fuse the four direction features for the generation of feature code to predict the character sequence. The whole network is end-to-end trainable due to using images and word-level labels. The experiments on standard benchmarks, including the IIIT-5K, SVT, CUTE80, and ICDAR datasets, demonstrate the superiority of the proposed method on both regular and irregular datasets. The developed method shows an increase of 1.2% in the CUTE80 dataset and 1.5% in the SVT dataset, and has fewer parameters than most existing methods.
topic text recognition
convolutional neural network
long short-term memory
url https://www.mdpi.com/2079-9292/8/9/971
work_keys_str_mv AT minzhang analgorithmfornaturalimagestextrecognitionusingfourdirectionfeatures
AT yujinyan analgorithmfornaturalimagestextrecognitionusingfourdirectionfeatures
AT haiwang analgorithmfornaturalimagestextrecognitionusingfourdirectionfeatures
AT weizhao analgorithmfornaturalimagestextrecognitionusingfourdirectionfeatures
AT minzhang algorithmfornaturalimagestextrecognitionusingfourdirectionfeatures
AT yujinyan algorithmfornaturalimagestextrecognitionusingfourdirectionfeatures
AT haiwang algorithmfornaturalimagestextrecognitionusingfourdirectionfeatures
AT weizhao algorithmfornaturalimagestextrecognitionusingfourdirectionfeatures
_version_ 1725048668368666624