An Algorithm for Natural Images Text Recognition Using Four Direction Features
Irregular text has widespread applications in multiple areas. Different from regular text, irregular text is difficult to recognize because of its various shapes and distorted patterns. In this paper, we develop a multidirectional convolutional neural network (MCN) to extract four direction features...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2019-08-01
|
Series: | Electronics |
Subjects: | |
Online Access: | https://www.mdpi.com/2079-9292/8/9/971 |
id |
doaj-1a1fa6f641d04bfba4b580e1e70d7cbc |
---|---|
record_format |
Article |
spelling |
doaj-1a1fa6f641d04bfba4b580e1e70d7cbc2020-11-25T01:39:51ZengMDPI AGElectronics2079-92922019-08-018997110.3390/electronics8090971electronics8090971An Algorithm for Natural Images Text Recognition Using Four Direction FeaturesMin Zhang0Yujin Yan1Hai Wang2Wei Zhao3School of Aerospace Science and Technology, Xidian University, Xi’an 710071, ChinaKey Laboratory of Electronic Equipment Structure Design, Ministry of Education, Xidian University, Xi’an 710071, ChinaSchool of Aerospace Science and Technology, Xidian University, Xi’an 710071, ChinaKey Laboratory of Electronic Equipment Structure Design, Ministry of Education, Xidian University, Xi’an 710071, ChinaIrregular text has widespread applications in multiple areas. Different from regular text, irregular text is difficult to recognize because of its various shapes and distorted patterns. In this paper, we develop a multidirectional convolutional neural network (MCN) to extract four direction features to fully describe the textual information. Meanwhile, the character placement possibility is extracted as the weight of the four direction features. Based on these works, we propose the encoder to fuse the four direction features for the generation of feature code to predict the character sequence. The whole network is end-to-end trainable due to using images and word-level labels. The experiments on standard benchmarks, including the IIIT-5K, SVT, CUTE80, and ICDAR datasets, demonstrate the superiority of the proposed method on both regular and irregular datasets. The developed method shows an increase of 1.2% in the CUTE80 dataset and 1.5% in the SVT dataset, and has fewer parameters than most existing methods.https://www.mdpi.com/2079-9292/8/9/971text recognitionconvolutional neural networklong short-term memory |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Min Zhang Yujin Yan Hai Wang Wei Zhao |
spellingShingle |
Min Zhang Yujin Yan Hai Wang Wei Zhao An Algorithm for Natural Images Text Recognition Using Four Direction Features Electronics text recognition convolutional neural network long short-term memory |
author_facet |
Min Zhang Yujin Yan Hai Wang Wei Zhao |
author_sort |
Min Zhang |
title |
An Algorithm for Natural Images Text Recognition Using Four Direction Features |
title_short |
An Algorithm for Natural Images Text Recognition Using Four Direction Features |
title_full |
An Algorithm for Natural Images Text Recognition Using Four Direction Features |
title_fullStr |
An Algorithm for Natural Images Text Recognition Using Four Direction Features |
title_full_unstemmed |
An Algorithm for Natural Images Text Recognition Using Four Direction Features |
title_sort |
algorithm for natural images text recognition using four direction features |
publisher |
MDPI AG |
series |
Electronics |
issn |
2079-9292 |
publishDate |
2019-08-01 |
description |
Irregular text has widespread applications in multiple areas. Different from regular text, irregular text is difficult to recognize because of its various shapes and distorted patterns. In this paper, we develop a multidirectional convolutional neural network (MCN) to extract four direction features to fully describe the textual information. Meanwhile, the character placement possibility is extracted as the weight of the four direction features. Based on these works, we propose the encoder to fuse the four direction features for the generation of feature code to predict the character sequence. The whole network is end-to-end trainable due to using images and word-level labels. The experiments on standard benchmarks, including the IIIT-5K, SVT, CUTE80, and ICDAR datasets, demonstrate the superiority of the proposed method on both regular and irregular datasets. The developed method shows an increase of 1.2% in the CUTE80 dataset and 1.5% in the SVT dataset, and has fewer parameters than most existing methods. |
topic |
text recognition convolutional neural network long short-term memory |
url |
https://www.mdpi.com/2079-9292/8/9/971 |
work_keys_str_mv |
AT minzhang analgorithmfornaturalimagestextrecognitionusingfourdirectionfeatures AT yujinyan analgorithmfornaturalimagestextrecognitionusingfourdirectionfeatures AT haiwang analgorithmfornaturalimagestextrecognitionusingfourdirectionfeatures AT weizhao analgorithmfornaturalimagestextrecognitionusingfourdirectionfeatures AT minzhang algorithmfornaturalimagestextrecognitionusingfourdirectionfeatures AT yujinyan algorithmfornaturalimagestextrecognitionusingfourdirectionfeatures AT haiwang algorithmfornaturalimagestextrecognitionusingfourdirectionfeatures AT weizhao algorithmfornaturalimagestextrecognitionusingfourdirectionfeatures |
_version_ |
1725048668368666624 |