A Novel Semantic Segmentation Model for Chinese Characters

Character segmentation plays an important role in optical character recognition (OCR). Due to the limitations of feature representation, traditional image analyzing based methods cannot well segment characters with connected or broken strokes, especially for the Chinese characters which usually have...

Full description

Bibliographic Details
Main Authors: Zhenyu Gao, Jin Liu, Yiyao Li, Yihe Yang, Huihua He
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9206578/
id doaj-eee0d8ced84d4e2ebae2effea099e68f
record_format Article
spelling doaj-eee0d8ced84d4e2ebae2effea099e68f2021-03-30T04:49:55ZengIEEEIEEE Access2169-35362020-01-01817908317909310.1109/ACCESS.2020.30270199206578A Novel Semantic Segmentation Model for Chinese CharactersZhenyu Gao0https://orcid.org/0000-0002-9687-2027Jin Liu1https://orcid.org/0000-0001-7249-698XYiyao Li2Yihe Yang3Huihua He4https://orcid.org/0000-0003-4628-7425College of Information Engineering, Shanghai Maritime University, Shanghai, ChinaCollege of Information Engineering, Shanghai Maritime University, Shanghai, ChinaCollege of Information Engineering, Shanghai Maritime University, Shanghai, ChinaCollege of Information Engineering, Shanghai Maritime University, Shanghai, ChinaCollege of Education, Shanghai Normal University, Shanghai, ChinaCharacter segmentation plays an important role in optical character recognition (OCR). Due to the limitations of feature representation, traditional image analyzing based methods cannot well segment characters with connected or broken strokes, especially for the Chinese characters which usually have complex structures. To solve this issue, this paper proposes a novel segmentation model based on fully convolutional neural networks (FCN). The model first uses convolutional neural networks to extract spatial features, then shares them throughout the whole model. Two FCNs are used to extract character information to form a score map. Finally, character features are reused to adjust the accurate segmentation points in the score map. What's more, to strengthen the ability of feature representation, a novel compound character feature which can well describe the characters' outline is also proposed. The proposed method is validated on two datasets: GBSD and CASIA-HWDB-MT, against the methods proposed in the literature. Experimental results show that the proposed model outperforms state-of-the-art methods.https://ieeexplore.ieee.org/document/9206578/Chinese character segmentationcharacter feature extractionfully convolutional neural networks
collection DOAJ
language English
format Article
sources DOAJ
author Zhenyu Gao
Jin Liu
Yiyao Li
Yihe Yang
Huihua He
spellingShingle Zhenyu Gao
Jin Liu
Yiyao Li
Yihe Yang
Huihua He
A Novel Semantic Segmentation Model for Chinese Characters
IEEE Access
Chinese character segmentation
character feature extraction
fully convolutional neural networks
author_facet Zhenyu Gao
Jin Liu
Yiyao Li
Yihe Yang
Huihua He
author_sort Zhenyu Gao
title A Novel Semantic Segmentation Model for Chinese Characters
title_short A Novel Semantic Segmentation Model for Chinese Characters
title_full A Novel Semantic Segmentation Model for Chinese Characters
title_fullStr A Novel Semantic Segmentation Model for Chinese Characters
title_full_unstemmed A Novel Semantic Segmentation Model for Chinese Characters
title_sort novel semantic segmentation model for chinese characters
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description Character segmentation plays an important role in optical character recognition (OCR). Due to the limitations of feature representation, traditional image analyzing based methods cannot well segment characters with connected or broken strokes, especially for the Chinese characters which usually have complex structures. To solve this issue, this paper proposes a novel segmentation model based on fully convolutional neural networks (FCN). The model first uses convolutional neural networks to extract spatial features, then shares them throughout the whole model. Two FCNs are used to extract character information to form a score map. Finally, character features are reused to adjust the accurate segmentation points in the score map. What's more, to strengthen the ability of feature representation, a novel compound character feature which can well describe the characters' outline is also proposed. The proposed method is validated on two datasets: GBSD and CASIA-HWDB-MT, against the methods proposed in the literature. Experimental results show that the proposed model outperforms state-of-the-art methods.
topic Chinese character segmentation
character feature extraction
fully convolutional neural networks
url https://ieeexplore.ieee.org/document/9206578/
work_keys_str_mv AT zhenyugao anovelsemanticsegmentationmodelforchinesecharacters
AT jinliu anovelsemanticsegmentationmodelforchinesecharacters
AT yiyaoli anovelsemanticsegmentationmodelforchinesecharacters
AT yiheyang anovelsemanticsegmentationmodelforchinesecharacters
AT huihuahe anovelsemanticsegmentationmodelforchinesecharacters
AT zhenyugao novelsemanticsegmentationmodelforchinesecharacters
AT jinliu novelsemanticsegmentationmodelforchinesecharacters
AT yiyaoli novelsemanticsegmentationmodelforchinesecharacters
AT yiheyang novelsemanticsegmentationmodelforchinesecharacters
AT huihuahe novelsemanticsegmentationmodelforchinesecharacters
_version_ 1724181165778141184