Arbitrary-Shaped Text Detection With Adaptive Text Region Representation
Text detection/localization, as an important task in computer vision, has witnessed substantial advancements in methodology and performance with convolutional neural networks. However, the vast majority of popular methods use rectangles or quadrangles to describe text regions. These representations...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/9104986/ |
id |
doaj-954d271c10154b0faae392deea9e9b21 |
---|---|
record_format |
Article |
spelling |
doaj-954d271c10154b0faae392deea9e9b212021-03-30T02:16:29ZengIEEEIEEE Access2169-35362020-01-01810210610211810.1109/ACCESS.2020.29990699104986Arbitrary-Shaped Text Detection With Adaptive Text Region RepresentationXiufeng Jiang0https://orcid.org/0000-0001-7353-0086Shugong Xu1https://orcid.org/0000-0003-1905-6269Shunqing Zhang2https://orcid.org/0000-0002-5156-9235Shan Cao3https://orcid.org/0000-0003-3713-8671Shanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, ChinaShanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, ChinaShanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, ChinaShanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, ChinaText detection/localization, as an important task in computer vision, has witnessed substantial advancements in methodology and performance with convolutional neural networks. However, the vast majority of popular methods use rectangles or quadrangles to describe text regions. These representations have inherent drawbacks, especially relating to dense adjacent text and loose regional text boundaries, which usually cause difficulty detecting arbitrarily shaped text. In this paper, we propose a novel text region representation method, with a robust pipeline, which can precisely detect dense adjacent text instances with arbitrary shapes. We consider a text instance to be composed of an adaptive central text region mask and a corresponding expanding ratio between the central text region and the full text region. More specifically, our pipeline generates adaptive central text regions and corresponding expanding ratios with a proposed training strategy, followed by a new proposed post-processing algorithm which expands central text regions to the complete text instance with the corresponding expanding ratios. We demonstrated that our new text region representation is effective, and that the pipeline can precisely detect closely adjacent text instances of arbitrary shapes. Experimental results on common datasets demonstrate superior performance of our work.https://ieeexplore.ieee.org/document/9104986/Scene text detectionarbitrary-shapedtext region representationdeformable convolutional network |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Xiufeng Jiang Shugong Xu Shunqing Zhang Shan Cao |
spellingShingle |
Xiufeng Jiang Shugong Xu Shunqing Zhang Shan Cao Arbitrary-Shaped Text Detection With Adaptive Text Region Representation IEEE Access Scene text detection arbitrary-shaped text region representation deformable convolutional network |
author_facet |
Xiufeng Jiang Shugong Xu Shunqing Zhang Shan Cao |
author_sort |
Xiufeng Jiang |
title |
Arbitrary-Shaped Text Detection With Adaptive Text Region Representation |
title_short |
Arbitrary-Shaped Text Detection With Adaptive Text Region Representation |
title_full |
Arbitrary-Shaped Text Detection With Adaptive Text Region Representation |
title_fullStr |
Arbitrary-Shaped Text Detection With Adaptive Text Region Representation |
title_full_unstemmed |
Arbitrary-Shaped Text Detection With Adaptive Text Region Representation |
title_sort |
arbitrary-shaped text detection with adaptive text region representation |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2020-01-01 |
description |
Text detection/localization, as an important task in computer vision, has witnessed substantial advancements in methodology and performance with convolutional neural networks. However, the vast majority of popular methods use rectangles or quadrangles to describe text regions. These representations have inherent drawbacks, especially relating to dense adjacent text and loose regional text boundaries, which usually cause difficulty detecting arbitrarily shaped text. In this paper, we propose a novel text region representation method, with a robust pipeline, which can precisely detect dense adjacent text instances with arbitrary shapes. We consider a text instance to be composed of an adaptive central text region mask and a corresponding expanding ratio between the central text region and the full text region. More specifically, our pipeline generates adaptive central text regions and corresponding expanding ratios with a proposed training strategy, followed by a new proposed post-processing algorithm which expands central text regions to the complete text instance with the corresponding expanding ratios. We demonstrated that our new text region representation is effective, and that the pipeline can precisely detect closely adjacent text instances of arbitrary shapes. Experimental results on common datasets demonstrate superior performance of our work. |
topic |
Scene text detection arbitrary-shaped text region representation deformable convolutional network |
url |
https://ieeexplore.ieee.org/document/9104986/ |
work_keys_str_mv |
AT xiufengjiang arbitraryshapedtextdetectionwithadaptivetextregionrepresentation AT shugongxu arbitraryshapedtextdetectionwithadaptivetextregionrepresentation AT shunqingzhang arbitraryshapedtextdetectionwithadaptivetextregionrepresentation AT shancao arbitraryshapedtextdetectionwithadaptivetextregionrepresentation |
_version_ |
1724185545521758208 |