Arbitrary-Shaped Text Detection With Adaptive Text Region Representation

Text detection/localization, as an important task in computer vision, has witnessed substantial advancements in methodology and performance with convolutional neural networks. However, the vast majority of popular methods use rectangles or quadrangles to describe text regions. These representations...

Full description

Bibliographic Details
Main Authors: Xiufeng Jiang, Shugong Xu, Shunqing Zhang, Shan Cao
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9104986/
id doaj-954d271c10154b0faae392deea9e9b21
record_format Article
spelling doaj-954d271c10154b0faae392deea9e9b212021-03-30T02:16:29ZengIEEEIEEE Access2169-35362020-01-01810210610211810.1109/ACCESS.2020.29990699104986Arbitrary-Shaped Text Detection With Adaptive Text Region RepresentationXiufeng Jiang0https://orcid.org/0000-0001-7353-0086Shugong Xu1https://orcid.org/0000-0003-1905-6269Shunqing Zhang2https://orcid.org/0000-0002-5156-9235Shan Cao3https://orcid.org/0000-0003-3713-8671Shanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, ChinaShanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, ChinaShanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, ChinaShanghai Institute for Advanced Communication and Data Science, Shanghai University, Shanghai, ChinaText detection/localization, as an important task in computer vision, has witnessed substantial advancements in methodology and performance with convolutional neural networks. However, the vast majority of popular methods use rectangles or quadrangles to describe text regions. These representations have inherent drawbacks, especially relating to dense adjacent text and loose regional text boundaries, which usually cause difficulty detecting arbitrarily shaped text. In this paper, we propose a novel text region representation method, with a robust pipeline, which can precisely detect dense adjacent text instances with arbitrary shapes. We consider a text instance to be composed of an adaptive central text region mask and a corresponding expanding ratio between the central text region and the full text region. More specifically, our pipeline generates adaptive central text regions and corresponding expanding ratios with a proposed training strategy, followed by a new proposed post-processing algorithm which expands central text regions to the complete text instance with the corresponding expanding ratios. We demonstrated that our new text region representation is effective, and that the pipeline can precisely detect closely adjacent text instances of arbitrary shapes. Experimental results on common datasets demonstrate superior performance of our work.https://ieeexplore.ieee.org/document/9104986/Scene text detectionarbitrary-shapedtext region representationdeformable convolutional network
collection DOAJ
language English
format Article
sources DOAJ
author Xiufeng Jiang
Shugong Xu
Shunqing Zhang
Shan Cao
spellingShingle Xiufeng Jiang
Shugong Xu
Shunqing Zhang
Shan Cao
Arbitrary-Shaped Text Detection With Adaptive Text Region Representation
IEEE Access
Scene text detection
arbitrary-shaped
text region representation
deformable convolutional network
author_facet Xiufeng Jiang
Shugong Xu
Shunqing Zhang
Shan Cao
author_sort Xiufeng Jiang
title Arbitrary-Shaped Text Detection With Adaptive Text Region Representation
title_short Arbitrary-Shaped Text Detection With Adaptive Text Region Representation
title_full Arbitrary-Shaped Text Detection With Adaptive Text Region Representation
title_fullStr Arbitrary-Shaped Text Detection With Adaptive Text Region Representation
title_full_unstemmed Arbitrary-Shaped Text Detection With Adaptive Text Region Representation
title_sort arbitrary-shaped text detection with adaptive text region representation
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description Text detection/localization, as an important task in computer vision, has witnessed substantial advancements in methodology and performance with convolutional neural networks. However, the vast majority of popular methods use rectangles or quadrangles to describe text regions. These representations have inherent drawbacks, especially relating to dense adjacent text and loose regional text boundaries, which usually cause difficulty detecting arbitrarily shaped text. In this paper, we propose a novel text region representation method, with a robust pipeline, which can precisely detect dense adjacent text instances with arbitrary shapes. We consider a text instance to be composed of an adaptive central text region mask and a corresponding expanding ratio between the central text region and the full text region. More specifically, our pipeline generates adaptive central text regions and corresponding expanding ratios with a proposed training strategy, followed by a new proposed post-processing algorithm which expands central text regions to the complete text instance with the corresponding expanding ratios. We demonstrated that our new text region representation is effective, and that the pipeline can precisely detect closely adjacent text instances of arbitrary shapes. Experimental results on common datasets demonstrate superior performance of our work.
topic Scene text detection
arbitrary-shaped
text region representation
deformable convolutional network
url https://ieeexplore.ieee.org/document/9104986/
work_keys_str_mv AT xiufengjiang arbitraryshapedtextdetectionwithadaptivetextregionrepresentation
AT shugongxu arbitraryshapedtextdetectionwithadaptivetextregionrepresentation
AT shunqingzhang arbitraryshapedtextdetectionwithadaptivetextregionrepresentation
AT shancao arbitraryshapedtextdetectionwithadaptivetextregionrepresentation
_version_ 1724185545521758208