Selective Multi-Scale Feature Learning by Discriminative Local Representation

In the computer vision community, the general trend has been to capture and select discriminative features in order to yield significantly better performance. Recent advances in attention mechanism proposed several attention blocks to adaptively recalibrate the feature response. However, most of the...

Full description

Bibliographic Details
Main Authors: Chengji Xu, Xiaofeng Wang, Yadong Yang
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8825814/
id doaj-4a2c980a79d4491ebd97732bf72e9323
record_format Article
spelling doaj-4a2c980a79d4491ebd97732bf72e93232021-03-29T23:42:25ZengIEEEIEEE Access2169-35362019-01-01712732712733810.1109/ACCESS.2019.29397168825814Selective Multi-Scale Feature Learning by Discriminative Local RepresentationChengji Xu0https://orcid.org/0000-0003-0725-0902Xiaofeng Wang1Yadong Yang2College of Information Engineering, Shanghai Maritime University, Shanghai, ChinaCollege of Information Engineering, Shanghai Maritime University, Shanghai, ChinaCollege of Information Engineering, Shanghai Maritime University, Shanghai, ChinaIn the computer vision community, the general trend has been to capture and select discriminative features in order to yield significantly better performance. Recent advances in attention mechanism proposed several attention blocks to adaptively recalibrate the feature response. However, most of them overlooked the context information at a multi-scale level. In this paper, we propose a simple yet effective building block for ResNeXt-style backbones, namely discriminative local representation (DLR) module, which allows discriminative local representation learning for multi-scale feature information across multi-parallel branches. Our DLR module contains two sub-modules: channel selective module (CSM) and spatial selective module (SSM). Given an intermediate feature map, the CSM first selectively generates the channel-wise attention maps and recalibrates the response from different branches according to the weight vector calculated by softmax layer. And then, the SSM further captures the spatial discriminative information at different scales respectively and emphasizes the interdependent channel maps. Besides, we place a high-order item during the process of multi-branch fusion and residual connection to enhance the intensity of structure nonlinearity. Various DLR modules can be stacked to a deep convolution network named DLRNet. To validate our DLRNet, we conduct comprehensive experiments on classification benchmarks (i.e. CIFAR10, CIFAR100 and ImageNet-1K), as well as two publicly available fine-grained datasets (i.e. CUB-200-2011 and Stanford Dogs). The experiments show consistent improvement gains over previous baseline models with reasonable overhead, and demonstrate the capability of our proposed method for discriminative local representation.https://ieeexplore.ieee.org/document/8825814/Attention mechanismdiscriminative local representationmulti-scale feature informationhigh-order item
collection DOAJ
language English
format Article
sources DOAJ
author Chengji Xu
Xiaofeng Wang
Yadong Yang
spellingShingle Chengji Xu
Xiaofeng Wang
Yadong Yang
Selective Multi-Scale Feature Learning by Discriminative Local Representation
IEEE Access
Attention mechanism
discriminative local representation
multi-scale feature information
high-order item
author_facet Chengji Xu
Xiaofeng Wang
Yadong Yang
author_sort Chengji Xu
title Selective Multi-Scale Feature Learning by Discriminative Local Representation
title_short Selective Multi-Scale Feature Learning by Discriminative Local Representation
title_full Selective Multi-Scale Feature Learning by Discriminative Local Representation
title_fullStr Selective Multi-Scale Feature Learning by Discriminative Local Representation
title_full_unstemmed Selective Multi-Scale Feature Learning by Discriminative Local Representation
title_sort selective multi-scale feature learning by discriminative local representation
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description In the computer vision community, the general trend has been to capture and select discriminative features in order to yield significantly better performance. Recent advances in attention mechanism proposed several attention blocks to adaptively recalibrate the feature response. However, most of them overlooked the context information at a multi-scale level. In this paper, we propose a simple yet effective building block for ResNeXt-style backbones, namely discriminative local representation (DLR) module, which allows discriminative local representation learning for multi-scale feature information across multi-parallel branches. Our DLR module contains two sub-modules: channel selective module (CSM) and spatial selective module (SSM). Given an intermediate feature map, the CSM first selectively generates the channel-wise attention maps and recalibrates the response from different branches according to the weight vector calculated by softmax layer. And then, the SSM further captures the spatial discriminative information at different scales respectively and emphasizes the interdependent channel maps. Besides, we place a high-order item during the process of multi-branch fusion and residual connection to enhance the intensity of structure nonlinearity. Various DLR modules can be stacked to a deep convolution network named DLRNet. To validate our DLRNet, we conduct comprehensive experiments on classification benchmarks (i.e. CIFAR10, CIFAR100 and ImageNet-1K), as well as two publicly available fine-grained datasets (i.e. CUB-200-2011 and Stanford Dogs). The experiments show consistent improvement gains over previous baseline models with reasonable overhead, and demonstrate the capability of our proposed method for discriminative local representation.
topic Attention mechanism
discriminative local representation
multi-scale feature information
high-order item
url https://ieeexplore.ieee.org/document/8825814/
work_keys_str_mv AT chengjixu selectivemultiscalefeaturelearningbydiscriminativelocalrepresentation
AT xiaofengwang selectivemultiscalefeaturelearningbydiscriminativelocalrepresentation
AT yadongyang selectivemultiscalefeaturelearningbydiscriminativelocalrepresentation
_version_ 1724189075006554112