Selective Multi-Scale Feature Learning by Discriminative Local Representation
In the computer vision community, the general trend has been to capture and select discriminative features in order to yield significantly better performance. Recent advances in attention mechanism proposed several attention blocks to adaptively recalibrate the feature response. However, most of the...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8825814/ |
id |
doaj-4a2c980a79d4491ebd97732bf72e9323 |
---|---|
record_format |
Article |
spelling |
doaj-4a2c980a79d4491ebd97732bf72e93232021-03-29T23:42:25ZengIEEEIEEE Access2169-35362019-01-01712732712733810.1109/ACCESS.2019.29397168825814Selective Multi-Scale Feature Learning by Discriminative Local RepresentationChengji Xu0https://orcid.org/0000-0003-0725-0902Xiaofeng Wang1Yadong Yang2College of Information Engineering, Shanghai Maritime University, Shanghai, ChinaCollege of Information Engineering, Shanghai Maritime University, Shanghai, ChinaCollege of Information Engineering, Shanghai Maritime University, Shanghai, ChinaIn the computer vision community, the general trend has been to capture and select discriminative features in order to yield significantly better performance. Recent advances in attention mechanism proposed several attention blocks to adaptively recalibrate the feature response. However, most of them overlooked the context information at a multi-scale level. In this paper, we propose a simple yet effective building block for ResNeXt-style backbones, namely discriminative local representation (DLR) module, which allows discriminative local representation learning for multi-scale feature information across multi-parallel branches. Our DLR module contains two sub-modules: channel selective module (CSM) and spatial selective module (SSM). Given an intermediate feature map, the CSM first selectively generates the channel-wise attention maps and recalibrates the response from different branches according to the weight vector calculated by softmax layer. And then, the SSM further captures the spatial discriminative information at different scales respectively and emphasizes the interdependent channel maps. Besides, we place a high-order item during the process of multi-branch fusion and residual connection to enhance the intensity of structure nonlinearity. Various DLR modules can be stacked to a deep convolution network named DLRNet. To validate our DLRNet, we conduct comprehensive experiments on classification benchmarks (i.e. CIFAR10, CIFAR100 and ImageNet-1K), as well as two publicly available fine-grained datasets (i.e. CUB-200-2011 and Stanford Dogs). The experiments show consistent improvement gains over previous baseline models with reasonable overhead, and demonstrate the capability of our proposed method for discriminative local representation.https://ieeexplore.ieee.org/document/8825814/Attention mechanismdiscriminative local representationmulti-scale feature informationhigh-order item |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Chengji Xu Xiaofeng Wang Yadong Yang |
spellingShingle |
Chengji Xu Xiaofeng Wang Yadong Yang Selective Multi-Scale Feature Learning by Discriminative Local Representation IEEE Access Attention mechanism discriminative local representation multi-scale feature information high-order item |
author_facet |
Chengji Xu Xiaofeng Wang Yadong Yang |
author_sort |
Chengji Xu |
title |
Selective Multi-Scale Feature Learning by Discriminative Local Representation |
title_short |
Selective Multi-Scale Feature Learning by Discriminative Local Representation |
title_full |
Selective Multi-Scale Feature Learning by Discriminative Local Representation |
title_fullStr |
Selective Multi-Scale Feature Learning by Discriminative Local Representation |
title_full_unstemmed |
Selective Multi-Scale Feature Learning by Discriminative Local Representation |
title_sort |
selective multi-scale feature learning by discriminative local representation |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2019-01-01 |
description |
In the computer vision community, the general trend has been to capture and select discriminative features in order to yield significantly better performance. Recent advances in attention mechanism proposed several attention blocks to adaptively recalibrate the feature response. However, most of them overlooked the context information at a multi-scale level. In this paper, we propose a simple yet effective building block for ResNeXt-style backbones, namely discriminative local representation (DLR) module, which allows discriminative local representation learning for multi-scale feature information across multi-parallel branches. Our DLR module contains two sub-modules: channel selective module (CSM) and spatial selective module (SSM). Given an intermediate feature map, the CSM first selectively generates the channel-wise attention maps and recalibrates the response from different branches according to the weight vector calculated by softmax layer. And then, the SSM further captures the spatial discriminative information at different scales respectively and emphasizes the interdependent channel maps. Besides, we place a high-order item during the process of multi-branch fusion and residual connection to enhance the intensity of structure nonlinearity. Various DLR modules can be stacked to a deep convolution network named DLRNet. To validate our DLRNet, we conduct comprehensive experiments on classification benchmarks (i.e. CIFAR10, CIFAR100 and ImageNet-1K), as well as two publicly available fine-grained datasets (i.e. CUB-200-2011 and Stanford Dogs). The experiments show consistent improvement gains over previous baseline models with reasonable overhead, and demonstrate the capability of our proposed method for discriminative local representation. |
topic |
Attention mechanism discriminative local representation multi-scale feature information high-order item |
url |
https://ieeexplore.ieee.org/document/8825814/ |
work_keys_str_mv |
AT chengjixu selectivemultiscalefeaturelearningbydiscriminativelocalrepresentation AT xiaofengwang selectivemultiscalefeaturelearningbydiscriminativelocalrepresentation AT yadongyang selectivemultiscalefeaturelearningbydiscriminativelocalrepresentation |
_version_ |
1724189075006554112 |