M-SAC-VLADNet: A Multi-Path Deep Feature Coding Model for Visual Classification

Vector of locally aggregated descriptor (VLAD) coding has become an efficient feature coding model for retrieval and classification. In some recent works, the VLAD coding method is extended to a deep feature coding model which is called NetVLAD. NetVLAD improves significantly over the original VLAD...

Full description

Bibliographic Details
Main Authors: Boheng Chen, Jie Li, Gang Wei, Biyun Ma
Format: Article
Language:English
Published: MDPI AG 2018-05-01
Series:Entropy
Subjects:
Online Access:http://www.mdpi.com/1099-4300/20/5/341
id doaj-82989cd358e643bcac744372dc4d49da
record_format Article
spelling doaj-82989cd358e643bcac744372dc4d49da2020-11-24T23:56:53ZengMDPI AGEntropy1099-43002018-05-0120534110.3390/e20050341e20050341M-SAC-VLADNet: A Multi-Path Deep Feature Coding Model for Visual ClassificationBoheng Chen0Jie Li1Gang Wei2Biyun Ma3The National Engineering Technology Research Center for Mobile Ultrasonic Detection, School of Electronics and Information Engineering, South China University of Technology, Guangzhou 510641, ChinaThe National Engineering Technology Research Center for Mobile Ultrasonic Detection, School of Electronics and Information Engineering, South China University of Technology, Guangzhou 510641, ChinaThe National Engineering Technology Research Center for Mobile Ultrasonic Detection, School of Electronics and Information Engineering, South China University of Technology, Guangzhou 510641, ChinaThe National Engineering Technology Research Center for Mobile Ultrasonic Detection, School of Electronics and Information Engineering, South China University of Technology, Guangzhou 510641, ChinaVector of locally aggregated descriptor (VLAD) coding has become an efficient feature coding model for retrieval and classification. In some recent works, the VLAD coding method is extended to a deep feature coding model which is called NetVLAD. NetVLAD improves significantly over the original VLAD method. Although the NetVLAD model has shown its potential for retrieval and classification, the discriminative ability is not fully researched. In this paper, we propose a new end-to-end feature coding network which is more discriminative than the NetVLAD model. First, we propose a sparsely-adaptive and covariance VLAD model. Next, we derive the back propagation models of all the proposed layers and extend the proposed feature coding model to an end-to-end neural network. Finally, we construct a multi-path feature coding network which aggregates multiple newly-designed feature coding networks for visual classification. Some experimental results show that our feature coding network is very effective for visual classification.http://www.mdpi.com/1099-4300/20/5/341deep convolutional networkdeep feature coding networkmulti-path feature coding networksparsely-adaptive and covariance VLAD codingvisual classification
collection DOAJ
language English
format Article
sources DOAJ
author Boheng Chen
Jie Li
Gang Wei
Biyun Ma
spellingShingle Boheng Chen
Jie Li
Gang Wei
Biyun Ma
M-SAC-VLADNet: A Multi-Path Deep Feature Coding Model for Visual Classification
Entropy
deep convolutional network
deep feature coding network
multi-path feature coding network
sparsely-adaptive and covariance VLAD coding
visual classification
author_facet Boheng Chen
Jie Li
Gang Wei
Biyun Ma
author_sort Boheng Chen
title M-SAC-VLADNet: A Multi-Path Deep Feature Coding Model for Visual Classification
title_short M-SAC-VLADNet: A Multi-Path Deep Feature Coding Model for Visual Classification
title_full M-SAC-VLADNet: A Multi-Path Deep Feature Coding Model for Visual Classification
title_fullStr M-SAC-VLADNet: A Multi-Path Deep Feature Coding Model for Visual Classification
title_full_unstemmed M-SAC-VLADNet: A Multi-Path Deep Feature Coding Model for Visual Classification
title_sort m-sac-vladnet: a multi-path deep feature coding model for visual classification
publisher MDPI AG
series Entropy
issn 1099-4300
publishDate 2018-05-01
description Vector of locally aggregated descriptor (VLAD) coding has become an efficient feature coding model for retrieval and classification. In some recent works, the VLAD coding method is extended to a deep feature coding model which is called NetVLAD. NetVLAD improves significantly over the original VLAD method. Although the NetVLAD model has shown its potential for retrieval and classification, the discriminative ability is not fully researched. In this paper, we propose a new end-to-end feature coding network which is more discriminative than the NetVLAD model. First, we propose a sparsely-adaptive and covariance VLAD model. Next, we derive the back propagation models of all the proposed layers and extend the proposed feature coding model to an end-to-end neural network. Finally, we construct a multi-path feature coding network which aggregates multiple newly-designed feature coding networks for visual classification. Some experimental results show that our feature coding network is very effective for visual classification.
topic deep convolutional network
deep feature coding network
multi-path feature coding network
sparsely-adaptive and covariance VLAD coding
visual classification
url http://www.mdpi.com/1099-4300/20/5/341
work_keys_str_mv AT bohengchen msacvladnetamultipathdeepfeaturecodingmodelforvisualclassification
AT jieli msacvladnetamultipathdeepfeaturecodingmodelforvisualclassification
AT gangwei msacvladnetamultipathdeepfeaturecodingmodelforvisualclassification
AT biyunma msacvladnetamultipathdeepfeaturecodingmodelforvisualclassification
_version_ 1725455998103060480