Convolutional Attention Network with Maximizing Mutual Information for Fine-Grained Image Classification

Fine-grained image classification has seen a great improvement benefiting from the advantages of deep learning techniques. Most fine-grained image classification methods focus on extracting discriminative features and combining the global features with the local ones. However, the accuracy is limite...

Full description

Bibliographic Details
Main Authors: Fenglei Wang, Hao Zhou, Shuohao Li, Jun Lei, Jun Zhang
Format: Article
Language:English
Published: MDPI AG 2020-09-01
Series:Symmetry
Subjects:
Online Access:https://www.mdpi.com/2073-8994/12/9/1511
id doaj-04b49d45a6f641ffb07f707701c5bdef
record_format Article
spelling doaj-04b49d45a6f641ffb07f707701c5bdef2020-11-25T03:06:34ZengMDPI AGSymmetry2073-89942020-09-01121511151110.3390/sym12091511Convolutional Attention Network with Maximizing Mutual Information for Fine-Grained Image ClassificationFenglei Wang0Hao Zhou1Shuohao Li2Jun Lei3Jun Zhang4Science and Technology on Information Systems Engineering Laboratory, National University of Defense Technology, Changsha 410003, ChinaScience and Technology on Information Systems Engineering Laboratory, National University of Defense Technology, Changsha 410003, ChinaScience and Technology on Information Systems Engineering Laboratory, National University of Defense Technology, Changsha 410003, ChinaScience and Technology on Information Systems Engineering Laboratory, National University of Defense Technology, Changsha 410003, ChinaScience and Technology on Information Systems Engineering Laboratory, National University of Defense Technology, Changsha 410003, ChinaFine-grained image classification has seen a great improvement benefiting from the advantages of deep learning techniques. Most fine-grained image classification methods focus on extracting discriminative features and combining the global features with the local ones. However, the accuracy is limited due to the inter-class similarity and the inner-class divergence as well as the lack of enough labelled images to train a deep network which can generalize to fine-grained classes. To deal with these problems, we develop an algorithm which combines Maximizing the Mutual Information (MMI) with the Learning Attention (LA). We make use of MMI to distill knowledge from the image pairs which contain the same object. Meanwhile we take advantage of the LA mechanism to find the salient region of the image to enhance the information distillation. Our model can extract more discriminative semantic features and improve the performance on fine-grained image classification. Our model has a symmetric structure, in which the paired images are inputted into the same network to extract the local and global features for the subsequent MMI and LA modules. We train the model by maximizing the mutual information and minimizing the cross-entropy stage by stage alternatively. Experiments show that our model can improve the performance of the fine-grained image classification effectively.https://www.mdpi.com/2073-8994/12/9/1511image classificationmutual informationfine-grainedattention network
collection DOAJ
language English
format Article
sources DOAJ
author Fenglei Wang
Hao Zhou
Shuohao Li
Jun Lei
Jun Zhang
spellingShingle Fenglei Wang
Hao Zhou
Shuohao Li
Jun Lei
Jun Zhang
Convolutional Attention Network with Maximizing Mutual Information for Fine-Grained Image Classification
Symmetry
image classification
mutual information
fine-grained
attention network
author_facet Fenglei Wang
Hao Zhou
Shuohao Li
Jun Lei
Jun Zhang
author_sort Fenglei Wang
title Convolutional Attention Network with Maximizing Mutual Information for Fine-Grained Image Classification
title_short Convolutional Attention Network with Maximizing Mutual Information for Fine-Grained Image Classification
title_full Convolutional Attention Network with Maximizing Mutual Information for Fine-Grained Image Classification
title_fullStr Convolutional Attention Network with Maximizing Mutual Information for Fine-Grained Image Classification
title_full_unstemmed Convolutional Attention Network with Maximizing Mutual Information for Fine-Grained Image Classification
title_sort convolutional attention network with maximizing mutual information for fine-grained image classification
publisher MDPI AG
series Symmetry
issn 2073-8994
publishDate 2020-09-01
description Fine-grained image classification has seen a great improvement benefiting from the advantages of deep learning techniques. Most fine-grained image classification methods focus on extracting discriminative features and combining the global features with the local ones. However, the accuracy is limited due to the inter-class similarity and the inner-class divergence as well as the lack of enough labelled images to train a deep network which can generalize to fine-grained classes. To deal with these problems, we develop an algorithm which combines Maximizing the Mutual Information (MMI) with the Learning Attention (LA). We make use of MMI to distill knowledge from the image pairs which contain the same object. Meanwhile we take advantage of the LA mechanism to find the salient region of the image to enhance the information distillation. Our model can extract more discriminative semantic features and improve the performance on fine-grained image classification. Our model has a symmetric structure, in which the paired images are inputted into the same network to extract the local and global features for the subsequent MMI and LA modules. We train the model by maximizing the mutual information and minimizing the cross-entropy stage by stage alternatively. Experiments show that our model can improve the performance of the fine-grained image classification effectively.
topic image classification
mutual information
fine-grained
attention network
url https://www.mdpi.com/2073-8994/12/9/1511
work_keys_str_mv AT fengleiwang convolutionalattentionnetworkwithmaximizingmutualinformationforfinegrainedimageclassification
AT haozhou convolutionalattentionnetworkwithmaximizingmutualinformationforfinegrainedimageclassification
AT shuohaoli convolutionalattentionnetworkwithmaximizingmutualinformationforfinegrainedimageclassification
AT junlei convolutionalattentionnetworkwithmaximizingmutualinformationforfinegrainedimageclassification
AT junzhang convolutionalattentionnetworkwithmaximizingmutualinformationforfinegrainedimageclassification
_version_ 1724673613435502592