Bilinear CNN Model for Fine-Grained Classification Based on Subcategory-Similarity Measurement

One of the challenges in fine-grained classification is that subcategories with significant similarity are hard to be distinguished due to the equal treatment of all subcategories in existing algorithms. In order to solve this problem, a fine-grained image classification method by combining a biline...

Full description

Bibliographic Details
Main Authors: Xinghua Dai, Shengrong Gong, Shan Zhong, Zongming Bao
Format: Article
Language:English
Published: MDPI AG 2019-01-01
Series:Applied Sciences
Subjects:
Online Access:http://www.mdpi.com/2076-3417/9/2/301
id doaj-55f04f39b34842b1bdb67d164be6e6dc
record_format Article
spelling doaj-55f04f39b34842b1bdb67d164be6e6dc2020-11-24T22:16:24ZengMDPI AGApplied Sciences2076-34172019-01-019230110.3390/app9020301app9020301Bilinear CNN Model for Fine-Grained Classification Based on Subcategory-Similarity MeasurementXinghua Dai0Shengrong Gong1Shan Zhong2Zongming Bao3School of Computer Science and Technology, Soochow University, Suzhou 215000, ChinaSchool of Computer Science and Technology, Soochow University, Suzhou 215000, ChinaSchool of Computer Science and Engineering, Changshu Institute of Technology, Suzhou 215000, ChinaSchool of Computer Science and Technology, Soochow University, Suzhou 215000, ChinaOne of the challenges in fine-grained classification is that subcategories with significant similarity are hard to be distinguished due to the equal treatment of all subcategories in existing algorithms. In order to solve this problem, a fine-grained image classification method by combining a bilinear convolutional neural network (B-CNN) and the measurement of subcategory similarities is proposed. Firstly, an improved weakly supervised localization method is designed to obtain the bounding box of the main object, which allows the model to eliminate the influence of background noise and obtain more accurate features. Then, sample features in the training set are computed by B-CNN so that the fuzzing similarity matrix for measuring interclass similarities can be obtained. To further improve classification accuracy, the loss function is designed by weighting triplet loss and softmax loss. Extensive experiments implemented on two benchmarks datasets, Stanford Cars-196 and Caltech-UCSD Birds-200-2011 (CUB-200-2011), show that the newly proposed method outperforms in accuracy several state-of-the-art weakly supervised classification models.http://www.mdpi.com/2076-3417/9/2/301fine-grained classificationB-CNNweakly supervised localizationloss function
collection DOAJ
language English
format Article
sources DOAJ
author Xinghua Dai
Shengrong Gong
Shan Zhong
Zongming Bao
spellingShingle Xinghua Dai
Shengrong Gong
Shan Zhong
Zongming Bao
Bilinear CNN Model for Fine-Grained Classification Based on Subcategory-Similarity Measurement
Applied Sciences
fine-grained classification
B-CNN
weakly supervised localization
loss function
author_facet Xinghua Dai
Shengrong Gong
Shan Zhong
Zongming Bao
author_sort Xinghua Dai
title Bilinear CNN Model for Fine-Grained Classification Based on Subcategory-Similarity Measurement
title_short Bilinear CNN Model for Fine-Grained Classification Based on Subcategory-Similarity Measurement
title_full Bilinear CNN Model for Fine-Grained Classification Based on Subcategory-Similarity Measurement
title_fullStr Bilinear CNN Model for Fine-Grained Classification Based on Subcategory-Similarity Measurement
title_full_unstemmed Bilinear CNN Model for Fine-Grained Classification Based on Subcategory-Similarity Measurement
title_sort bilinear cnn model for fine-grained classification based on subcategory-similarity measurement
publisher MDPI AG
series Applied Sciences
issn 2076-3417
publishDate 2019-01-01
description One of the challenges in fine-grained classification is that subcategories with significant similarity are hard to be distinguished due to the equal treatment of all subcategories in existing algorithms. In order to solve this problem, a fine-grained image classification method by combining a bilinear convolutional neural network (B-CNN) and the measurement of subcategory similarities is proposed. Firstly, an improved weakly supervised localization method is designed to obtain the bounding box of the main object, which allows the model to eliminate the influence of background noise and obtain more accurate features. Then, sample features in the training set are computed by B-CNN so that the fuzzing similarity matrix for measuring interclass similarities can be obtained. To further improve classification accuracy, the loss function is designed by weighting triplet loss and softmax loss. Extensive experiments implemented on two benchmarks datasets, Stanford Cars-196 and Caltech-UCSD Birds-200-2011 (CUB-200-2011), show that the newly proposed method outperforms in accuracy several state-of-the-art weakly supervised classification models.
topic fine-grained classification
B-CNN
weakly supervised localization
loss function
url http://www.mdpi.com/2076-3417/9/2/301
work_keys_str_mv AT xinghuadai bilinearcnnmodelforfinegrainedclassificationbasedonsubcategorysimilaritymeasurement
AT shengronggong bilinearcnnmodelforfinegrainedclassificationbasedonsubcategorysimilaritymeasurement
AT shanzhong bilinearcnnmodelforfinegrainedclassificationbasedonsubcategorysimilaritymeasurement
AT zongmingbao bilinearcnnmodelforfinegrainedclassificationbasedonsubcategorysimilaritymeasurement
_version_ 1725790090882449408