Annular Spatial Pyramid Mapping and Feature Fusion-Based Image Coding Representation and Classification

Conventional image classification models commonly adopt a single feature vector to represent informative contents. However, a single image feature system can hardly extract the entirety of the information contained in images, and traditional encoding methods have a large loss of feature information....

Full description

Bibliographic Details
Main Authors: Mengxi Xu, Yingshu Lu, Xiaobin Wu
Format: Article
Language:English
Published: Hindawi-Wiley 2020-01-01
Series:Wireless Communications and Mobile Computing
Online Access:http://dx.doi.org/10.1155/2020/8838454
id doaj-1a638a5762874325bdd7b0bceed5f897
record_format Article
spelling doaj-1a638a5762874325bdd7b0bceed5f8972020-11-25T03:42:49ZengHindawi-WileyWireless Communications and Mobile Computing1530-86691530-86772020-01-01202010.1155/2020/88384548838454Annular Spatial Pyramid Mapping and Feature Fusion-Based Image Coding Representation and ClassificationMengxi Xu0Yingshu Lu1Xiaobin Wu2School of Computer Engineering, Nanjing Institute of Technology, Nanjing 211167, ChinaHuawei Technologies Co., Ltd., Nanjing 210000, ChinaSchool of Computer Engineering, Nanjing Institute of Technology, Nanjing 211167, ChinaConventional image classification models commonly adopt a single feature vector to represent informative contents. However, a single image feature system can hardly extract the entirety of the information contained in images, and traditional encoding methods have a large loss of feature information. Aiming to solve this problem, this paper proposes a feature fusion-based image classification model. This model combines the principal component analysis (PCA) algorithm, processed scale invariant feature transform (P-SIFT) and color naming (CN) features to generate mutually independent image representation factors. At the encoding stage of the scale-invariant feature transform (SIFT) feature, the bag-of-visual-word model (BOVW) is used for feature reconstruction. Simultaneously, in order to introduce the spatial information to our extracted features, the rotation invariant spatial pyramid mapping method is introduced for the P-SIFT and CN feature division and representation. At the stage of feature fusion, we adopt a support vector machine with two kernels (SVM-2K) algorithm, which divides the training process into two stages and finally learns the knowledge from the corresponding kernel matrix for the classification performance improvement. The experiments show that the proposed method can effectively improve the accuracy of image description and the precision of image classification.http://dx.doi.org/10.1155/2020/8838454
collection DOAJ
language English
format Article
sources DOAJ
author Mengxi Xu
Yingshu Lu
Xiaobin Wu
spellingShingle Mengxi Xu
Yingshu Lu
Xiaobin Wu
Annular Spatial Pyramid Mapping and Feature Fusion-Based Image Coding Representation and Classification
Wireless Communications and Mobile Computing
author_facet Mengxi Xu
Yingshu Lu
Xiaobin Wu
author_sort Mengxi Xu
title Annular Spatial Pyramid Mapping and Feature Fusion-Based Image Coding Representation and Classification
title_short Annular Spatial Pyramid Mapping and Feature Fusion-Based Image Coding Representation and Classification
title_full Annular Spatial Pyramid Mapping and Feature Fusion-Based Image Coding Representation and Classification
title_fullStr Annular Spatial Pyramid Mapping and Feature Fusion-Based Image Coding Representation and Classification
title_full_unstemmed Annular Spatial Pyramid Mapping and Feature Fusion-Based Image Coding Representation and Classification
title_sort annular spatial pyramid mapping and feature fusion-based image coding representation and classification
publisher Hindawi-Wiley
series Wireless Communications and Mobile Computing
issn 1530-8669
1530-8677
publishDate 2020-01-01
description Conventional image classification models commonly adopt a single feature vector to represent informative contents. However, a single image feature system can hardly extract the entirety of the information contained in images, and traditional encoding methods have a large loss of feature information. Aiming to solve this problem, this paper proposes a feature fusion-based image classification model. This model combines the principal component analysis (PCA) algorithm, processed scale invariant feature transform (P-SIFT) and color naming (CN) features to generate mutually independent image representation factors. At the encoding stage of the scale-invariant feature transform (SIFT) feature, the bag-of-visual-word model (BOVW) is used for feature reconstruction. Simultaneously, in order to introduce the spatial information to our extracted features, the rotation invariant spatial pyramid mapping method is introduced for the P-SIFT and CN feature division and representation. At the stage of feature fusion, we adopt a support vector machine with two kernels (SVM-2K) algorithm, which divides the training process into two stages and finally learns the knowledge from the corresponding kernel matrix for the classification performance improvement. The experiments show that the proposed method can effectively improve the accuracy of image description and the precision of image classification.
url http://dx.doi.org/10.1155/2020/8838454
work_keys_str_mv AT mengxixu annularspatialpyramidmappingandfeaturefusionbasedimagecodingrepresentationandclassification
AT yingshulu annularspatialpyramidmappingandfeaturefusionbasedimagecodingrepresentationandclassification
AT xiaobinwu annularspatialpyramidmappingandfeaturefusionbasedimagecodingrepresentationandclassification
_version_ 1715137747303268352