A Hybrid Deep Learning Architecture for Latent Topic-based Image Retrieval

Abstract Learning effective feature descriptors that bridge the semantic gap between low-level visual features directly extracted from image pixels and the corresponding high-level semantics perceived by humans is a challenging task in image retrieval. This paper proposes a hybrid deep learning arch...

Full description

Bibliographic Details
Main Authors: K. S. Arun, V. K. Govindan
Format: Article
Language:English
Published: SpringerOpen 2018-04-01
Series:Data Science and Engineering
Subjects:
Online Access:http://link.springer.com/article/10.1007/s41019-018-0063-7
id doaj-0654c4b16e184eb9b3ac42650b419ed7
record_format Article
spelling doaj-0654c4b16e184eb9b3ac42650b419ed72021-03-02T10:38:40ZengSpringerOpenData Science and Engineering2364-11852364-15412018-04-013216619510.1007/s41019-018-0063-7A Hybrid Deep Learning Architecture for Latent Topic-based Image RetrievalK. S. Arun0V. K. Govindan1Department of Computer Science and Engineering, National Institute of Technology CalicutDepartment of Computer Science and Engineering, National Institute of Technology CalicutAbstract Learning effective feature descriptors that bridge the semantic gap between low-level visual features directly extracted from image pixels and the corresponding high-level semantics perceived by humans is a challenging task in image retrieval. This paper proposes a hybrid deep learning architecture (HDLA) that generates sparse latent topic-based representation with the objective of minimizing the semantic gap problem in image retrieval. In fact, HDLA has a deep network structure with a constrained replicated Softmax Model in the lower layer and constrained restricted Boltzmann machines in the upper layers. The advantage of HDLA is that there exist nonnegativity restrictions on the model weights together with $$\ell _1$$ ℓ1 -sparsity enforced over the activations of the hidden layer nodes of the network. This, in turn, enhances the modeling power of the network and leads to sparse, parts-based latent topic representation of images. Experimental results on various benchmark datasets show that the proposed model exhibits better generalization ability and the resulting high-level abstraction yields better retrieval performance as compared to state-of-the-art latent topic-based image representation schemes.http://link.springer.com/article/10.1007/s41019-018-0063-7Image retrievalDeep learningLatent topics
collection DOAJ
language English
format Article
sources DOAJ
author K. S. Arun
V. K. Govindan
spellingShingle K. S. Arun
V. K. Govindan
A Hybrid Deep Learning Architecture for Latent Topic-based Image Retrieval
Data Science and Engineering
Image retrieval
Deep learning
Latent topics
author_facet K. S. Arun
V. K. Govindan
author_sort K. S. Arun
title A Hybrid Deep Learning Architecture for Latent Topic-based Image Retrieval
title_short A Hybrid Deep Learning Architecture for Latent Topic-based Image Retrieval
title_full A Hybrid Deep Learning Architecture for Latent Topic-based Image Retrieval
title_fullStr A Hybrid Deep Learning Architecture for Latent Topic-based Image Retrieval
title_full_unstemmed A Hybrid Deep Learning Architecture for Latent Topic-based Image Retrieval
title_sort hybrid deep learning architecture for latent topic-based image retrieval
publisher SpringerOpen
series Data Science and Engineering
issn 2364-1185
2364-1541
publishDate 2018-04-01
description Abstract Learning effective feature descriptors that bridge the semantic gap between low-level visual features directly extracted from image pixels and the corresponding high-level semantics perceived by humans is a challenging task in image retrieval. This paper proposes a hybrid deep learning architecture (HDLA) that generates sparse latent topic-based representation with the objective of minimizing the semantic gap problem in image retrieval. In fact, HDLA has a deep network structure with a constrained replicated Softmax Model in the lower layer and constrained restricted Boltzmann machines in the upper layers. The advantage of HDLA is that there exist nonnegativity restrictions on the model weights together with $$\ell _1$$ ℓ1 -sparsity enforced over the activations of the hidden layer nodes of the network. This, in turn, enhances the modeling power of the network and leads to sparse, parts-based latent topic representation of images. Experimental results on various benchmark datasets show that the proposed model exhibits better generalization ability and the resulting high-level abstraction yields better retrieval performance as compared to state-of-the-art latent topic-based image representation schemes.
topic Image retrieval
Deep learning
Latent topics
url http://link.springer.com/article/10.1007/s41019-018-0063-7
work_keys_str_mv AT ksarun ahybriddeeplearningarchitectureforlatenttopicbasedimageretrieval
AT vkgovindan ahybriddeeplearningarchitectureforlatenttopicbasedimageretrieval
AT ksarun hybriddeeplearningarchitectureforlatenttopicbasedimageretrieval
AT vkgovindan hybriddeeplearningarchitectureforlatenttopicbasedimageretrieval
_version_ 1724236379979776000