A Hybrid Deep Learning Architecture for Latent Topic-based Image Retrieval
Abstract Learning effective feature descriptors that bridge the semantic gap between low-level visual features directly extracted from image pixels and the corresponding high-level semantics perceived by humans is a challenging task in image retrieval. This paper proposes a hybrid deep learning arch...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
SpringerOpen
2018-04-01
|
Series: | Data Science and Engineering |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1007/s41019-018-0063-7 |
id |
doaj-0654c4b16e184eb9b3ac42650b419ed7 |
---|---|
record_format |
Article |
spelling |
doaj-0654c4b16e184eb9b3ac42650b419ed72021-03-02T10:38:40ZengSpringerOpenData Science and Engineering2364-11852364-15412018-04-013216619510.1007/s41019-018-0063-7A Hybrid Deep Learning Architecture for Latent Topic-based Image RetrievalK. S. Arun0V. K. Govindan1Department of Computer Science and Engineering, National Institute of Technology CalicutDepartment of Computer Science and Engineering, National Institute of Technology CalicutAbstract Learning effective feature descriptors that bridge the semantic gap between low-level visual features directly extracted from image pixels and the corresponding high-level semantics perceived by humans is a challenging task in image retrieval. This paper proposes a hybrid deep learning architecture (HDLA) that generates sparse latent topic-based representation with the objective of minimizing the semantic gap problem in image retrieval. In fact, HDLA has a deep network structure with a constrained replicated Softmax Model in the lower layer and constrained restricted Boltzmann machines in the upper layers. The advantage of HDLA is that there exist nonnegativity restrictions on the model weights together with $$\ell _1$$ ℓ1 -sparsity enforced over the activations of the hidden layer nodes of the network. This, in turn, enhances the modeling power of the network and leads to sparse, parts-based latent topic representation of images. Experimental results on various benchmark datasets show that the proposed model exhibits better generalization ability and the resulting high-level abstraction yields better retrieval performance as compared to state-of-the-art latent topic-based image representation schemes.http://link.springer.com/article/10.1007/s41019-018-0063-7Image retrievalDeep learningLatent topics |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
K. S. Arun V. K. Govindan |
spellingShingle |
K. S. Arun V. K. Govindan A Hybrid Deep Learning Architecture for Latent Topic-based Image Retrieval Data Science and Engineering Image retrieval Deep learning Latent topics |
author_facet |
K. S. Arun V. K. Govindan |
author_sort |
K. S. Arun |
title |
A Hybrid Deep Learning Architecture for Latent Topic-based Image Retrieval |
title_short |
A Hybrid Deep Learning Architecture for Latent Topic-based Image Retrieval |
title_full |
A Hybrid Deep Learning Architecture for Latent Topic-based Image Retrieval |
title_fullStr |
A Hybrid Deep Learning Architecture for Latent Topic-based Image Retrieval |
title_full_unstemmed |
A Hybrid Deep Learning Architecture for Latent Topic-based Image Retrieval |
title_sort |
hybrid deep learning architecture for latent topic-based image retrieval |
publisher |
SpringerOpen |
series |
Data Science and Engineering |
issn |
2364-1185 2364-1541 |
publishDate |
2018-04-01 |
description |
Abstract Learning effective feature descriptors that bridge the semantic gap between low-level visual features directly extracted from image pixels and the corresponding high-level semantics perceived by humans is a challenging task in image retrieval. This paper proposes a hybrid deep learning architecture (HDLA) that generates sparse latent topic-based representation with the objective of minimizing the semantic gap problem in image retrieval. In fact, HDLA has a deep network structure with a constrained replicated Softmax Model in the lower layer and constrained restricted Boltzmann machines in the upper layers. The advantage of HDLA is that there exist nonnegativity restrictions on the model weights together with $$\ell _1$$ ℓ1 -sparsity enforced over the activations of the hidden layer nodes of the network. This, in turn, enhances the modeling power of the network and leads to sparse, parts-based latent topic representation of images. Experimental results on various benchmark datasets show that the proposed model exhibits better generalization ability and the resulting high-level abstraction yields better retrieval performance as compared to state-of-the-art latent topic-based image representation schemes. |
topic |
Image retrieval Deep learning Latent topics |
url |
http://link.springer.com/article/10.1007/s41019-018-0063-7 |
work_keys_str_mv |
AT ksarun ahybriddeeplearningarchitectureforlatenttopicbasedimageretrieval AT vkgovindan ahybriddeeplearningarchitectureforlatenttopicbasedimageretrieval AT ksarun hybriddeeplearningarchitectureforlatenttopicbasedimageretrieval AT vkgovindan hybriddeeplearningarchitectureforlatenttopicbasedimageretrieval |
_version_ |
1724236379979776000 |