AVBH: Asymmetric Learning to Hash with Variable Bit Encoding

Nearest neighbour search (NNS) is the core of large data retrieval. Learning to hash is an effective way to solve the problems by representing high-dimensional data into a compact binary code. However, existing learning to hash methods needs long bit encoding to ensure the accuracy of query, and lon...

Full description

Bibliographic Details
Main Authors: Yanduo Ren, Jiangbo Qian, Yihong Dong, Yu Xin, Huahui Chen
Format: Article
Language:English
Published: Hindawi Limited 2020-01-01
Series:Scientific Programming
Online Access:http://dx.doi.org/10.1155/2020/2424381
id doaj-83a33ecff8da4d8fae7d4e780d3f1432
record_format Article
spelling doaj-83a33ecff8da4d8fae7d4e780d3f14322021-07-02T05:18:11ZengHindawi LimitedScientific Programming1058-92441875-919X2020-01-01202010.1155/2020/24243812424381AVBH: Asymmetric Learning to Hash with Variable Bit EncodingYanduo Ren0Jiangbo Qian1Yihong Dong2Yu Xin3Huahui Chen4Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, Zhejiang 315211, ChinaFaculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, Zhejiang 315211, ChinaFaculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, Zhejiang 315211, ChinaFaculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, Zhejiang 315211, ChinaFaculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, Zhejiang 315211, ChinaNearest neighbour search (NNS) is the core of large data retrieval. Learning to hash is an effective way to solve the problems by representing high-dimensional data into a compact binary code. However, existing learning to hash methods needs long bit encoding to ensure the accuracy of query, and long bit encoding brings large cost of storage, which severely restricts the long bit encoding in the application of big data. An asymmetric learning to hash with variable bit encoding algorithm (AVBH) is proposed to solve the problem. The AVBH hash algorithm uses two types of hash mapping functions to encode the dataset and the query set into different length bits. For datasets, the hash code frequencies of datasets after random Fourier feature encoding are statistically analysed. The hash code with high frequency is compressed into a longer coding representation, and the hash code with low frequency is compressed into a shorter coding representation. The query point is quantized to a long bit hash code and compared with the same length cascade concatenated data point. Experiments on public datasets show that the proposed algorithm effectively reduces the cost of storage and improves the accuracy of query.http://dx.doi.org/10.1155/2020/2424381
collection DOAJ
language English
format Article
sources DOAJ
author Yanduo Ren
Jiangbo Qian
Yihong Dong
Yu Xin
Huahui Chen
spellingShingle Yanduo Ren
Jiangbo Qian
Yihong Dong
Yu Xin
Huahui Chen
AVBH: Asymmetric Learning to Hash with Variable Bit Encoding
Scientific Programming
author_facet Yanduo Ren
Jiangbo Qian
Yihong Dong
Yu Xin
Huahui Chen
author_sort Yanduo Ren
title AVBH: Asymmetric Learning to Hash with Variable Bit Encoding
title_short AVBH: Asymmetric Learning to Hash with Variable Bit Encoding
title_full AVBH: Asymmetric Learning to Hash with Variable Bit Encoding
title_fullStr AVBH: Asymmetric Learning to Hash with Variable Bit Encoding
title_full_unstemmed AVBH: Asymmetric Learning to Hash with Variable Bit Encoding
title_sort avbh: asymmetric learning to hash with variable bit encoding
publisher Hindawi Limited
series Scientific Programming
issn 1058-9244
1875-919X
publishDate 2020-01-01
description Nearest neighbour search (NNS) is the core of large data retrieval. Learning to hash is an effective way to solve the problems by representing high-dimensional data into a compact binary code. However, existing learning to hash methods needs long bit encoding to ensure the accuracy of query, and long bit encoding brings large cost of storage, which severely restricts the long bit encoding in the application of big data. An asymmetric learning to hash with variable bit encoding algorithm (AVBH) is proposed to solve the problem. The AVBH hash algorithm uses two types of hash mapping functions to encode the dataset and the query set into different length bits. For datasets, the hash code frequencies of datasets after random Fourier feature encoding are statistically analysed. The hash code with high frequency is compressed into a longer coding representation, and the hash code with low frequency is compressed into a shorter coding representation. The query point is quantized to a long bit hash code and compared with the same length cascade concatenated data point. Experiments on public datasets show that the proposed algorithm effectively reduces the cost of storage and improves the accuracy of query.
url http://dx.doi.org/10.1155/2020/2424381
work_keys_str_mv AT yanduoren avbhasymmetriclearningtohashwithvariablebitencoding
AT jiangboqian avbhasymmetriclearningtohashwithvariablebitencoding
AT yihongdong avbhasymmetriclearningtohashwithvariablebitencoding
AT yuxin avbhasymmetriclearningtohashwithvariablebitencoding
AT huahuichen avbhasymmetriclearningtohashwithvariablebitencoding
_version_ 1721338878777884672