AVBH: Asymmetric Learning to Hash with Variable Bit Encoding
Nearest neighbour search (NNS) is the core of large data retrieval. Learning to hash is an effective way to solve the problems by representing high-dimensional data into a compact binary code. However, existing learning to hash methods needs long bit encoding to ensure the accuracy of query, and lon...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2020-01-01
|
Series: | Scientific Programming |
Online Access: | http://dx.doi.org/10.1155/2020/2424381 |
id |
doaj-83a33ecff8da4d8fae7d4e780d3f1432 |
---|---|
record_format |
Article |
spelling |
doaj-83a33ecff8da4d8fae7d4e780d3f14322021-07-02T05:18:11ZengHindawi LimitedScientific Programming1058-92441875-919X2020-01-01202010.1155/2020/24243812424381AVBH: Asymmetric Learning to Hash with Variable Bit EncodingYanduo Ren0Jiangbo Qian1Yihong Dong2Yu Xin3Huahui Chen4Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, Zhejiang 315211, ChinaFaculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, Zhejiang 315211, ChinaFaculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, Zhejiang 315211, ChinaFaculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, Zhejiang 315211, ChinaFaculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, Zhejiang 315211, ChinaNearest neighbour search (NNS) is the core of large data retrieval. Learning to hash is an effective way to solve the problems by representing high-dimensional data into a compact binary code. However, existing learning to hash methods needs long bit encoding to ensure the accuracy of query, and long bit encoding brings large cost of storage, which severely restricts the long bit encoding in the application of big data. An asymmetric learning to hash with variable bit encoding algorithm (AVBH) is proposed to solve the problem. The AVBH hash algorithm uses two types of hash mapping functions to encode the dataset and the query set into different length bits. For datasets, the hash code frequencies of datasets after random Fourier feature encoding are statistically analysed. The hash code with high frequency is compressed into a longer coding representation, and the hash code with low frequency is compressed into a shorter coding representation. The query point is quantized to a long bit hash code and compared with the same length cascade concatenated data point. Experiments on public datasets show that the proposed algorithm effectively reduces the cost of storage and improves the accuracy of query.http://dx.doi.org/10.1155/2020/2424381 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yanduo Ren Jiangbo Qian Yihong Dong Yu Xin Huahui Chen |
spellingShingle |
Yanduo Ren Jiangbo Qian Yihong Dong Yu Xin Huahui Chen AVBH: Asymmetric Learning to Hash with Variable Bit Encoding Scientific Programming |
author_facet |
Yanduo Ren Jiangbo Qian Yihong Dong Yu Xin Huahui Chen |
author_sort |
Yanduo Ren |
title |
AVBH: Asymmetric Learning to Hash with Variable Bit Encoding |
title_short |
AVBH: Asymmetric Learning to Hash with Variable Bit Encoding |
title_full |
AVBH: Asymmetric Learning to Hash with Variable Bit Encoding |
title_fullStr |
AVBH: Asymmetric Learning to Hash with Variable Bit Encoding |
title_full_unstemmed |
AVBH: Asymmetric Learning to Hash with Variable Bit Encoding |
title_sort |
avbh: asymmetric learning to hash with variable bit encoding |
publisher |
Hindawi Limited |
series |
Scientific Programming |
issn |
1058-9244 1875-919X |
publishDate |
2020-01-01 |
description |
Nearest neighbour search (NNS) is the core of large data retrieval. Learning to hash is an effective way to solve the problems by representing high-dimensional data into a compact binary code. However, existing learning to hash methods needs long bit encoding to ensure the accuracy of query, and long bit encoding brings large cost of storage, which severely restricts the long bit encoding in the application of big data. An asymmetric learning to hash with variable bit encoding algorithm (AVBH) is proposed to solve the problem. The AVBH hash algorithm uses two types of hash mapping functions to encode the dataset and the query set into different length bits. For datasets, the hash code frequencies of datasets after random Fourier feature encoding are statistically analysed. The hash code with high frequency is compressed into a longer coding representation, and the hash code with low frequency is compressed into a shorter coding representation. The query point is quantized to a long bit hash code and compared with the same length cascade concatenated data point. Experiments on public datasets show that the proposed algorithm effectively reduces the cost of storage and improves the accuracy of query. |
url |
http://dx.doi.org/10.1155/2020/2424381 |
work_keys_str_mv |
AT yanduoren avbhasymmetriclearningtohashwithvariablebitencoding AT jiangboqian avbhasymmetriclearningtohashwithvariablebitencoding AT yihongdong avbhasymmetriclearningtohashwithvariablebitencoding AT yuxin avbhasymmetriclearningtohashwithvariablebitencoding AT huahuichen avbhasymmetriclearningtohashwithvariablebitencoding |
_version_ |
1721338878777884672 |