Lazy Sampling for Weighted MinHash Algorithm

Master's === National Taiwan University === Graduate Institute of Computer Science and Information Engineering === 107 === Computing data similarity is a fundamental problem in data mining and machine learning. However, as data sets grow larger, exact computation becomes time-consuming and impractical. To ameliorate this situation, several locality-sensitive hashing (LSH) t...

Bibliographic Details
Main Authors: Yung-Hsien Chung, 鍾詠先
Other Authors: Pu-Jen Cheng
Format: Others
Language: en_US
Published: 2019
Online Access: http://ndltd.ncl.edu.tw/handle/dy25vv