On the Improvement of Landmark-based Audio Fingerprinting

碩士 === 國立清華大學 === 資訊工程學系 === 104 === Abstract Audio Fingerprint (AFP) is a fast way of music retrieve. It first records a segment of a music through the microphone on a cellphone or tablet device, and sends the recorded segment to the server for AFP computation. This paper describes several audio fi...

Full description

Bibliographic Details
Main Authors: Zenhon Zhuo, 卓真弘
Other Authors: Jyh-Shing Roger Jang
Format: Others
Language:zh-TW
Published: 2016
Online Access:http://ndltd.ncl.edu.tw/handle/32785416086001379811
id ndltd-TW-104NTHU5392087
record_format oai_dc
spelling ndltd-TW-104NTHU53920872017-08-27T04:30:16Z http://ndltd.ncl.edu.tw/handle/32785416086001379811 On the Improvement of Landmark-based Audio Fingerprinting 改進以地標為基礎的音訊指紋辨識 Zenhon Zhuo 卓真弘 碩士 國立清華大學 資訊工程學系 104 Abstract Audio Fingerprint (AFP) is a fast way of music retrieve. It first records a segment of a music through the microphone on a cellphone or tablet device, and sends the recorded segment to the server for AFP computation. This paper describes several audio fingerprint methods, and put forward improved methods for preprocessing, and using new feature with machine learning to improve the original re-ranking method used. In the preprocessing phase, we set those value in power spectrum with energy smaller than 0 to 0, because these values are not robust to noise, and a high pass filter in the frequency direction added. To improve the original re-ranking method, we extract the new feature with the method proposed by Haistma [8] and compare the similarity of the new feature. Then, using machine learning methods to find a weighted sum of the simlilarities of the new feature and two original features. The machine learning methods used are Pranking, Ranking SVM and Genetic Algorithm. Genetic Algorithm have the best performance among the three methods. The final result is recognition rate raised from 81.21% to 86.04%. Jyh-Shing Roger Jang Biing-Feng Wang 張智星 王炳豐 2016 學位論文 ; thesis 54 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立清華大學 === 資訊工程學系 === 104 === Abstract Audio Fingerprint (AFP) is a fast way of music retrieve. It first records a segment of a music through the microphone on a cellphone or tablet device, and sends the recorded segment to the server for AFP computation. This paper describes several audio fingerprint methods, and put forward improved methods for preprocessing, and using new feature with machine learning to improve the original re-ranking method used. In the preprocessing phase, we set those value in power spectrum with energy smaller than 0 to 0, because these values are not robust to noise, and a high pass filter in the frequency direction added. To improve the original re-ranking method, we extract the new feature with the method proposed by Haistma [8] and compare the similarity of the new feature. Then, using machine learning methods to find a weighted sum of the simlilarities of the new feature and two original features. The machine learning methods used are Pranking, Ranking SVM and Genetic Algorithm. Genetic Algorithm have the best performance among the three methods. The final result is recognition rate raised from 81.21% to 86.04%.
author2 Jyh-Shing Roger Jang
author_facet Jyh-Shing Roger Jang
Zenhon Zhuo
卓真弘
author Zenhon Zhuo
卓真弘
spellingShingle Zenhon Zhuo
卓真弘
On the Improvement of Landmark-based Audio Fingerprinting
author_sort Zenhon Zhuo
title On the Improvement of Landmark-based Audio Fingerprinting
title_short On the Improvement of Landmark-based Audio Fingerprinting
title_full On the Improvement of Landmark-based Audio Fingerprinting
title_fullStr On the Improvement of Landmark-based Audio Fingerprinting
title_full_unstemmed On the Improvement of Landmark-based Audio Fingerprinting
title_sort on the improvement of landmark-based audio fingerprinting
publishDate 2016
url http://ndltd.ncl.edu.tw/handle/32785416086001379811
work_keys_str_mv AT zenhonzhuo ontheimprovementoflandmarkbasedaudiofingerprinting
AT zhuōzhēnhóng ontheimprovementoflandmarkbasedaudiofingerprinting
AT zenhonzhuo gǎijìnyǐdebiāowèijīchǔdeyīnxùnzhǐwénbiànshí
AT zhuōzhēnhóng gǎijìnyǐdebiāowèijīchǔdeyīnxùnzhǐwénbiànshí
_version_ 1718519378347032576