On the Improvement of Landmark-based Audio Fingerprinting

碩士 === 國立清華大學 === 資訊工程學系 === 104 === Abstract Audio Fingerprint (AFP) is a fast way of music retrieve. It first records a segment of a music through the microphone on a cellphone or tablet device, and sends the recorded segment to the server for AFP computation. This paper describes several audio fi...

Full description

Bibliographic Details
Main Authors:	Zenhon Zhuo, 卓真弘
Other Authors:	Jyh-Shing Roger Jang
Format:	Others
Language:	zh-TW
Published:	2016
Online Access:	http://ndltd.ncl.edu.tw/handle/32785416086001379811

id	ndltd-TW-104NTHU5392087
record_format	oai_dc
spelling	ndltd-TW-104NTHU53920872017-08-27T04:30:16Z http://ndltd.ncl.edu.tw/handle/32785416086001379811 On the Improvement of Landmark-based Audio Fingerprinting 改進以地標為基礎的音訊指紋辨識 Zenhon Zhuo 卓真弘碩士國立清華大學資訊工程學系 104 Abstract Audio Fingerprint (AFP) is a fast way of music retrieve. It first records a segment of a music through the microphone on a cellphone or tablet device, and sends the recorded segment to the server for AFP computation. This paper describes several audio fingerprint methods, and put forward improved methods for preprocessing, and using new feature with machine learning to improve the original re-ranking method used. In the preprocessing phase, we set those value in power spectrum with energy smaller than 0 to 0, because these values are not robust to noise, and a high pass filter in the frequency direction added. To improve the original re-ranking method, we extract the new feature with the method proposed by Haistma [8] and compare the similarity of the new feature. Then, using machine learning methods to find a weighted sum of the simlilarities of the new feature and two original features. The machine learning methods used are Pranking, Ranking SVM and Genetic Algorithm. Genetic Algorithm have the best performance among the three methods. The final result is recognition rate raised from 81.21% to 86.04%. Jyh-Shing Roger Jang Biing-Feng Wang 張智星王炳豐 2016 學位論文 ; thesis 54 zh-TW
collection	NDLTD
language	zh-TW
format	Others
sources	NDLTD
description	碩士 === 國立清華大學 === 資訊工程學系 === 104 === Abstract Audio Fingerprint (AFP) is a fast way of music retrieve. It first records a segment of a music through the microphone on a cellphone or tablet device, and sends the recorded segment to the server for AFP computation. This paper describes several audio fingerprint methods, and put forward improved methods for preprocessing, and using new feature with machine learning to improve the original re-ranking method used. In the preprocessing phase, we set those value in power spectrum with energy smaller than 0 to 0, because these values are not robust to noise, and a high pass filter in the frequency direction added. To improve the original re-ranking method, we extract the new feature with the method proposed by Haistma [8] and compare the similarity of the new feature. Then, using machine learning methods to find a weighted sum of the simlilarities of the new feature and two original features. The machine learning methods used are Pranking, Ranking SVM and Genetic Algorithm. Genetic Algorithm have the best performance among the three methods. The final result is recognition rate raised from 81.21% to 86.04%.
author2	Jyh-Shing Roger Jang
author_facet	Jyh-Shing Roger Jang Zenhon Zhuo 卓真弘
author	Zenhon Zhuo 卓真弘
spellingShingle	Zenhon Zhuo 卓真弘 On the Improvement of Landmark-based Audio Fingerprinting
author_sort	Zenhon Zhuo
title	On the Improvement of Landmark-based Audio Fingerprinting
title_short	On the Improvement of Landmark-based Audio Fingerprinting
title_full	On the Improvement of Landmark-based Audio Fingerprinting
title_fullStr	On the Improvement of Landmark-based Audio Fingerprinting
title_full_unstemmed	On the Improvement of Landmark-based Audio Fingerprinting
title_sort	on the improvement of landmark-based audio fingerprinting
publishDate	2016
url	http://ndltd.ncl.edu.tw/handle/32785416086001379811
work_keys_str_mv	AT zenhonzhuo ontheimprovementoflandmarkbasedaudiofingerprinting AT zhuōzhēnhóng ontheimprovementoflandmarkbasedaudiofingerprinting AT zenhonzhuo gǎijìnyǐdebiāowèijīchǔdeyīnxùnzhǐwénbiànshí AT zhuōzhēnhóng gǎijìnyǐdebiāowèijīchǔdeyīnxùnzhǐwénbiànshí
_version_	1718519378347032576

On the Improvement of Landmark-based Audio Fingerprinting

Similar Items