On the Improvement of Landmark-based Audio Fingerprinting
碩士 === 國立清華大學 === 資訊工程學系 === 104 === Abstract Audio Fingerprint (AFP) is a fast way of music retrieve. It first records a segment of a music through the microphone on a cellphone or tablet device, and sends the recorded segment to the server for AFP computation. This paper describes several audio fi...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2016
|
Online Access: | http://ndltd.ncl.edu.tw/handle/32785416086001379811 |
id |
ndltd-TW-104NTHU5392087 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-104NTHU53920872017-08-27T04:30:16Z http://ndltd.ncl.edu.tw/handle/32785416086001379811 On the Improvement of Landmark-based Audio Fingerprinting 改進以地標為基礎的音訊指紋辨識 Zenhon Zhuo 卓真弘 碩士 國立清華大學 資訊工程學系 104 Abstract Audio Fingerprint (AFP) is a fast way of music retrieve. It first records a segment of a music through the microphone on a cellphone or tablet device, and sends the recorded segment to the server for AFP computation. This paper describes several audio fingerprint methods, and put forward improved methods for preprocessing, and using new feature with machine learning to improve the original re-ranking method used. In the preprocessing phase, we set those value in power spectrum with energy smaller than 0 to 0, because these values are not robust to noise, and a high pass filter in the frequency direction added. To improve the original re-ranking method, we extract the new feature with the method proposed by Haistma [8] and compare the similarity of the new feature. Then, using machine learning methods to find a weighted sum of the simlilarities of the new feature and two original features. The machine learning methods used are Pranking, Ranking SVM and Genetic Algorithm. Genetic Algorithm have the best performance among the three methods. The final result is recognition rate raised from 81.21% to 86.04%. Jyh-Shing Roger Jang Biing-Feng Wang 張智星 王炳豐 2016 學位論文 ; thesis 54 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立清華大學 === 資訊工程學系 === 104 === Abstract
Audio Fingerprint (AFP) is a fast way of music retrieve. It first records a segment of a music through the microphone on a cellphone or tablet device, and sends the recorded segment to the server for AFP computation. This paper describes several audio fingerprint methods, and put forward improved methods for preprocessing, and using new feature with machine learning to improve the original re-ranking method used. In the preprocessing phase, we set those value in power spectrum with energy smaller than 0 to 0, because these values are not robust to noise, and a high pass filter in the frequency direction added. To improve the original re-ranking method, we extract the new feature with the method proposed by Haistma [8] and compare the similarity of the new feature. Then, using machine learning methods to find a weighted sum of the simlilarities of the new feature and two original features. The machine learning methods used are Pranking, Ranking SVM and Genetic Algorithm. Genetic Algorithm have the best performance among the three methods. The final result is recognition rate raised from 81.21% to 86.04%.
|
author2 |
Jyh-Shing Roger Jang |
author_facet |
Jyh-Shing Roger Jang Zenhon Zhuo 卓真弘 |
author |
Zenhon Zhuo 卓真弘 |
spellingShingle |
Zenhon Zhuo 卓真弘 On the Improvement of Landmark-based Audio Fingerprinting |
author_sort |
Zenhon Zhuo |
title |
On the Improvement of Landmark-based Audio Fingerprinting |
title_short |
On the Improvement of Landmark-based Audio Fingerprinting |
title_full |
On the Improvement of Landmark-based Audio Fingerprinting |
title_fullStr |
On the Improvement of Landmark-based Audio Fingerprinting |
title_full_unstemmed |
On the Improvement of Landmark-based Audio Fingerprinting |
title_sort |
on the improvement of landmark-based audio fingerprinting |
publishDate |
2016 |
url |
http://ndltd.ncl.edu.tw/handle/32785416086001379811 |
work_keys_str_mv |
AT zenhonzhuo ontheimprovementoflandmarkbasedaudiofingerprinting AT zhuōzhēnhóng ontheimprovementoflandmarkbasedaudiofingerprinting AT zenhonzhuo gǎijìnyǐdebiāowèijīchǔdeyīnxùnzhǐwénbiànshí AT zhuōzhēnhóng gǎijìnyǐdebiāowèijīchǔdeyīnxùnzhǐwénbiànshí |
_version_ |
1718519378347032576 |