A Fast and Robust Audio Fingerprinting Method Using Constant Q Transform and Clustering Strategy
碩士 === 國立中興大學 === 資訊科學與工程學系 === 106 === Audio fingerprints help to identify the audio content from database. Audio fingerprinting is to match an audio recording from audio contents. The approach first calculates the similarity of fingerprints between audio recording and audio contents, and then matc...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2018
|
Online Access: | http://ndltd.ncl.edu.tw/handle/g3jtny |
id |
ndltd-TW-106NCHU5394048 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-106NCHU53940482019-05-16T01:24:30Z http://ndltd.ncl.edu.tw/handle/g3jtny A Fast and Robust Audio Fingerprinting Method Using Constant Q Transform and Clustering Strategy 使用常數Q轉換與分群策略的快速且強健音訊指紋方法之研究 Ya-Ting Tsou 鄒雅婷 碩士 國立中興大學 資訊科學與工程學系 106 Audio fingerprints help to identify the audio content from database. Audio fingerprinting is to match an audio recording from audio contents. The approach first calculates the similarity of fingerprints between audio recording and audio contents, and then matches the audio recording and audio contents by counting the number of similar fingerprints in a time ordering list. To generate audio fingerprints, the algorithm first reads audio signals from file and then transfer the signal into a spectrogram. After that, the algorithm extracts the features from the spectrogram and encodes their position as fingerprints. In this thesis, we propose a fast and robust audio fingerprinting method using constant Q transform (CQT) and a clustering strategy. We use CQT method to generate spectrogram, which can present the intensity of signal more clearly. In audio fingerprinting, it is time-consuming while matching audio recording and audio contents. To accelerate this process, we use a two-step searching algorithm with a Intelligent K-means clustering strategy. Our proposed approach first selects the candidate audio contents, then matches audio recording with these candidates. In addition, our designed algorithm supports GPU acceleration. In our experimental results, we compare our proposed approach with other approaches. Our approach is more accurate in many cases with distortion. On the other hand, our approach is more efficient to find the alignment time. 吳俊霖 2018 學位論文 ; thesis 40 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立中興大學 === 資訊科學與工程學系 === 106 === Audio fingerprints help to identify the audio content from database. Audio fingerprinting is to match an audio recording from audio contents. The approach first calculates the similarity of fingerprints between audio recording and audio contents, and then matches the audio recording and audio contents by counting the number of similar fingerprints in a time ordering list. To generate audio fingerprints, the algorithm first reads audio signals from file and then transfer the signal into a spectrogram. After that, the algorithm extracts the features from the spectrogram and encodes their position as fingerprints.
In this thesis, we propose a fast and robust audio fingerprinting method using constant Q transform (CQT) and a clustering strategy. We use CQT method to generate spectrogram, which can present the intensity of signal more clearly. In audio fingerprinting, it is time-consuming while matching audio recording and audio contents. To accelerate this process, we use a two-step searching algorithm with a Intelligent K-means clustering strategy. Our proposed approach first selects the candidate audio contents, then matches audio recording with these candidates. In addition, our designed algorithm supports GPU acceleration.
In our experimental results, we compare our proposed approach with other approaches. Our approach is more accurate in many cases with distortion. On the other hand, our approach is more efficient to find the alignment time.
|
author2 |
吳俊霖 |
author_facet |
吳俊霖 Ya-Ting Tsou 鄒雅婷 |
author |
Ya-Ting Tsou 鄒雅婷 |
spellingShingle |
Ya-Ting Tsou 鄒雅婷 A Fast and Robust Audio Fingerprinting Method Using Constant Q Transform and Clustering Strategy |
author_sort |
Ya-Ting Tsou |
title |
A Fast and Robust Audio Fingerprinting Method Using Constant Q Transform and Clustering Strategy |
title_short |
A Fast and Robust Audio Fingerprinting Method Using Constant Q Transform and Clustering Strategy |
title_full |
A Fast and Robust Audio Fingerprinting Method Using Constant Q Transform and Clustering Strategy |
title_fullStr |
A Fast and Robust Audio Fingerprinting Method Using Constant Q Transform and Clustering Strategy |
title_full_unstemmed |
A Fast and Robust Audio Fingerprinting Method Using Constant Q Transform and Clustering Strategy |
title_sort |
fast and robust audio fingerprinting method using constant q transform and clustering strategy |
publishDate |
2018 |
url |
http://ndltd.ncl.edu.tw/handle/g3jtny |
work_keys_str_mv |
AT yatingtsou afastandrobustaudiofingerprintingmethodusingconstantqtransformandclusteringstrategy AT zōuyǎtíng afastandrobustaudiofingerprintingmethodusingconstantqtransformandclusteringstrategy AT yatingtsou shǐyòngchángshùqzhuǎnhuànyǔfēnqúncèlüèdekuàisùqiěqiángjiànyīnxùnzhǐwénfāngfǎzhīyánjiū AT zōuyǎtíng shǐyòngchángshùqzhuǎnhuànyǔfēnqúncèlüèdekuàisùqiěqiángjiànyīnxùnzhǐwénfāngfǎzhīyánjiū AT yatingtsou fastandrobustaudiofingerprintingmethodusingconstantqtransformandclusteringstrategy AT zōuyǎtíng fastandrobustaudiofingerprintingmethodusingconstantqtransformandclusteringstrategy |
_version_ |
1719175210787143680 |