A Fast and Robust Audio Fingerprinting Method Using Constant Q Transform and Clustering Strategy

Master's === National Chung Hsing University === Department of Computer Science and Engineering === 106 === Audio fingerprints help identify audio content in a database. Audio fingerprinting matches an audio recording against a set of stored audio contents. The approach first calculates the similarity between the fingerprints of the recording and those of the audio contents, and then matc...

Bibliographic Details
Main Authors: Ya-Ting Tsou, 鄒雅婷
Other Authors: 吳俊霖
Format: Others
Language: zh-TW
Published: 2018
Online Access: http://ndltd.ncl.edu.tw/handle/g3jtny
id ndltd-TW-106NCHU5394048
record_format oai_dc
spelling ndltd-TW-106NCHU5394048 2019-05-16T01:24:30Z http://ndltd.ncl.edu.tw/handle/g3jtny A Fast and Robust Audio Fingerprinting Method Using Constant Q Transform and Clustering Strategy 使用常數Q轉換與分群策略的快速且強健音訊指紋方法之研究 Ya-Ting Tsou 鄒雅婷 Master's National Chung Hsing University Department of Computer Science and Engineering 106 吳俊霖 2018 學位論文 ; thesis 40 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description Master's === National Chung Hsing University === Department of Computer Science and Engineering === 106 === Audio fingerprints help identify audio content in a database. Audio fingerprinting matches an audio recording against a set of stored audio contents. The approach first calculates the similarity between the fingerprints of the recording and those of the audio contents, and then matches the recording to the contents by counting the number of similar fingerprints in a time-ordered list. To generate audio fingerprints, the algorithm first reads the audio signal from a file and transforms it into a spectrogram. It then extracts features from the spectrogram and encodes their positions as fingerprints. In this thesis, we propose a fast and robust audio fingerprinting method that uses the constant Q transform (CQT) and a clustering strategy. We use the CQT to generate the spectrogram, which presents the intensity of the signal more clearly. Matching an audio recording against the audio contents is the time-consuming step in audio fingerprinting. To accelerate it, we use a two-step search algorithm with an Intelligent K-means clustering strategy: the approach first selects candidate audio contents and then matches the recording against only these candidates. In addition, the algorithm supports GPU acceleration. In our experiments, we compare the proposed approach with other approaches. Ours is more accurate in many cases involving distortion, and it is also more efficient at finding the alignment time.
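The matching step in the abstract (counting similar fingerprints in time order) can be illustrated with a minimal Python sketch. It is not the thesis's implementation: it assumes each fingerprint is a `(hash, time)` pair and uses a common offset-voting scheme, in which the time difference shared by the most matching hashes gives the alignment time. The names `align`, `identify`, `database`, and `query` are hypothetical, and the CQT feature extraction and the clustering-based candidate selection are not shown.

```python
from collections import Counter


def align(query, reference):
    """Return (votes, offset) for the most common time offset among matching hashes.

    Both arguments are lists of (hash, time) pairs (an assumed representation).
    """
    # Index the reference fingerprints by hash for fast lookup.
    ref_times = {}
    for h, t in reference:
        ref_times.setdefault(h, []).append(t)

    # Every matching hash votes for the offset (reference time - query time).
    offsets = Counter()
    for h, t_query in query:
        for t_ref in ref_times.get(h, []):
            offsets[t_ref - t_query] += 1

    if not offsets:
        return 0, None
    offset, votes = offsets.most_common(1)[0]
    return votes, offset


def identify(query, database):
    """Return (name, votes, offset) of the best-aligned entry in the database."""
    best_name, best_votes, best_offset = None, 0, None
    for name, reference in database.items():
        votes, offset = align(query, reference)
        if votes > best_votes:
            best_name, best_votes, best_offset = name, votes, offset
    return best_name, best_votes, best_offset


# Toy data: the query is an excerpt of "song_a" starting at time 2.
database = {
    "song_a": [(11, 0), (22, 1), (33, 2), (44, 3), (55, 4)],
    "song_b": [(11, 5), (66, 6), (77, 7)],
}
query = [(33, 0), (44, 1), (55, 2)]
print(identify(query, database))
```

In a two-step search of the kind the abstract describes, `identify` would be run only over a short list of candidates chosen in a first step, rather than over the whole database.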
author2 吳俊霖
author_facet 吳俊霖
Ya-Ting Tsou
鄒雅婷
author Ya-Ting Tsou
鄒雅婷
title A Fast and Robust Audio Fingerprinting Method Using Constant Q Transform and Clustering Strategy
publishDate 2018
url http://ndltd.ncl.edu.tw/handle/g3jtny