Bird songs recognition using two-dimensional Mel-scale frequency cepstral coefficients
碩士 === 中華大學 === 資訊工程學系(所) === 94 === We propose a method to automatically identify birds from their sounds in this paper. First, each syllable corresponding to a piece of vocalization is segmented. The average LPCC (ALPCC), average MFCC (AMFCC), Static MFCC (SMFCC), Two-dimensional MFCC (TDMFCC), Dy...
Main Author: | |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2006
|
Online Access: | http://ndltd.ncl.edu.tw/handle/38302762655714685237 |
id |
ndltd-TW-094CHPI5392003 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-094CHPI53920032015-10-13T14:00:53Z http://ndltd.ncl.edu.tw/handle/38302762655714685237 Bird songs recognition using two-dimensional Mel-scale frequency cepstral coefficients 運用二維梅爾倒頻譜係數於鳥類鳴叫聲之辨識 林士棻 碩士 中華大學 資訊工程學系(所) 94 We propose a method to automatically identify birds from their sounds in this paper. First, each syllable corresponding to a piece of vocalization is segmented. The average LPCC (ALPCC), average MFCC (AMFCC), Static MFCC (SMFCC), Two-dimensional MFCC (TDMFCC), Dynamic two-dimensional MFCC (DTDMFCC) and TDMFCC+DTDMFCC over all frames in a syllable are calculated as the vocalization features. Linear discriminant analysis (LDA) is exploited to increase the classification accuracy at a lower dimensional feature vector space. A clustering algorithm, called progressive constructive clustering (PCC) algorithm, is used to divide the feature vectors which were computed from the same bird species into several subclasses. In our experiments, TDMFCC+DTDMFCC can achieve average classification accuracy 90% and 89% for 420 bird species and 561 bird species. 李建興 2006 學位論文 ; thesis 95 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 中華大學 === 資訊工程學系(所) === 94 === We propose a method to automatically identify birds from their sounds in this paper. First, each syllable corresponding to a piece of vocalization is segmented. The average LPCC (ALPCC), average MFCC (AMFCC), Static MFCC (SMFCC), Two-dimensional MFCC (TDMFCC), Dynamic two-dimensional MFCC (DTDMFCC) and TDMFCC+DTDMFCC over all frames in a syllable are calculated as the vocalization features. Linear discriminant analysis (LDA) is exploited to increase the classification accuracy at a lower dimensional feature vector space. A clustering algorithm, called progressive constructive clustering (PCC) algorithm, is used to divide the feature vectors which were computed from the same bird species into several subclasses. In our experiments, TDMFCC+DTDMFCC can achieve average classification accuracy 90% and 89% for 420 bird species and 561 bird species.
|
author2 |
李建興 |
author_facet |
李建興 林士棻 |
author |
林士棻 |
spellingShingle |
林士棻 Bird songs recognition using two-dimensional Mel-scale frequency cepstral coefficients |
author_sort |
林士棻 |
title |
Bird songs recognition using two-dimensional Mel-scale frequency cepstral coefficients |
title_short |
Bird songs recognition using two-dimensional Mel-scale frequency cepstral coefficients |
title_full |
Bird songs recognition using two-dimensional Mel-scale frequency cepstral coefficients |
title_fullStr |
Bird songs recognition using two-dimensional Mel-scale frequency cepstral coefficients |
title_full_unstemmed |
Bird songs recognition using two-dimensional Mel-scale frequency cepstral coefficients |
title_sort |
bird songs recognition using two-dimensional mel-scale frequency cepstral coefficients |
publishDate |
2006 |
url |
http://ndltd.ncl.edu.tw/handle/38302762655714685237 |
work_keys_str_mv |
AT línshìfēn birdsongsrecognitionusingtwodimensionalmelscalefrequencycepstralcoefficients AT línshìfēn yùnyòngèrwéiméiěrdàopínpǔxìshùyúniǎolèimíngjiàoshēngzhībiànshí |
_version_ |
1717747192500846592 |