Bird songs recognition using two-dimensional Mel-scale frequency cepstral coefficients

碩士 === 中華大學 === 資訊工程學系(所) === 94 === We propose a method to automatically identify birds from their sounds in this paper. First, each syllable corresponding to a piece of vocalization is segmented. The average LPCC (ALPCC), average MFCC (AMFCC), Static MFCC (SMFCC), Two-dimensional MFCC (TDMFCC), Dy...

Full description

Bibliographic Details
Main Author: 林士棻
Other Authors: 李建興
Format: Others
Language:zh-TW
Published: 2006
Online Access:http://ndltd.ncl.edu.tw/handle/38302762655714685237
id ndltd-TW-094CHPI5392003
record_format oai_dc
spelling ndltd-TW-094CHPI53920032015-10-13T14:00:53Z http://ndltd.ncl.edu.tw/handle/38302762655714685237 Bird songs recognition using two-dimensional Mel-scale frequency cepstral coefficients 運用二維梅爾倒頻譜係數於鳥類鳴叫聲之辨識 林士棻 碩士 中華大學 資訊工程學系(所) 94 We propose a method to automatically identify birds from their sounds in this paper. First, each syllable corresponding to a piece of vocalization is segmented. The average LPCC (ALPCC), average MFCC (AMFCC), Static MFCC (SMFCC), Two-dimensional MFCC (TDMFCC), Dynamic two-dimensional MFCC (DTDMFCC) and TDMFCC+DTDMFCC over all frames in a syllable are calculated as the vocalization features. Linear discriminant analysis (LDA) is exploited to increase the classification accuracy at a lower dimensional feature vector space. A clustering algorithm, called progressive constructive clustering (PCC) algorithm, is used to divide the feature vectors which were computed from the same bird species into several subclasses. In our experiments, TDMFCC+DTDMFCC can achieve average classification accuracy 90% and 89% for 420 bird species and 561 bird species. 李建興 2006 學位論文 ; thesis 95 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 中華大學 === 資訊工程學系(所) === 94 === We propose a method to automatically identify birds from their sounds in this paper. First, each syllable corresponding to a piece of vocalization is segmented. The average LPCC (ALPCC), average MFCC (AMFCC), Static MFCC (SMFCC), Two-dimensional MFCC (TDMFCC), Dynamic two-dimensional MFCC (DTDMFCC) and TDMFCC+DTDMFCC over all frames in a syllable are calculated as the vocalization features. Linear discriminant analysis (LDA) is exploited to increase the classification accuracy at a lower dimensional feature vector space. A clustering algorithm, called progressive constructive clustering (PCC) algorithm, is used to divide the feature vectors which were computed from the same bird species into several subclasses. In our experiments, TDMFCC+DTDMFCC can achieve average classification accuracy 90% and 89% for 420 bird species and 561 bird species.
author2 李建興
author_facet 李建興
林士棻
author 林士棻
spellingShingle 林士棻
Bird songs recognition using two-dimensional Mel-scale frequency cepstral coefficients
author_sort 林士棻
title Bird songs recognition using two-dimensional Mel-scale frequency cepstral coefficients
title_short Bird songs recognition using two-dimensional Mel-scale frequency cepstral coefficients
title_full Bird songs recognition using two-dimensional Mel-scale frequency cepstral coefficients
title_fullStr Bird songs recognition using two-dimensional Mel-scale frequency cepstral coefficients
title_full_unstemmed Bird songs recognition using two-dimensional Mel-scale frequency cepstral coefficients
title_sort bird songs recognition using two-dimensional mel-scale frequency cepstral coefficients
publishDate 2006
url http://ndltd.ncl.edu.tw/handle/38302762655714685237
work_keys_str_mv AT línshìfēn birdsongsrecognitionusingtwodimensionalmelscalefrequencycepstralcoefficients
AT línshìfēn yùnyòngèrwéiméiěrdàopínpǔxìshùyúniǎolèimíngjiàoshēngzhībiànshí
_version_ 1717747192500846592