A Speaker Adaptation Method Based on MFCC Feature Space Coordinate System Mapping

碩士 === 國立臺灣科技大學 === 資訊工程系 === 91 === In this thesis, a speaker adaptation method is developed. This method needs only a small quantity of training utterances because the adaptation mechanism is operated on the level of MFCC feature parameter. First, an individual coordinate system is buil...

Full description

Bibliographic Details
Main Authors: Chun Hsin Wu, 吳俊欣
Other Authors: Hung-yan Gu
Format: Others
Language:zh-TW
Published: 2003
Online Access:http://ndltd.ncl.edu.tw/handle/26805190317076600956
id ndltd-TW-091NTUST392001
record_format oai_dc
spelling ndltd-TW-091NTUST3920012016-06-20T04:16:00Z http://ndltd.ncl.edu.tw/handle/26805190317076600956 A Speaker Adaptation Method Based on MFCC Feature Space Coordinate System Mapping MFCC特徵空間座標系統對映之語者調適方法 Chun Hsin Wu 吳俊欣 碩士 國立臺灣科技大學 資訊工程系 91 In this thesis, a speaker adaptation method is developed. This method needs only a small quantity of training utterances because the adaptation mechanism is operated on the level of MFCC feature parameter. First, an individual coordinate system is built for each new speaker in order that his MFCC feature vectors can be decomposed into coordinate coefficients of the system. Then, the coordinate coefficients are directly mapped as coefficients of the coordinate system for a target person. Even though this mechanism is simple, it can indeed obtain good adaptation performance. To verify the performance of our adaptation method, we have executed several recognition experiments under different conditions. The conditions are for different kinds of vocabularies, including sing-vowel vocabulary, multi-vowel vocabulary, nasal-containing syllable vocabulary and dissyllabic word vocabulary. In speaker non-adapted mode, the original recognition error rates are 30.3%, 20.7%, 38.3% and 21.3% respectively. However, in speaker adapted mode, the error rates are reduced to 3.3%, 9.8%, 22.5% and 12.3% respectively. Hung-yan Gu 古鴻炎 2003 學位論文 ; thesis 0 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺灣科技大學 === 資訊工程系 === 91 === In this thesis, a speaker adaptation method is developed. This method needs only a small quantity of training utterances because the adaptation mechanism is operated on the level of MFCC feature parameter. First, an individual coordinate system is built for each new speaker in order that his MFCC feature vectors can be decomposed into coordinate coefficients of the system. Then, the coordinate coefficients are directly mapped as coefficients of the coordinate system for a target person. Even though this mechanism is simple, it can indeed obtain good adaptation performance. To verify the performance of our adaptation method, we have executed several recognition experiments under different conditions. The conditions are for different kinds of vocabularies, including sing-vowel vocabulary, multi-vowel vocabulary, nasal-containing syllable vocabulary and dissyllabic word vocabulary. In speaker non-adapted mode, the original recognition error rates are 30.3%, 20.7%, 38.3% and 21.3% respectively. However, in speaker adapted mode, the error rates are reduced to 3.3%, 9.8%, 22.5% and 12.3% respectively.
author2 Hung-yan Gu
author_facet Hung-yan Gu
Chun Hsin Wu
吳俊欣
author Chun Hsin Wu
吳俊欣
spellingShingle Chun Hsin Wu
吳俊欣
A Speaker Adaptation Method Based on MFCC Feature Space Coordinate System Mapping
author_sort Chun Hsin Wu
title A Speaker Adaptation Method Based on MFCC Feature Space Coordinate System Mapping
title_short A Speaker Adaptation Method Based on MFCC Feature Space Coordinate System Mapping
title_full A Speaker Adaptation Method Based on MFCC Feature Space Coordinate System Mapping
title_fullStr A Speaker Adaptation Method Based on MFCC Feature Space Coordinate System Mapping
title_full_unstemmed A Speaker Adaptation Method Based on MFCC Feature Space Coordinate System Mapping
title_sort speaker adaptation method based on mfcc feature space coordinate system mapping
publishDate 2003
url http://ndltd.ncl.edu.tw/handle/26805190317076600956
work_keys_str_mv AT chunhsinwu aspeakeradaptationmethodbasedonmfccfeaturespacecoordinatesystemmapping
AT wújùnxīn aspeakeradaptationmethodbasedonmfccfeaturespacecoordinatesystemmapping
AT chunhsinwu mfcctèzhēngkōngjiānzuòbiāoxìtǒngduìyìngzhīyǔzhědiàoshìfāngfǎ
AT wújùnxīn mfcctèzhēngkōngjiānzuòbiāoxìtǒngduìyìngzhīyǔzhědiàoshìfāngfǎ
AT chunhsinwu speakeradaptationmethodbasedonmfccfeaturespacecoordinatesystemmapping
AT wújùnxīn speakeradaptationmethodbasedonmfccfeaturespacecoordinatesystemmapping
_version_ 1718311228511617024