A Speaker Adaptation Method Based on MFCC Feature Space Coordinate System Mapping
碩士 === 國立臺灣科技大學 === 資訊工程系 === 91 === In this thesis, a speaker adaptation method is developed. This method needs only a small quantity of training utterances because the adaptation mechanism is operated on the level of MFCC feature parameter. First, an individual coordinate system is buil...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2003
|
Online Access: | http://ndltd.ncl.edu.tw/handle/26805190317076600956 |
id |
ndltd-TW-091NTUST392001 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-091NTUST3920012016-06-20T04:16:00Z http://ndltd.ncl.edu.tw/handle/26805190317076600956 A Speaker Adaptation Method Based on MFCC Feature Space Coordinate System Mapping MFCC特徵空間座標系統對映之語者調適方法 Chun Hsin Wu 吳俊欣 碩士 國立臺灣科技大學 資訊工程系 91 In this thesis, a speaker adaptation method is developed. This method needs only a small quantity of training utterances because the adaptation mechanism is operated on the level of MFCC feature parameter. First, an individual coordinate system is built for each new speaker in order that his MFCC feature vectors can be decomposed into coordinate coefficients of the system. Then, the coordinate coefficients are directly mapped as coefficients of the coordinate system for a target person. Even though this mechanism is simple, it can indeed obtain good adaptation performance. To verify the performance of our adaptation method, we have executed several recognition experiments under different conditions. The conditions are for different kinds of vocabularies, including sing-vowel vocabulary, multi-vowel vocabulary, nasal-containing syllable vocabulary and dissyllabic word vocabulary. In speaker non-adapted mode, the original recognition error rates are 30.3%, 20.7%, 38.3% and 21.3% respectively. However, in speaker adapted mode, the error rates are reduced to 3.3%, 9.8%, 22.5% and 12.3% respectively. Hung-yan Gu 古鴻炎 2003 學位論文 ; thesis 0 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺灣科技大學 === 資訊工程系 === 91 === In this thesis, a speaker adaptation method is developed. This method needs only a small quantity of training utterances because the adaptation mechanism is operated on the level of MFCC feature parameter. First, an individual coordinate system is built for each new speaker in order that his MFCC feature vectors can be decomposed into coordinate coefficients of the system. Then, the coordinate coefficients are directly mapped as coefficients of the coordinate system for a target person. Even though this mechanism is simple, it can indeed obtain good adaptation performance. To verify the performance of our adaptation method, we have executed several recognition experiments under different conditions. The conditions are for different kinds of vocabularies, including sing-vowel vocabulary, multi-vowel vocabulary, nasal-containing syllable vocabulary and dissyllabic word vocabulary. In speaker non-adapted mode, the original recognition error rates are 30.3%, 20.7%, 38.3% and 21.3% respectively. However, in speaker adapted mode, the error rates are reduced to 3.3%, 9.8%, 22.5% and 12.3% respectively.
|
author2 |
Hung-yan Gu |
author_facet |
Hung-yan Gu Chun Hsin Wu 吳俊欣 |
author |
Chun Hsin Wu 吳俊欣 |
spellingShingle |
Chun Hsin Wu 吳俊欣 A Speaker Adaptation Method Based on MFCC Feature Space Coordinate System Mapping |
author_sort |
Chun Hsin Wu |
title |
A Speaker Adaptation Method Based on MFCC Feature Space Coordinate System Mapping |
title_short |
A Speaker Adaptation Method Based on MFCC Feature Space Coordinate System Mapping |
title_full |
A Speaker Adaptation Method Based on MFCC Feature Space Coordinate System Mapping |
title_fullStr |
A Speaker Adaptation Method Based on MFCC Feature Space Coordinate System Mapping |
title_full_unstemmed |
A Speaker Adaptation Method Based on MFCC Feature Space Coordinate System Mapping |
title_sort |
speaker adaptation method based on mfcc feature space coordinate system mapping |
publishDate |
2003 |
url |
http://ndltd.ncl.edu.tw/handle/26805190317076600956 |
work_keys_str_mv |
AT chunhsinwu aspeakeradaptationmethodbasedonmfccfeaturespacecoordinatesystemmapping AT wújùnxīn aspeakeradaptationmethodbasedonmfccfeaturespacecoordinatesystemmapping AT chunhsinwu mfcctèzhēngkōngjiānzuòbiāoxìtǒngduìyìngzhīyǔzhědiàoshìfāngfǎ AT wújùnxīn mfcctèzhēngkōngjiānzuòbiāoxìtǒngduìyìngzhīyǔzhědiàoshìfāngfǎ AT chunhsinwu speakeradaptationmethodbasedonmfccfeaturespacecoordinatesystemmapping AT wújùnxīn speakeradaptationmethodbasedonmfccfeaturespacecoordinatesystemmapping |
_version_ |
1718311228511617024 |