A Study on Minimum Phone Error Discriminative Training for Mandarin Chinese Speaker Adaptation
碩士 === 國立清華大學 === 資訊系統與應用研究所 === 95 === In order to decrease the error rate of speech recognition, speaker adaptation techniques are often used to adjust speaker-dependent acoustic models. MLLR (Maximum Likelihood Linear Regression) and MAP (Maximum a Posteriori) are two of the most popular techniqu...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2007
|
Online Access: | http://ndltd.ncl.edu.tw/handle/21269686851264592834 |
id |
ndltd-TW-095NTHU5394012 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-095NTHU53940122015-10-13T16:51:13Z http://ndltd.ncl.edu.tw/handle/21269686851264592834 A Study on Minimum Phone Error Discriminative Training for Mandarin Chinese Speaker Adaptation 最小化音素錯誤鑑別式訓練法則應用於華語語者調適之研究 Niu Hsueh-Wen 牛學文 碩士 國立清華大學 資訊系統與應用研究所 95 In order to decrease the error rate of speech recognition, speaker adaptation techniques are often used to adjust speaker-dependent acoustic models. MLLR (Maximum Likelihood Linear Regression) and MAP (Maximum a Posteriori) are two of the most popular techniques in recent years. MLLR uses the technique of regression trees. It calculates the transform matrix for each leaf node of the tree. This makes it possible to use fewer sentences to decrease the error rate of HMM-based speech recognition. However, while we examined the recognition result, we found that although the overall error rate decreased, but the error rate of certain confusable phones was higher. In order to solve this problem, we propose the use MPE (Minimum Phone Error Discriminative Training) to solve this problem. We use the same corpus as the one in MLLR adaptation, and use MPE to make further adjustment to acoustic models which have been adapted by MLLR. Besides, we tested several methods such as adjusting I-smoothing factors or phone lattices to obtain finer result. Besides, we also introduced a new approach to reduce the computation time of both the lattice construction and the MPE- weight calculation, all based on a better use of n-best recognition (3.3.3). Furthermore, we proposed a new method to combine the statistic result of regression trees and I-smoothing factor based on the observation result of chapter 2.1.3. Experiment results show that it can further reduce the error rate. Jyh-Shing Jang 張智星 2007 學位論文 ; thesis 50 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立清華大學 === 資訊系統與應用研究所 === 95 === In order to decrease the error rate of speech recognition, speaker adaptation techniques are often used to adjust speaker-dependent acoustic models. MLLR (Maximum Likelihood Linear Regression) and MAP (Maximum a Posteriori) are two of the most popular techniques in recent years. MLLR uses the technique of regression trees. It calculates the transform matrix for each leaf node of the tree. This makes it possible to use fewer sentences to decrease the error rate of HMM-based speech recognition. However, while we examined the recognition result, we found that although the overall error rate decreased, but the error rate of certain confusable phones was higher.
In order to solve this problem, we propose the use MPE (Minimum Phone Error Discriminative Training) to solve this problem. We use the same corpus as the one in MLLR adaptation, and use MPE to make further adjustment to acoustic models which have been adapted by MLLR. Besides, we tested several methods such as adjusting I-smoothing factors or phone lattices to obtain finer result. Besides, we also introduced a new approach to reduce the computation time of both the lattice construction and the MPE- weight calculation, all based on a better use of n-best recognition (3.3.3).
Furthermore, we proposed a new method to combine the statistic result of regression trees and I-smoothing factor based on the observation result of chapter 2.1.3. Experiment results show that it can further reduce the error rate.
|
author2 |
Jyh-Shing Jang |
author_facet |
Jyh-Shing Jang Niu Hsueh-Wen 牛學文 |
author |
Niu Hsueh-Wen 牛學文 |
spellingShingle |
Niu Hsueh-Wen 牛學文 A Study on Minimum Phone Error Discriminative Training for Mandarin Chinese Speaker Adaptation |
author_sort |
Niu Hsueh-Wen |
title |
A Study on Minimum Phone Error Discriminative Training for Mandarin Chinese Speaker Adaptation |
title_short |
A Study on Minimum Phone Error Discriminative Training for Mandarin Chinese Speaker Adaptation |
title_full |
A Study on Minimum Phone Error Discriminative Training for Mandarin Chinese Speaker Adaptation |
title_fullStr |
A Study on Minimum Phone Error Discriminative Training for Mandarin Chinese Speaker Adaptation |
title_full_unstemmed |
A Study on Minimum Phone Error Discriminative Training for Mandarin Chinese Speaker Adaptation |
title_sort |
study on minimum phone error discriminative training for mandarin chinese speaker adaptation |
publishDate |
2007 |
url |
http://ndltd.ncl.edu.tw/handle/21269686851264592834 |
work_keys_str_mv |
AT niuhsuehwen astudyonminimumphoneerrordiscriminativetrainingformandarinchinesespeakeradaptation AT niúxuéwén astudyonminimumphoneerrordiscriminativetrainingformandarinchinesespeakeradaptation AT niuhsuehwen zuìxiǎohuàyīnsùcuòwùjiànbiéshìxùnliànfǎzéyīngyòngyúhuáyǔyǔzhědiàoshìzhīyánjiū AT niúxuéwén zuìxiǎohuàyīnsùcuòwùjiànbiéshìxùnliànfǎzéyīngyòngyúhuáyǔyǔzhědiàoshìzhīyánjiū AT niuhsuehwen studyonminimumphoneerrordiscriminativetrainingformandarinchinesespeakeradaptation AT niúxuéwén studyonminimumphoneerrordiscriminativetrainingformandarinchinesespeakeradaptation |
_version_ |
1717775399084097536 |