A Study on Minimum Phone Error Discriminative Training for Mandarin Chinese Speaker Adaptation

碩士 === 國立清華大學 === 資訊系統與應用研究所 === 95 === In order to decrease the error rate of speech recognition, speaker adaptation techniques are often used to adjust speaker-dependent acoustic models. MLLR (Maximum Likelihood Linear Regression) and MAP (Maximum a Posteriori) are two of the most popular techniqu...

Full description

Bibliographic Details
Main Authors: Niu Hsueh-Wen, 牛學文
Other Authors: Jyh-Shing Jang
Format: Others
Language:zh-TW
Published: 2007
Online Access:http://ndltd.ncl.edu.tw/handle/21269686851264592834
id ndltd-TW-095NTHU5394012
record_format oai_dc
spelling ndltd-TW-095NTHU53940122015-10-13T16:51:13Z http://ndltd.ncl.edu.tw/handle/21269686851264592834 A Study on Minimum Phone Error Discriminative Training for Mandarin Chinese Speaker Adaptation 最小化音素錯誤鑑別式訓練法則應用於華語語者調適之研究 Niu Hsueh-Wen 牛學文 碩士 國立清華大學 資訊系統與應用研究所 95 In order to decrease the error rate of speech recognition, speaker adaptation techniques are often used to adjust speaker-dependent acoustic models. MLLR (Maximum Likelihood Linear Regression) and MAP (Maximum a Posteriori) are two of the most popular techniques in recent years. MLLR uses the technique of regression trees. It calculates the transform matrix for each leaf node of the tree. This makes it possible to use fewer sentences to decrease the error rate of HMM-based speech recognition. However, while we examined the recognition result, we found that although the overall error rate decreased, but the error rate of certain confusable phones was higher. In order to solve this problem, we propose the use MPE (Minimum Phone Error Discriminative Training) to solve this problem. We use the same corpus as the one in MLLR adaptation, and use MPE to make further adjustment to acoustic models which have been adapted by MLLR. Besides, we tested several methods such as adjusting I-smoothing factors or phone lattices to obtain finer result. Besides, we also introduced a new approach to reduce the computation time of both the lattice construction and the MPE- weight calculation, all based on a better use of n-best recognition (3.3.3). Furthermore, we proposed a new method to combine the statistic result of regression trees and I-smoothing factor based on the observation result of chapter 2.1.3. Experiment results show that it can further reduce the error rate. Jyh-Shing Jang 張智星 2007 學位論文 ; thesis 50 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立清華大學 === 資訊系統與應用研究所 === 95 === In order to decrease the error rate of speech recognition, speaker adaptation techniques are often used to adjust speaker-dependent acoustic models. MLLR (Maximum Likelihood Linear Regression) and MAP (Maximum a Posteriori) are two of the most popular techniques in recent years. MLLR uses the technique of regression trees. It calculates the transform matrix for each leaf node of the tree. This makes it possible to use fewer sentences to decrease the error rate of HMM-based speech recognition. However, while we examined the recognition result, we found that although the overall error rate decreased, but the error rate of certain confusable phones was higher. In order to solve this problem, we propose the use MPE (Minimum Phone Error Discriminative Training) to solve this problem. We use the same corpus as the one in MLLR adaptation, and use MPE to make further adjustment to acoustic models which have been adapted by MLLR. Besides, we tested several methods such as adjusting I-smoothing factors or phone lattices to obtain finer result. Besides, we also introduced a new approach to reduce the computation time of both the lattice construction and the MPE- weight calculation, all based on a better use of n-best recognition (3.3.3). Furthermore, we proposed a new method to combine the statistic result of regression trees and I-smoothing factor based on the observation result of chapter 2.1.3. Experiment results show that it can further reduce the error rate.
author2 Jyh-Shing Jang
author_facet Jyh-Shing Jang
Niu Hsueh-Wen
牛學文
author Niu Hsueh-Wen
牛學文
spellingShingle Niu Hsueh-Wen
牛學文
A Study on Minimum Phone Error Discriminative Training for Mandarin Chinese Speaker Adaptation
author_sort Niu Hsueh-Wen
title A Study on Minimum Phone Error Discriminative Training for Mandarin Chinese Speaker Adaptation
title_short A Study on Minimum Phone Error Discriminative Training for Mandarin Chinese Speaker Adaptation
title_full A Study on Minimum Phone Error Discriminative Training for Mandarin Chinese Speaker Adaptation
title_fullStr A Study on Minimum Phone Error Discriminative Training for Mandarin Chinese Speaker Adaptation
title_full_unstemmed A Study on Minimum Phone Error Discriminative Training for Mandarin Chinese Speaker Adaptation
title_sort study on minimum phone error discriminative training for mandarin chinese speaker adaptation
publishDate 2007
url http://ndltd.ncl.edu.tw/handle/21269686851264592834
work_keys_str_mv AT niuhsuehwen astudyonminimumphoneerrordiscriminativetrainingformandarinchinesespeakeradaptation
AT niúxuéwén astudyonminimumphoneerrordiscriminativetrainingformandarinchinesespeakeradaptation
AT niuhsuehwen zuìxiǎohuàyīnsùcuòwùjiànbiéshìxùnliànfǎzéyīngyòngyúhuáyǔyǔzhědiàoshìzhīyánjiū
AT niúxuéwén zuìxiǎohuàyīnsùcuòwùjiànbiéshìxùnliànfǎzéyīngyòngyúhuáyǔyǔzhědiàoshìzhīyánjiū
AT niuhsuehwen studyonminimumphoneerrordiscriminativetrainingformandarinchinesespeakeradaptation
AT niúxuéwén studyonminimumphoneerrordiscriminativetrainingformandarinchinesespeakeradaptation
_version_ 1717775399084097536