Large Vocabulary Taiwanese Speech Recognition based on RCD using Acoustic Decision Tree
碩士 === 國立清華大學 === 統計學研究所 === 86 === In this thesis, we use HHM to deal with the problem of large vocabulary recognition task of Taiwanese. The phone units we use are right context dependent (RCD) phonemes. If we just consider the effect of inside syllab...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Online Access: | http://ndltd.ncl.edu.tw/handle/07647714439930874732 |
id |
ndltd-TW-086NTHU0337010 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-086NTHU03370102016-06-29T04:13:30Z http://ndltd.ncl.edu.tw/handle/07647714439930874732 Large Vocabulary Taiwanese Speech Recognition based on RCD using Acoustic Decision Tree 以右連音為單位運用決策分類樹的台語大辭彙語音辨識 Hsieh, Wen-Ping 謝文萍 碩士 國立清華大學 統計學研究所 86 In this thesis, we use HHM to deal with the problem of large vocabulary recognition task of Taiwanese. The phone units we use are right context dependent (RCD) phonemes. If we just consider the effect of inside syllable context dependency, the top 1 recognition accuracy reaches 88%. If we modify our models to include all inter-syllable context dependency, the result of top 1 recognition accuracy increases to 92.11% but the total number of states in our network increases to 4 times. To compensate the insufficient training data due to the large amount of models, we use the method of Acoustic Decision Tree to do state clustering.The total number of states decreases to 3/5, and the recognition rate also increases to 92.87%. Tseng S.T., Chu C.K. 江永進 學位論文 ; thesis 34 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立清華大學 === 統計學研究所 === 86 === In this thesis, we use HHM to deal with the problem of large
vocabulary recognition task of Taiwanese. The phone units we
use are right context dependent (RCD) phonemes. If we just
consider the effect of inside syllable context dependency, the
top 1 recognition accuracy reaches 88%. If we modify our models
to include all inter-syllable context dependency, the result of
top 1 recognition accuracy increases to 92.11% but the total
number of states in our network increases to 4 times. To
compensate the insufficient training data due to the large
amount of models, we use the method of Acoustic Decision Tree to
do state clustering.The total number of states decreases to 3/5,
and the recognition rate also increases to 92.87%.
|
author2 |
Tseng S.T., Chu C.K. |
author_facet |
Tseng S.T., Chu C.K. Hsieh, Wen-Ping 謝文萍 |
author |
Hsieh, Wen-Ping 謝文萍 |
spellingShingle |
Hsieh, Wen-Ping 謝文萍 Large Vocabulary Taiwanese Speech Recognition based on RCD using Acoustic Decision Tree |
author_sort |
Hsieh, Wen-Ping |
title |
Large Vocabulary Taiwanese Speech Recognition based on RCD using Acoustic Decision Tree |
title_short |
Large Vocabulary Taiwanese Speech Recognition based on RCD using Acoustic Decision Tree |
title_full |
Large Vocabulary Taiwanese Speech Recognition based on RCD using Acoustic Decision Tree |
title_fullStr |
Large Vocabulary Taiwanese Speech Recognition based on RCD using Acoustic Decision Tree |
title_full_unstemmed |
Large Vocabulary Taiwanese Speech Recognition based on RCD using Acoustic Decision Tree |
title_sort |
large vocabulary taiwanese speech recognition based on rcd using acoustic decision tree |
url |
http://ndltd.ncl.edu.tw/handle/07647714439930874732 |
work_keys_str_mv |
AT hsiehwenping largevocabularytaiwanesespeechrecognitionbasedonrcdusingacousticdecisiontree AT xièwénpíng largevocabularytaiwanesespeechrecognitionbasedonrcdusingacousticdecisiontree AT hsiehwenping yǐyòuliányīnwèidānwèiyùnyòngjuécèfēnlèishùdetáiyǔdàcíhuìyǔyīnbiànshí AT xièwénpíng yǐyòuliányīnwèidānwèiyùnyòngjuécèfēnlèishùdetáiyǔdàcíhuìyǔyīnbiànshí |
_version_ |
1718326065669079040 |