A Design of Recognition Rate Improving Strategy for Mandarin Speech Recognition System - A Case Study on Address Inputting System and Phrase Recognition System
Master's === National Sun Yat-sen University === Department of Electrical Engineering === 97 === This thesis investigates strategies for improving the recognition rate of a Mandarin speech recognition system. Automatic tone recognition and consonant correction schemes are studied and applied to the Mandarin address inputting system and the Mandarin 2,...
Main Authors: | Wen-kuang Hsieh, 謝文廣 |
---|---|
Other Authors: | Chih-Chien Chen |
Format: | Others |
Language: | zh-TW |
Published: | 2009 |
Online Access: | http://ndltd.ncl.edu.tw/handle/a87d65 |
id |
ndltd-TW-097NSYS5442096 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-097NSYS5442096 2019-05-29T03:42:54Z http://ndltd.ncl.edu.tw/handle/a87d65 A Design of Recognition Rate Improving Strategy for Mandarin Speech Recognition System - A Case Study on Address Inputting System and Phrase Recognition System 中文語音辨識系統增進辨識率之策略研究-以地址系統與二、三、四字詞系統為例 Wen-kuang Hsieh 謝文廣 Master's National Sun Yat-sen University Department of Electrical Engineering 97 Chih-Chien Chen 陳志堅 2009 thesis 57 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others |
sources |
NDLTD |
description |
Master's === National Sun Yat-sen University === Department of Electrical Engineering === 97 === This thesis investigates strategies for improving the recognition rate of a Mandarin speech recognition system. Automatic tone recognition and consonant correction schemes are studied and applied to the Mandarin address inputting system and to the Mandarin 2-, 3-, and 4-word phrase recognition systems. In the automatic tone recognition scheme, the acoustic properties of the four tones in the Mandarin training database are estimated statistically as 4 sets of parameters within 6 minutes. These automatically generated parameters greatly increase tone recognition accuracy while reducing the time spent on manual tone parameter adjustment, which generally takes about 8 hours. In the consonant correction scheme, sub-syllable models are developed to improve consonant recognition accuracy and thereby the overall correct rate for whole Mandarin phrases. Experimental results indicate that, with both schemes applied, the Mandarin address inputting system achieves a correct rate above 90% on 180 thousand place names. Furthermore, the recognition rates of the Mandarin 2-, 3-, and 4-word phrase recognition systems, which cover 116 thousand phrases in total, improve from 77%, 94%, and 97.5% to 85%, 96%, and 98%, respectively.
|
author2 |
Chih-Chien Chen |
author_facet |
Chih-Chien Chen Wen-kuang Hsieh 謝文廣 |
author |
Wen-kuang Hsieh 謝文廣 |
author_sort |
Wen-kuang Hsieh |
title |
A Design of Recognition Rate Improving Strategy for Mandarin Speech Recognition System - A Case Study on Address Inputting System and Phrase Recognition System |
publishDate |
2009 |
url |
http://ndltd.ncl.edu.tw/handle/a87d65 |