Intelligent Language Modeling and Processing in Mandarin Speech Recognition

碩士 === 國立臺灣大學 === 資訊工程研究所 === 81 === The purpose of this thesis is to develop an intelligent language modeling and processing system in Mandarin speech recognition, so that to get high recognition rate, high speed of processing time and exp...

Full description

Bibliographic Details
Main Authors:	Yen-Ju Yang, 楊燕珠
Other Authors:	Lin-Shan Lee;Keh-Jiann Chen
Format:	Others
Language:	zh-TW
Published:	1993
Online Access:	http://ndltd.ncl.edu.tw/handle/67737403331273368411

id	ndltd-TW-081NTU00392017
record_format	oai_dc
spelling	ndltd-TW-081NTU003920172016-02-10T04:09:01Z http://ndltd.ncl.edu.tw/handle/67737403331273368411 Intelligent Language Modeling and Processing in Mandarin Speech Recognition 國語語音辨認中智慧型語言模型與處理之研究 Yen-Ju Yang 楊燕珠碩士國立臺灣大學資訊工程研究所 81 The purpose of this thesis is to develop an intelligent language modeling and processing system in Mandarin speech recognition, so that to get high recognition rate, high speed of processing time and experience from tested sentence. In this thesis, the recognition method is based on bigram Markov model under various smoothing techniques and various methods to incorporate unigram Markov model. In order to improve recognition rate, speed and the reliability, the special feature of this thesis is to propose some improve methods : syllable filter, learning, dynamic training and cache. They can make the system more and more intelligent. A language modeling and processing system has been developed on SUN workstation. It has tested some news on newspaper, articles in magazine and short story. It has following advantages : . high correct rate The character recognition system has average recognition rate 87.83%. When the whole text has filled into cache, we test the text second time, a very high recognition rate 98.87% can be obtained under the cache effect. Then we test the text third time someday later, the average recognition rate is 92.12% under learning effect. The word recognition system with very large vocabulary dictionary has average recognition rate 91.31%, 99.71% and 94.27% (calculated by character unit). . high speed The character recognition system needs 0.002 second on average to recognize a character. While the word recognition system needs 0.03 second on averge to recognize a word. . flexible data structure The system can switch between different models easily.．module design　 The system can use different algorithm in every module easily. Lin-Shan Lee;Keh-Jiann Chen 李琳山;陳克健 1993 學位論文 ; thesis 114 zh-TW
collection	NDLTD
language	zh-TW
format	Others
sources	NDLTD
description	碩士 === 國立臺灣大學 === 資訊工程研究所 === 81 === The purpose of this thesis is to develop an intelligent language modeling and processing system in Mandarin speech recognition, so that to get high recognition rate, high speed of processing time and experience from tested sentence. In this thesis, the recognition method is based on bigram Markov model under various smoothing techniques and various methods to incorporate unigram Markov model. In order to improve recognition rate, speed and the reliability, the special feature of this thesis is to propose some improve methods : syllable filter, learning, dynamic training and cache. They can make the system more and more intelligent. A language modeling and processing system has been developed on SUN workstation. It has tested some news on newspaper, articles in magazine and short story. It has following advantages : . high correct rate The character recognition system has average recognition rate 87.83%. When the whole text has filled into cache, we test the text second time, a very high recognition rate 98.87% can be obtained under the cache effect. Then we test the text third time someday later, the average recognition rate is 92.12% under learning effect. The word recognition system with very large vocabulary dictionary has average recognition rate 91.31%, 99.71% and 94.27% (calculated by character unit). . high speed The character recognition system needs 0.002 second on average to recognize a character. While the word recognition system needs 0.03 second on averge to recognize a word. . flexible data structure The system can switch between different models easily.．module design　 The system can use different algorithm in every module easily.
author2	Lin-Shan Lee;Keh-Jiann Chen
author_facet	Lin-Shan Lee;Keh-Jiann Chen Yen-Ju Yang 楊燕珠
author	Yen-Ju Yang 楊燕珠
spellingShingle	Yen-Ju Yang 楊燕珠 Intelligent Language Modeling and Processing in Mandarin Speech Recognition
author_sort	Yen-Ju Yang
title	Intelligent Language Modeling and Processing in Mandarin Speech Recognition
title_short	Intelligent Language Modeling and Processing in Mandarin Speech Recognition
title_full	Intelligent Language Modeling and Processing in Mandarin Speech Recognition
title_fullStr	Intelligent Language Modeling and Processing in Mandarin Speech Recognition
title_full_unstemmed	Intelligent Language Modeling and Processing in Mandarin Speech Recognition
title_sort	intelligent language modeling and processing in mandarin speech recognition
publishDate	1993
url	http://ndltd.ncl.edu.tw/handle/67737403331273368411
work_keys_str_mv	AT yenjuyang intelligentlanguagemodelingandprocessinginmandarinspeechrecognition AT yángyànzhū intelligentlanguagemodelingandprocessinginmandarinspeechrecognition AT yenjuyang guóyǔyǔyīnbiànrènzhōngzhìhuìxíngyǔyánmóxíngyǔchùlǐzhīyánjiū AT yángyànzhū guóyǔyǔyīnbiànrènzhōngzhìhuìxíngyǔyánmóxíngyǔchùlǐzhīyánjiū
_version_	1718185269807546368

Intelligent Language Modeling and Processing in Mandarin Speech Recognition

Similar Items