Intelligent Language Modeling and Processing in Mandarin Speech Recognition

碩士 === 國立臺灣大學 === 資訊工程研究所 === 81 === The purpose of this thesis is to develop an intelligent language modeling and processing system in Mandarin speech recognition, so that to get high recognition rate, high speed of processing time and exp...

Full description

Bibliographic Details
Main Authors: Yen-Ju Yang, 楊燕珠
Other Authors: Lin-Shan Lee;Keh-Jiann Chen
Format: Others
Language:zh-TW
Published: 1993
Online Access:http://ndltd.ncl.edu.tw/handle/67737403331273368411
id ndltd-TW-081NTU00392017
record_format oai_dc
spelling ndltd-TW-081NTU003920172016-02-10T04:09:01Z http://ndltd.ncl.edu.tw/handle/67737403331273368411 Intelligent Language Modeling and Processing in Mandarin Speech Recognition 國語語音辨認中智慧型語言模型與處理之研究 Yen-Ju Yang 楊燕珠 碩士 國立臺灣大學 資訊工程研究所 81 The purpose of this thesis is to develop an intelligent language modeling and processing system in Mandarin speech recognition, so that to get high recognition rate, high speed of processing time and experience from tested sentence. In this thesis, the recognition method is based on bigram Markov model under various smoothing techniques and various methods to incorporate unigram Markov model. In order to improve recognition rate, speed and the reliability, the special feature of this thesis is to propose some improve methods : syllable filter, learning, dynamic training and cache. They can make the system more and more intelligent. A language modeling and processing system has been developed on SUN workstation. It has tested some news on newspaper, articles in magazine and short story. It has following advantages : . high correct rate The character recognition system has average recognition rate 87.83%. When the whole text has filled into cache, we test the text second time, a very high recognition rate 98.87% can be obtained under the cache effect. Then we test the text third time someday later, the average recognition rate is 92.12% under learning effect. The word recognition system with very large vocabulary dictionary has average recognition rate 91.31%, 99.71% and 94.27% (calculated by character unit). . high speed The character recognition system needs 0.002 second on average to recognize a character. While the word recognition system needs 0.03 second on averge to recognize a word. . flexible data structure The system can switch between different models easily..module design  The system can use different algorithm in every module easily. Lin-Shan Lee;Keh-Jiann Chen 李琳山;陳克健 1993 學位論文 ; thesis 114 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺灣大學 === 資訊工程研究所 === 81 === The purpose of this thesis is to develop an intelligent language modeling and processing system in Mandarin speech recognition, so that to get high recognition rate, high speed of processing time and experience from tested sentence. In this thesis, the recognition method is based on bigram Markov model under various smoothing techniques and various methods to incorporate unigram Markov model. In order to improve recognition rate, speed and the reliability, the special feature of this thesis is to propose some improve methods : syllable filter, learning, dynamic training and cache. They can make the system more and more intelligent. A language modeling and processing system has been developed on SUN workstation. It has tested some news on newspaper, articles in magazine and short story. It has following advantages : . high correct rate The character recognition system has average recognition rate 87.83%. When the whole text has filled into cache, we test the text second time, a very high recognition rate 98.87% can be obtained under the cache effect. Then we test the text third time someday later, the average recognition rate is 92.12% under learning effect. The word recognition system with very large vocabulary dictionary has average recognition rate 91.31%, 99.71% and 94.27% (calculated by character unit). . high speed The character recognition system needs 0.002 second on average to recognize a character. While the word recognition system needs 0.03 second on averge to recognize a word. . flexible data structure The system can switch between different models easily..module design  The system can use different algorithm in every module easily.
author2 Lin-Shan Lee;Keh-Jiann Chen
author_facet Lin-Shan Lee;Keh-Jiann Chen
Yen-Ju Yang
楊燕珠
author Yen-Ju Yang
楊燕珠
spellingShingle Yen-Ju Yang
楊燕珠
Intelligent Language Modeling and Processing in Mandarin Speech Recognition
author_sort Yen-Ju Yang
title Intelligent Language Modeling and Processing in Mandarin Speech Recognition
title_short Intelligent Language Modeling and Processing in Mandarin Speech Recognition
title_full Intelligent Language Modeling and Processing in Mandarin Speech Recognition
title_fullStr Intelligent Language Modeling and Processing in Mandarin Speech Recognition
title_full_unstemmed Intelligent Language Modeling and Processing in Mandarin Speech Recognition
title_sort intelligent language modeling and processing in mandarin speech recognition
publishDate 1993
url http://ndltd.ncl.edu.tw/handle/67737403331273368411
work_keys_str_mv AT yenjuyang intelligentlanguagemodelingandprocessinginmandarinspeechrecognition
AT yángyànzhū intelligentlanguagemodelingandprocessinginmandarinspeechrecognition
AT yenjuyang guóyǔyǔyīnbiànrènzhōngzhìhuìxíngyǔyánmóxíngyǔchùlǐzhīyánjiū
AT yángyànzhū guóyǔyǔyīnbiànrènzhōngzhìhuìxíngyǔyánmóxíngyǔchùlǐzhīyánjiū
_version_ 1718185269807546368