A Mandarin Speech Recognition System Using Weighted Finite-State Transducer

碩士 === 國立交通大學 === 電信工程研究所 === 100 === This study focuses on how to use a Weighted Finite-State Transducer (WFST) to construct a Mandarin Speech Recognition System (MSRS). It first introduces algorithms for WFST, as well as different levels of speech model to represent the Finite-State Machine (FSM)...

Full description

Bibliographic Details
Main Authors:	Lin, Ang-Hsing, 林昂星
Other Authors:	Wang, Yih-Ru
Format:	Others
Language:	zh-TW
Published:	2012
Online Access:	http://ndltd.ncl.edu.tw/handle/28065850188969895626

id	ndltd-TW-100NCTU5435115
record_format	oai_dc
spelling	ndltd-TW-100NCTU54351152016-03-28T04:20:39Z http://ndltd.ncl.edu.tw/handle/28065850188969895626 A Mandarin Speech Recognition System Using Weighted Finite-State Transducer 使用有限狀態轉換器之漢語語音辨認系統 Lin, Ang-Hsing 林昂星碩士國立交通大學電信工程研究所 100 This study focuses on how to use a Weighted Finite-State Transducer (WFST) to construct a Mandarin Speech Recognition System (MSRS). It first introduces algorithms for WFST, as well as different levels of speech model to represent the Finite-State Machine (FSM) graph, and integrates into the MSRS. The experimental results indentify the Word Error Rate (WER) at about 55% is related to the appearance of OOV words, and statistics shows that one OOV word results in 2.4 words error averagely in MSRS. According to the statistical results, it shows that the names OOV words accounts for about 30%, and in which three words Chinese names accounts for about 23%, In order to reduce the negative impact of the OOV words results in the MSRS, we introduce a hierarchical language model, training name model to assist lower WER. The test corpus uses for the read-type long sentences TCC300 corpus. The 13.76% WER is obtained by using HTK two-stage recognition, while use of WFST RT=13 can achieve the same WER, the recognition speed is about 15 times faster than the traditional HTK. Besides, we construct OOVs names model in the language model layer and placed in the MSRS, this effectively reduces the WER at about 0.12%. Wang, Yih-Ru 王逸如 2012 學位論文 ; thesis 48 zh-TW
collection	NDLTD
language	zh-TW
format	Others
sources	NDLTD
description	碩士 === 國立交通大學 === 電信工程研究所 === 100 === This study focuses on how to use a Weighted Finite-State Transducer (WFST) to construct a Mandarin Speech Recognition System (MSRS). It first introduces algorithms for WFST, as well as different levels of speech model to represent the Finite-State Machine (FSM) graph, and integrates into the MSRS. The experimental results indentify the Word Error Rate (WER) at about 55% is related to the appearance of OOV words, and statistics shows that one OOV word results in 2.4 words error averagely in MSRS. According to the statistical results, it shows that the names OOV words accounts for about 30%, and in which three words Chinese names accounts for about 23%, In order to reduce the negative impact of the OOV words results in the MSRS, we introduce a hierarchical language model, training name model to assist lower WER. The test corpus uses for the read-type long sentences TCC300 corpus. The 13.76% WER is obtained by using HTK two-stage recognition, while use of WFST RT=13 can achieve the same WER, the recognition speed is about 15 times faster than the traditional HTK. Besides, we construct OOVs names model in the language model layer and placed in the MSRS, this effectively reduces the WER at about 0.12%.
author2	Wang, Yih-Ru
author_facet	Wang, Yih-Ru Lin, Ang-Hsing 林昂星
author	Lin, Ang-Hsing 林昂星
spellingShingle	Lin, Ang-Hsing 林昂星 A Mandarin Speech Recognition System Using Weighted Finite-State Transducer
author_sort	Lin, Ang-Hsing
title	A Mandarin Speech Recognition System Using Weighted Finite-State Transducer
title_short	A Mandarin Speech Recognition System Using Weighted Finite-State Transducer
title_full	A Mandarin Speech Recognition System Using Weighted Finite-State Transducer
title_fullStr	A Mandarin Speech Recognition System Using Weighted Finite-State Transducer
title_full_unstemmed	A Mandarin Speech Recognition System Using Weighted Finite-State Transducer
title_sort	mandarin speech recognition system using weighted finite-state transducer
publishDate	2012
url	http://ndltd.ncl.edu.tw/handle/28065850188969895626
work_keys_str_mv	AT linanghsing amandarinspeechrecognitionsystemusingweightedfinitestatetransducer AT línángxīng amandarinspeechrecognitionsystemusingweightedfinitestatetransducer AT linanghsing shǐyòngyǒuxiànzhuàngtàizhuǎnhuànqìzhīhànyǔyǔyīnbiànrènxìtǒng AT línángxīng shǐyòngyǒuxiànzhuàngtàizhuǎnhuànqìzhīhànyǔyǔyīnbiànrènxìtǒng AT linanghsing mandarinspeechrecognitionsystemusingweightedfinitestatetransducer AT línángxīng mandarinspeechrecognitionsystemusingweightedfinitestatetransducer
_version_	1718213524326449152

A Mandarin Speech Recognition System Using Weighted Finite-State Transducer

Similar Items