A Study of Several Pattern Matching Techniques with Application to Isolated Mandarin Digit Speech Recognition

碩士 === 國立清華大學 === 資訊工程學系 === 91 === Today, mobile phones and personal digital devices are made to be smaller and with more functions in them. In this way, the traditional typing input scheme becomes inconvenient to use. Voice input should be a good resolution. In this thesis,...

Full description

Bibliographic Details
Main Authors: Hsieh Chia-yang, 謝佳揚
Other Authors: Jyh-Shing Roger Jang
Format: Others
Language:zh-TW
Published: 2003
Online Access:http://ndltd.ncl.edu.tw/handle/15283721042529571140
id ndltd-TW-091NTHU0392032
record_format oai_dc
spelling ndltd-TW-091NTHU03920322016-06-22T04:26:24Z http://ndltd.ncl.edu.tw/handle/15283721042529571140 A Study of Several Pattern Matching Techniques with Application to Isolated Mandarin Digit Speech Recognition 樣型比對技術應用於中文數字語音辨認之研究 Hsieh Chia-yang 謝佳揚 碩士 國立清華大學 資訊工程學系 91 Today, mobile phones and personal digital devices are made to be smaller and with more functions in them. In this way, the traditional typing input scheme becomes inconvenient to use. Voice input should be a good resolution. In this thesis, we try to develop a speaker-independent Mandarin digits speech recognition system based on discrete HMM with simple models, low computation, and a high recognition rate. We use several techniques to improve the accuracy, including feature extraction, feature vector quantization, classifier combination, corrective training. In feature extraction part, we use nonuniform frame shifting (NUFS) to increase the weight of beginning parts. In feature vector quantization part, we use separate codebook for each digit. The distance between feature vector and codebook center can be a very good classifier of digit recognition. Furthermore, combining the distance with log probability of DHMM can also increase the accuracy. We also applied corrective training which can correct the model that is classified incorrectly or has a log probability close to the target one. Additionally, we use segmental probability model (SPM) to reduce the computation time. Jyh-Shing Roger Jang 張智星 2003 學位論文 ; thesis 38 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立清華大學 === 資訊工程學系 === 91 === Today, mobile phones and personal digital devices are made to be smaller and with more functions in them. In this way, the traditional typing input scheme becomes inconvenient to use. Voice input should be a good resolution. In this thesis, we try to develop a speaker-independent Mandarin digits speech recognition system based on discrete HMM with simple models, low computation, and a high recognition rate. We use several techniques to improve the accuracy, including feature extraction, feature vector quantization, classifier combination, corrective training. In feature extraction part, we use nonuniform frame shifting (NUFS) to increase the weight of beginning parts. In feature vector quantization part, we use separate codebook for each digit. The distance between feature vector and codebook center can be a very good classifier of digit recognition. Furthermore, combining the distance with log probability of DHMM can also increase the accuracy. We also applied corrective training which can correct the model that is classified incorrectly or has a log probability close to the target one. Additionally, we use segmental probability model (SPM) to reduce the computation time.
author2 Jyh-Shing Roger Jang
author_facet Jyh-Shing Roger Jang
Hsieh Chia-yang
謝佳揚
author Hsieh Chia-yang
謝佳揚
spellingShingle Hsieh Chia-yang
謝佳揚
A Study of Several Pattern Matching Techniques with Application to Isolated Mandarin Digit Speech Recognition
author_sort Hsieh Chia-yang
title A Study of Several Pattern Matching Techniques with Application to Isolated Mandarin Digit Speech Recognition
title_short A Study of Several Pattern Matching Techniques with Application to Isolated Mandarin Digit Speech Recognition
title_full A Study of Several Pattern Matching Techniques with Application to Isolated Mandarin Digit Speech Recognition
title_fullStr A Study of Several Pattern Matching Techniques with Application to Isolated Mandarin Digit Speech Recognition
title_full_unstemmed A Study of Several Pattern Matching Techniques with Application to Isolated Mandarin Digit Speech Recognition
title_sort study of several pattern matching techniques with application to isolated mandarin digit speech recognition
publishDate 2003
url http://ndltd.ncl.edu.tw/handle/15283721042529571140
work_keys_str_mv AT hsiehchiayang astudyofseveralpatternmatchingtechniqueswithapplicationtoisolatedmandarindigitspeechrecognition
AT xièjiāyáng astudyofseveralpatternmatchingtechniqueswithapplicationtoisolatedmandarindigitspeechrecognition
AT hsiehchiayang yàngxíngbǐduìjìshùyīngyòngyúzhōngwénshùzìyǔyīnbiànrènzhīyánjiū
AT xièjiāyáng yàngxíngbǐduìjìshùyīngyòngyúzhōngwénshùzìyǔyīnbiànrènzhīyánjiū
AT hsiehchiayang studyofseveralpatternmatchingtechniqueswithapplicationtoisolatedmandarindigitspeechrecognition
AT xièjiāyáng studyofseveralpatternmatchingtechniqueswithapplicationtoisolatedmandarindigitspeechrecognition
_version_ 1718319342737686528