Summary: | 碩士 === 中原大學 === 資訊工程研究所 === 91 === Using speech signal as input to manipulate computer is an important application of speech signal processing research. The system described in our study, is implemented by using Token Passing Model with keyword spotting under Microsoft Visual C++ 6.0 and Windows 2000 operation system.
The system utilize the coefficient of Mel-Frequency Cepstrum as the feature parameter, then use the method of CHMM (Continuous Hidden Markov Model) to establish acoustic model. The 415 syllables in Mandarin are further decomposed into right context dependent sub-syllabic units, which are 113 Right Context Dependent INITIAL and 39 Context Independent FINAL. The INITIAL/FINAL are represented by 3-state/4-state. In addition, build a silence and short pause acoustic model, which are represented by 4-state. Then the system uses Token Passing to build a filler model keyword spotting system, and uses the Utterance Verification technology to verify the utterance correct or incorrect.
Finally, the research develops a speech input interface system and uses hierarchical architecture method to reduce the number of keyword and uses agent of Text To Speech to lead the users. The system also uses positive and negative sentence to promote the detection rate of keyword spotting.
|