Quickly Personalizable Digit Mobile Speech Recognition System Based on Sphinx

碩士 === 國立中山大學 === 資訊工程學系研究所 === 101 === In this paper, we are going to introduce a system which provide digit speech recognition services. This system is built on internet, users can easily utilize the system through the network. Besides the speech recognition service in our system, we also provide...

Full description

Bibliographic Details
Main Authors: Tsung-peng Yen, 顏宗芃
Other Authors: Chia-Ping Chen
Format: Others
Language:zh-TW
Published: 2013
Online Access:http://ndltd.ncl.edu.tw/handle/58419670260226264357
Description
Summary:碩士 === 國立中山大學 === 資訊工程學系研究所 === 101 === In this paper, we are going to introduce a system which provide digit speech recognition services. This system is built on internet, users can easily utilize the system through the network. Besides the speech recognition service in our system, we also provide adaptation function to bring up the Noise-Robust between differences environment. In the case of English digit recognition, our recognition system can achieve 80% accuracy for a specific speaker by using a few adaptation. Our system can also be expanded for building program and other relevant application. We use Sphinx-4 as a speech recognition kernel in our system. Because Sphinx-4 is a system prepared exclusively for researchers, it is a flexible, modular and pluggable framework. By the pluggability characteristic of Sphinx-4, we can replace the dictionary, grammar and acoustic model easily by edit the configuration files. In order to make sense about choosing acoustic model, training data and adaptation data. We provide our experiment results on AURORA2, EAT and Android device recording from corpus for references.