Summary: | 碩士 === 國立中山大學 === 資訊工程學系研究所 === 101 === In this paper, we are going to introduce a system which provide digit speech recognition
services. This system is built on internet, users can easily utilize the system through the
network. Besides the speech recognition service in our system, we also provide adaptation
function to bring up the Noise-Robust between differences environment. In the case of English
digit recognition, our recognition system can achieve 80% accuracy for a specific
speaker by using a few adaptation. Our system can also be expanded for building program
and other relevant application. We use Sphinx-4 as a speech recognition kernel in our system.
Because Sphinx-4 is a system prepared exclusively for researchers, it is a flexible, modular
and pluggable framework. By the pluggability characteristic of Sphinx-4, we can replace the
dictionary, grammar and acoustic model easily by edit the configuration files. In order to
make sense about choosing acoustic model, training data and adaptation data. We provide
our experiment results on AURORA2, EAT and Android device recording from corpus for
references.
|