Automatic Speech Recognition Using Finite Inductive Sequences
This dissertation addresses the general problem of recognition of acoustic signals which may be derived from speech, sonar, or acoustic phenomena. The specific problem of recognizing speech is the main focus of this research. The intention is to design a recognition system for a definite number of d...
Main Author: | |
---|---|
Other Authors: | |
Format: | Others |
Language: | English |
Published: |
University of North Texas
1996
|
Subjects: | |
Online Access: | https://digital.library.unt.edu/ark:/67531/metadc277749/ |
id |
ndltd-unt.edu-info-ark-67531-metadc277749 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-unt.edu-info-ark-67531-metadc2777492017-03-17T08:40:38Z Automatic Speech Recognition Using Finite Inductive Sequences Cherri, Mona Youssef, 1956- speech recognition computer science linear predictive coding Automatic speech recognition. This dissertation addresses the general problem of recognition of acoustic signals which may be derived from speech, sonar, or acoustic phenomena. The specific problem of recognizing speech is the main focus of this research. The intention is to design a recognition system for a definite number of discrete words. For this purpose specifically, eight isolated words from the T1MIT database are selected. Four medium length words "greasy," "dark," "wash," and "water" are used. In addition, four short words are considered "she," "had," "in," and "all." The recognition system addresses the following issues: filtering or preprocessing, training, and decision-making. The preprocessing phase uses linear predictive coding of order 12. Following the filtering process, a vector quantization method is used to further reduce the input data and generate a finite inductive sequence of symbols representative of each input signal. The sequences generated by the vector quantization process of the same word are factored, and a single ruling or reference template is generated and stored in a codebook. This system introduces a new modeling technique which relies heavily on the basic concept that all finite sequences are finitely inductive. This technique is used in the training stage. In order to accommodate the variabilities in speech, the training is performed casualty, and a large number of training speakers is used from eight different dialect regions. Hence, a speaker independent recognition system is realized. The matching process compares the incoming speech with each of the templates stored, and a closeness ration is computed. A ratio table is generated anH the matching word that corresponds to the smallest ratio (i.e. indicating that the ruling has removed most of the symbols) is selected. Promising results were obtained for isolated words, and the recognition rates ranged between 50% and 100%. University of North Texas Fisher, Paul S. Allen, John Ed, 1937- Yang, Chao-Chih Tate, Stephen B. 1996-08 Thesis or Dissertation viii, 167 leaves : ill. Text call-no: 379 N81d no.4318 untcat: b2019915 local-cont-no: 1002727182-cherri https://digital.library.unt.edu/ark:/67531/metadc277749/ ark: ark:/67531/metadc277749 English Public Copyright Copyright is held by the author, unless otherwise noted. All rights reserved. Cherri, Mona Youssef, 1956- |
collection |
NDLTD |
language |
English |
format |
Others
|
sources |
NDLTD |
topic |
speech recognition computer science linear predictive coding Automatic speech recognition. |
spellingShingle |
speech recognition computer science linear predictive coding Automatic speech recognition. Cherri, Mona Youssef, 1956- Automatic Speech Recognition Using Finite Inductive Sequences |
description |
This dissertation addresses the general problem of recognition of acoustic signals which may be derived from speech, sonar, or acoustic phenomena. The specific problem of recognizing speech is the main focus of this research. The intention is to design a recognition system for a definite number of discrete words. For this purpose specifically, eight isolated words from the T1MIT database are selected. Four medium length words "greasy," "dark," "wash," and "water" are used. In addition, four short words are considered "she," "had," "in," and "all." The recognition system addresses the following issues: filtering or preprocessing, training, and decision-making. The preprocessing phase uses linear predictive coding of order 12. Following the filtering process, a vector quantization method is used to further reduce the input data and generate a finite inductive sequence of symbols representative of each input signal. The sequences generated by the vector quantization process of the same word are factored, and a single ruling or reference template is generated and stored in a codebook. This system introduces a new modeling technique which relies heavily on the basic concept that all finite sequences are finitely inductive. This technique is used in the training stage. In order to accommodate the variabilities in speech, the training is performed casualty, and a large number of training speakers is used from eight different dialect regions. Hence, a speaker independent recognition system is realized. The matching process compares the incoming speech with each of the templates stored, and a closeness ration is computed. A ratio table is generated anH the matching word that corresponds to the smallest ratio (i.e. indicating that the ruling has removed most of the symbols) is selected. Promising results were obtained for isolated words, and the recognition rates ranged between 50% and 100%. |
author2 |
Fisher, Paul S. |
author_facet |
Fisher, Paul S. Cherri, Mona Youssef, 1956- |
author |
Cherri, Mona Youssef, 1956- |
author_sort |
Cherri, Mona Youssef, 1956- |
title |
Automatic Speech Recognition Using Finite Inductive Sequences |
title_short |
Automatic Speech Recognition Using Finite Inductive Sequences |
title_full |
Automatic Speech Recognition Using Finite Inductive Sequences |
title_fullStr |
Automatic Speech Recognition Using Finite Inductive Sequences |
title_full_unstemmed |
Automatic Speech Recognition Using Finite Inductive Sequences |
title_sort |
automatic speech recognition using finite inductive sequences |
publisher |
University of North Texas |
publishDate |
1996 |
url |
https://digital.library.unt.edu/ark:/67531/metadc277749/ |
work_keys_str_mv |
AT cherrimonayoussef1956 automaticspeechrecognitionusingfiniteinductivesequences |
_version_ |
1718431696172351488 |