Speech Analysis and Cognition Using Category-Dependent Features in a Model of the Central Auditory System

It is well known that machines perform far worse than humans in recognizing speech and audio, especially in noisy environments. One method of addressing this issue of robustness is to study physiological models of the human auditory system and to adopt some of its characteristics in computers. As a...

Full description

Bibliographic Details
Main Author: Jeon, Woojay
Format: Others
Language:en_US
Published: Georgia Institute of Technology 2007
Subjects:
Online Access:http://hdl.handle.net/1853/14061
id ndltd-GATECH-oai-smartech.gatech.edu-1853-14061
record_format oai_dc
spelling ndltd-GATECH-oai-smartech.gatech.edu-1853-140612013-01-07T20:16:32ZSpeech Analysis and Cognition Using Category-Dependent Features in a Model of the Central Auditory SystemJeon, WoojaySpeech processingSpeech recognitionFeature selectionPattern recognitionSpeech analysisAuditory modelAutomatic speech recognitionAuditory cortexIt is well known that machines perform far worse than humans in recognizing speech and audio, especially in noisy environments. One method of addressing this issue of robustness is to study physiological models of the human auditory system and to adopt some of its characteristics in computers. As a first step in studying the potential benefits of an elaborate computational model of the primary auditory cortex (A1) in the central auditory system, we qualitatively and quantitatively validate the model under existing speech processing recognition methodology. Next, we develop new insights and ideas on how to interpret the model, and reveal some of the advantages of its dimension-expansion that may be potentially used to improve existing speech processing and recognition methods. This is done by statistically analyzing the neural responses to various classes of speech signals and forming empirical conjectures on how cognitive information is encoded in a category-dependent manner. We also establish a theoretical framework that shows how noise and signal can be separated in the dimension-expanded cortical space. Finally, we develop new feature selection and pattern recognition methods to exploit the category-dependent encoding of noise-robust cognitive information in the cortical response. Category-dependent features are proposed as features that "specialize" in discriminating specific sets of classes, and as a natural way of incorporating them into a Bayesian decision framework, we propose methods to construct hierarchical classifiers that perform decisions in a two-stage process. Phoneme classification tasks using the TIMIT speech database are performed to quantitatively validate all developments in this work, and the results encourage future work in exploiting high-dimensional data with category(or class)-dependent features for improved classification or detection.Georgia Institute of Technology2007-03-27T18:20:04Z2007-03-27T18:20:04Z2006-11-13Dissertation1118983 bytesapplication/pdfhttp://hdl.handle.net/1853/14061en_US
collection NDLTD
language en_US
format Others
sources NDLTD
topic Speech processing
Speech recognition
Feature selection
Pattern recognition
Speech analysis
Auditory model
Automatic speech recognition
Auditory cortex
spellingShingle Speech processing
Speech recognition
Feature selection
Pattern recognition
Speech analysis
Auditory model
Automatic speech recognition
Auditory cortex
Jeon, Woojay
Speech Analysis and Cognition Using Category-Dependent Features in a Model of the Central Auditory System
description It is well known that machines perform far worse than humans in recognizing speech and audio, especially in noisy environments. One method of addressing this issue of robustness is to study physiological models of the human auditory system and to adopt some of its characteristics in computers. As a first step in studying the potential benefits of an elaborate computational model of the primary auditory cortex (A1) in the central auditory system, we qualitatively and quantitatively validate the model under existing speech processing recognition methodology. Next, we develop new insights and ideas on how to interpret the model, and reveal some of the advantages of its dimension-expansion that may be potentially used to improve existing speech processing and recognition methods. This is done by statistically analyzing the neural responses to various classes of speech signals and forming empirical conjectures on how cognitive information is encoded in a category-dependent manner. We also establish a theoretical framework that shows how noise and signal can be separated in the dimension-expanded cortical space. Finally, we develop new feature selection and pattern recognition methods to exploit the category-dependent encoding of noise-robust cognitive information in the cortical response. Category-dependent features are proposed as features that "specialize" in discriminating specific sets of classes, and as a natural way of incorporating them into a Bayesian decision framework, we propose methods to construct hierarchical classifiers that perform decisions in a two-stage process. Phoneme classification tasks using the TIMIT speech database are performed to quantitatively validate all developments in this work, and the results encourage future work in exploiting high-dimensional data with category(or class)-dependent features for improved classification or detection.
author Jeon, Woojay
author_facet Jeon, Woojay
author_sort Jeon, Woojay
title Speech Analysis and Cognition Using Category-Dependent Features in a Model of the Central Auditory System
title_short Speech Analysis and Cognition Using Category-Dependent Features in a Model of the Central Auditory System
title_full Speech Analysis and Cognition Using Category-Dependent Features in a Model of the Central Auditory System
title_fullStr Speech Analysis and Cognition Using Category-Dependent Features in a Model of the Central Auditory System
title_full_unstemmed Speech Analysis and Cognition Using Category-Dependent Features in a Model of the Central Auditory System
title_sort speech analysis and cognition using category-dependent features in a model of the central auditory system
publisher Georgia Institute of Technology
publishDate 2007
url http://hdl.handle.net/1853/14061
work_keys_str_mv AT jeonwoojay speechanalysisandcognitionusingcategorydependentfeaturesinamodelofthecentralauditorysystem
_version_ 1716474571795726336