Probabilistic space maps for speech with applications

The objective of the proposed research is to develop a probabilistic model of speech production that exploits the multiplicity of mapping between the vocal tract area functions (VTAF) and speech spectra. Two thrusts are developed. In the first, a latent variable model that captures uncertainty in es...

Full description

Bibliographic Details
Main Author: Kalgaonkar, Kaustubh
Published: Georgia Institute of Technology 2012
Subjects:
Online Access:http://hdl.handle.net/1853/42739
id ndltd-GATECH-oai-smartech.gatech.edu-1853-42739
record_format oai_dc
spelling ndltd-GATECH-oai-smartech.gatech.edu-1853-427392013-01-07T20:38:23ZProbabilistic space maps for speech with applicationsKalgaonkar, KaustubhAutomatic bandwidth expansionProbabilistic space mapsStatistical modelsAcoustic model adaptationSpeech enhancementSpeech perceptionSpeech processing systemsThe objective of the proposed research is to develop a probabilistic model of speech production that exploits the multiplicity of mapping between the vocal tract area functions (VTAF) and speech spectra. Two thrusts are developed. In the first, a latent variable model that captures uncertainty in estimating the VTAF from speech data is investigated. The latent variable model uses this uncertainty to generate many-to-one mapping between observations of the VTAF and speech spectra. The second uses the probabilistic model of speech production to improve the performance of traditional speech algorithms, such as enhancement, acoustic model adaptation, etc. In this thesis, we propose to model the process of speech production with a probability map. This proposed model treats speech production as a probabilistic process with many-to-one mapping between VTAF and speech spectra. The thesis not only outlines a statistical framework to generate and train these probabilistic models from speech, but also demonstrates its power and flexibility with such applications as enhancing speech from both perceptual and recognition perspectives.Georgia Institute of Technology2012-02-17T19:18:27Z2012-02-17T19:18:27Z2011-08-22Dissertationhttp://hdl.handle.net/1853/42739
collection NDLTD
sources NDLTD
topic Automatic bandwidth expansion
Probabilistic space maps
Statistical models
Acoustic model adaptation
Speech enhancement
Speech perception
Speech processing systems
spellingShingle Automatic bandwidth expansion
Probabilistic space maps
Statistical models
Acoustic model adaptation
Speech enhancement
Speech perception
Speech processing systems
Kalgaonkar, Kaustubh
Probabilistic space maps for speech with applications
description The objective of the proposed research is to develop a probabilistic model of speech production that exploits the multiplicity of mapping between the vocal tract area functions (VTAF) and speech spectra. Two thrusts are developed. In the first, a latent variable model that captures uncertainty in estimating the VTAF from speech data is investigated. The latent variable model uses this uncertainty to generate many-to-one mapping between observations of the VTAF and speech spectra. The second uses the probabilistic model of speech production to improve the performance of traditional speech algorithms, such as enhancement, acoustic model adaptation, etc. In this thesis, we propose to model the process of speech production with a probability map. This proposed model treats speech production as a probabilistic process with many-to-one mapping between VTAF and speech spectra. The thesis not only outlines a statistical framework to generate and train these probabilistic models from speech, but also demonstrates its power and flexibility with such applications as enhancing speech from both perceptual and recognition perspectives.
author Kalgaonkar, Kaustubh
author_facet Kalgaonkar, Kaustubh
author_sort Kalgaonkar, Kaustubh
title Probabilistic space maps for speech with applications
title_short Probabilistic space maps for speech with applications
title_full Probabilistic space maps for speech with applications
title_fullStr Probabilistic space maps for speech with applications
title_full_unstemmed Probabilistic space maps for speech with applications
title_sort probabilistic space maps for speech with applications
publisher Georgia Institute of Technology
publishDate 2012
url http://hdl.handle.net/1853/42739
work_keys_str_mv AT kalgaonkarkaustubh probabilisticspacemapsforspeechwithapplications
_version_ 1716475615791546368