Probabilistic space maps for speech with applications
The objective of the proposed research is to develop a probabilistic model of speech production that exploits the multiplicity of mapping between the vocal tract area functions (VTAF) and speech spectra. Two thrusts are developed. In the first, a latent variable model that captures uncertainty in es...
Main Author: | |
---|---|
Published: |
Georgia Institute of Technology
2012
|
Subjects: | |
Online Access: | http://hdl.handle.net/1853/42739 |
id |
ndltd-GATECH-oai-smartech.gatech.edu-1853-42739 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-GATECH-oai-smartech.gatech.edu-1853-427392013-01-07T20:38:23ZProbabilistic space maps for speech with applicationsKalgaonkar, KaustubhAutomatic bandwidth expansionProbabilistic space mapsStatistical modelsAcoustic model adaptationSpeech enhancementSpeech perceptionSpeech processing systemsThe objective of the proposed research is to develop a probabilistic model of speech production that exploits the multiplicity of mapping between the vocal tract area functions (VTAF) and speech spectra. Two thrusts are developed. In the first, a latent variable model that captures uncertainty in estimating the VTAF from speech data is investigated. The latent variable model uses this uncertainty to generate many-to-one mapping between observations of the VTAF and speech spectra. The second uses the probabilistic model of speech production to improve the performance of traditional speech algorithms, such as enhancement, acoustic model adaptation, etc. In this thesis, we propose to model the process of speech production with a probability map. This proposed model treats speech production as a probabilistic process with many-to-one mapping between VTAF and speech spectra. The thesis not only outlines a statistical framework to generate and train these probabilistic models from speech, but also demonstrates its power and flexibility with such applications as enhancing speech from both perceptual and recognition perspectives.Georgia Institute of Technology2012-02-17T19:18:27Z2012-02-17T19:18:27Z2011-08-22Dissertationhttp://hdl.handle.net/1853/42739 |
collection |
NDLTD |
sources |
NDLTD |
topic |
Automatic bandwidth expansion Probabilistic space maps Statistical models Acoustic model adaptation Speech enhancement Speech perception Speech processing systems |
spellingShingle |
Automatic bandwidth expansion Probabilistic space maps Statistical models Acoustic model adaptation Speech enhancement Speech perception Speech processing systems Kalgaonkar, Kaustubh Probabilistic space maps for speech with applications |
description |
The objective of the proposed research is to develop a probabilistic model of speech production that exploits the multiplicity of mapping between the vocal tract area functions (VTAF) and speech spectra. Two thrusts are developed. In the first, a latent variable model that captures uncertainty in estimating the VTAF from speech data is investigated. The latent variable model uses this uncertainty to generate many-to-one mapping between observations of the VTAF and speech spectra. The second uses the probabilistic model of speech production to improve the performance of traditional speech algorithms, such as enhancement, acoustic model adaptation, etc.
In this thesis, we propose to model the process of speech production with a probability map. This proposed model treats speech production as a probabilistic process with many-to-one mapping between VTAF and speech spectra. The thesis not only outlines a statistical framework to generate and train these probabilistic models from speech, but also demonstrates its power and flexibility with such applications as enhancing speech from both perceptual and recognition perspectives. |
author |
Kalgaonkar, Kaustubh |
author_facet |
Kalgaonkar, Kaustubh |
author_sort |
Kalgaonkar, Kaustubh |
title |
Probabilistic space maps for speech with applications |
title_short |
Probabilistic space maps for speech with applications |
title_full |
Probabilistic space maps for speech with applications |
title_fullStr |
Probabilistic space maps for speech with applications |
title_full_unstemmed |
Probabilistic space maps for speech with applications |
title_sort |
probabilistic space maps for speech with applications |
publisher |
Georgia Institute of Technology |
publishDate |
2012 |
url |
http://hdl.handle.net/1853/42739 |
work_keys_str_mv |
AT kalgaonkarkaustubh probabilisticspacemapsforspeechwithapplications |
_version_ |
1716475615791546368 |