Aspiration noise during phonation : synthesis, analysis, and pitch-scale modification

Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2006. === Includes bibliographical references (p. 139-145). === The current study investigates the synthesis and analysis of aspiration noise in synthesized and spoken vowels. Based on the lin...

Full description

Bibliographic Details
Main Author: Mehta, Daryush (Daryush Dinyar)
Other Authors: Thomas F. Quatieri.
Format: Others
Language:English
Published: Massachusetts Institute of Technology 2009
Subjects:
Online Access:http://hdl.handle.net/1721.1/47819
id ndltd-MIT-oai-dspace.mit.edu-1721.1-47819
record_format oai_dc
spelling ndltd-MIT-oai-dspace.mit.edu-1721.1-478192019-05-02T15:46:47Z Aspiration noise during phonation : synthesis, analysis, and pitch-scale modification Mehta, Daryush (Daryush Dinyar) Thomas F. Quatieri. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2006. Includes bibliographical references (p. 139-145). The current study investigates the synthesis and analysis of aspiration noise in synthesized and spoken vowels. Based on the linear source-filter model of speech production, we implement a vowel synthesizer in which the aspiration noise source is temporally modulated by the periodic source waveform. Modulations in the noise source waveform and their synchrony with the periodic source are shown to be salient for natural-sounding vowel synthesis. After developing the synthesis framework, we research past approaches to separate the two additive components of the model. A challenge for analysis based on this model is the accurate estimation of the aspiration noise component that contains energy across the frequency spectrum and temporal characteristics due to modulations in the noise source. Spectral harmonic/noise component analysis of spoken vowels shows evidence of noise modulations with peaks in the estimated noise source component synchronous with both the open phase of the periodic source and with time instants of glottal closure. Inspired by this observation of natural modulations in the aspiration noise source, we develop an alternate approach to the speech signal processing aim of accurate pitch-scale modification. The proposed strategy takes a dual processing approach, in which the periodic and noise components of the speech signal are separately analyzed, modified, and re-synthesized. The periodic component is modified using our implementation of time-domain pitch-synchronous overlap-add, and the noise component is handled by modifying characteristics of its source waveform. (cont.) Since we have modeled an inherent coupling between the original periodic and aspiration noise sources, the modification algorithm is designed to preserve the synchrony between temporal modulations of the two sources. The reconstructed modified signal is perceived to be natural-sounding and generally reduces artifacts that are typically heard in current modification techniques. by Daryush Mehta. S.M. 2009-10-01T15:46:52Z 2009-10-01T15:46:52Z 2006 2006 Thesis http://hdl.handle.net/1721.1/47819 429466806 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 145 p. application/pdf Massachusetts Institute of Technology
collection NDLTD
language English
format Others
sources NDLTD
topic Electrical Engineering and Computer Science.
spellingShingle Electrical Engineering and Computer Science.
Mehta, Daryush (Daryush Dinyar)
Aspiration noise during phonation : synthesis, analysis, and pitch-scale modification
description Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2006. === Includes bibliographical references (p. 139-145). === The current study investigates the synthesis and analysis of aspiration noise in synthesized and spoken vowels. Based on the linear source-filter model of speech production, we implement a vowel synthesizer in which the aspiration noise source is temporally modulated by the periodic source waveform. Modulations in the noise source waveform and their synchrony with the periodic source are shown to be salient for natural-sounding vowel synthesis. After developing the synthesis framework, we research past approaches to separate the two additive components of the model. A challenge for analysis based on this model is the accurate estimation of the aspiration noise component that contains energy across the frequency spectrum and temporal characteristics due to modulations in the noise source. Spectral harmonic/noise component analysis of spoken vowels shows evidence of noise modulations with peaks in the estimated noise source component synchronous with both the open phase of the periodic source and with time instants of glottal closure. Inspired by this observation of natural modulations in the aspiration noise source, we develop an alternate approach to the speech signal processing aim of accurate pitch-scale modification. The proposed strategy takes a dual processing approach, in which the periodic and noise components of the speech signal are separately analyzed, modified, and re-synthesized. The periodic component is modified using our implementation of time-domain pitch-synchronous overlap-add, and the noise component is handled by modifying characteristics of its source waveform. === (cont.) Since we have modeled an inherent coupling between the original periodic and aspiration noise sources, the modification algorithm is designed to preserve the synchrony between temporal modulations of the two sources. The reconstructed modified signal is perceived to be natural-sounding and generally reduces artifacts that are typically heard in current modification techniques. === by Daryush Mehta. === S.M.
author2 Thomas F. Quatieri.
author_facet Thomas F. Quatieri.
Mehta, Daryush (Daryush Dinyar)
author Mehta, Daryush (Daryush Dinyar)
author_sort Mehta, Daryush (Daryush Dinyar)
title Aspiration noise during phonation : synthesis, analysis, and pitch-scale modification
title_short Aspiration noise during phonation : synthesis, analysis, and pitch-scale modification
title_full Aspiration noise during phonation : synthesis, analysis, and pitch-scale modification
title_fullStr Aspiration noise during phonation : synthesis, analysis, and pitch-scale modification
title_full_unstemmed Aspiration noise during phonation : synthesis, analysis, and pitch-scale modification
title_sort aspiration noise during phonation : synthesis, analysis, and pitch-scale modification
publisher Massachusetts Institute of Technology
publishDate 2009
url http://hdl.handle.net/1721.1/47819
work_keys_str_mv AT mehtadaryushdaryushdinyar aspirationnoiseduringphonationsynthesisanalysisandpitchscalemodification
_version_ 1719028421819891712