Aspiration noise during phonation : synthesis, analysis, and pitch-scale modification
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2006. === Includes bibliographical references (p. 139-145). === The current study investigates the synthesis and analysis of aspiration noise in synthesized and spoken vowels. Based on the lin...
Main Author: | |
---|---|
Other Authors: | |
Format: | Others |
Language: | English |
Published: |
Massachusetts Institute of Technology
2009
|
Subjects: | |
Online Access: | http://hdl.handle.net/1721.1/47819 |
id |
ndltd-MIT-oai-dspace.mit.edu-1721.1-47819 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-MIT-oai-dspace.mit.edu-1721.1-478192019-05-02T15:46:47Z Aspiration noise during phonation : synthesis, analysis, and pitch-scale modification Mehta, Daryush (Daryush Dinyar) Thomas F. Quatieri. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2006. Includes bibliographical references (p. 139-145). The current study investigates the synthesis and analysis of aspiration noise in synthesized and spoken vowels. Based on the linear source-filter model of speech production, we implement a vowel synthesizer in which the aspiration noise source is temporally modulated by the periodic source waveform. Modulations in the noise source waveform and their synchrony with the periodic source are shown to be salient for natural-sounding vowel synthesis. After developing the synthesis framework, we research past approaches to separate the two additive components of the model. A challenge for analysis based on this model is the accurate estimation of the aspiration noise component that contains energy across the frequency spectrum and temporal characteristics due to modulations in the noise source. Spectral harmonic/noise component analysis of spoken vowels shows evidence of noise modulations with peaks in the estimated noise source component synchronous with both the open phase of the periodic source and with time instants of glottal closure. Inspired by this observation of natural modulations in the aspiration noise source, we develop an alternate approach to the speech signal processing aim of accurate pitch-scale modification. The proposed strategy takes a dual processing approach, in which the periodic and noise components of the speech signal are separately analyzed, modified, and re-synthesized. The periodic component is modified using our implementation of time-domain pitch-synchronous overlap-add, and the noise component is handled by modifying characteristics of its source waveform. (cont.) Since we have modeled an inherent coupling between the original periodic and aspiration noise sources, the modification algorithm is designed to preserve the synchrony between temporal modulations of the two sources. The reconstructed modified signal is perceived to be natural-sounding and generally reduces artifacts that are typically heard in current modification techniques. by Daryush Mehta. S.M. 2009-10-01T15:46:52Z 2009-10-01T15:46:52Z 2006 2006 Thesis http://hdl.handle.net/1721.1/47819 429466806 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 145 p. application/pdf Massachusetts Institute of Technology |
collection |
NDLTD |
language |
English |
format |
Others
|
sources |
NDLTD |
topic |
Electrical Engineering and Computer Science. |
spellingShingle |
Electrical Engineering and Computer Science. Mehta, Daryush (Daryush Dinyar) Aspiration noise during phonation : synthesis, analysis, and pitch-scale modification |
description |
Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2006. === Includes bibliographical references (p. 139-145). === The current study investigates the synthesis and analysis of aspiration noise in synthesized and spoken vowels. Based on the linear source-filter model of speech production, we implement a vowel synthesizer in which the aspiration noise source is temporally modulated by the periodic source waveform. Modulations in the noise source waveform and their synchrony with the periodic source are shown to be salient for natural-sounding vowel synthesis. After developing the synthesis framework, we research past approaches to separate the two additive components of the model. A challenge for analysis based on this model is the accurate estimation of the aspiration noise component that contains energy across the frequency spectrum and temporal characteristics due to modulations in the noise source. Spectral harmonic/noise component analysis of spoken vowels shows evidence of noise modulations with peaks in the estimated noise source component synchronous with both the open phase of the periodic source and with time instants of glottal closure. Inspired by this observation of natural modulations in the aspiration noise source, we develop an alternate approach to the speech signal processing aim of accurate pitch-scale modification. The proposed strategy takes a dual processing approach, in which the periodic and noise components of the speech signal are separately analyzed, modified, and re-synthesized. The periodic component is modified using our implementation of time-domain pitch-synchronous overlap-add, and the noise component is handled by modifying characteristics of its source waveform. === (cont.) Since we have modeled an inherent coupling between the original periodic and aspiration noise sources, the modification algorithm is designed to preserve the synchrony between temporal modulations of the two sources. The reconstructed modified signal is perceived to be natural-sounding and generally reduces artifacts that are typically heard in current modification techniques. === by Daryush Mehta. === S.M. |
author2 |
Thomas F. Quatieri. |
author_facet |
Thomas F. Quatieri. Mehta, Daryush (Daryush Dinyar) |
author |
Mehta, Daryush (Daryush Dinyar) |
author_sort |
Mehta, Daryush (Daryush Dinyar) |
title |
Aspiration noise during phonation : synthesis, analysis, and pitch-scale modification |
title_short |
Aspiration noise during phonation : synthesis, analysis, and pitch-scale modification |
title_full |
Aspiration noise during phonation : synthesis, analysis, and pitch-scale modification |
title_fullStr |
Aspiration noise during phonation : synthesis, analysis, and pitch-scale modification |
title_full_unstemmed |
Aspiration noise during phonation : synthesis, analysis, and pitch-scale modification |
title_sort |
aspiration noise during phonation : synthesis, analysis, and pitch-scale modification |
publisher |
Massachusetts Institute of Technology |
publishDate |
2009 |
url |
http://hdl.handle.net/1721.1/47819 |
work_keys_str_mv |
AT mehtadaryushdaryushdinyar aspirationnoiseduringphonationsynthesisanalysisandpitchscalemodification |
_version_ |
1719028421819891712 |