Improved hidden Markov model partial tracking for additive synthesis using time-frequency analysis

Additive synthesis models are popular for musical applications because they offer precise control over the temporal-spectral evolution of sound. The main difficulty in additive modelling lies in the estimation and tracking of model parameters for sounds with time-varying frequency content (this is...

Full description

Bibliographic Details
Main Author:	Kereliuk, Corey
Other Authors:	Philippe Depalle (Internal/Supervisor)
Format:	Others
Language:	en
Published:	McGill University 2008
Subjects:	Engineering - Electronics and Electrical
Online Access:	http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=21926

id	ndltd-LACETR-oai-collectionscanada.gc.ca-QMM.21926
record_format	oai_dc
spelling	ndltd-LACETR-oai-collectionscanada.gc.ca-QMM.219262014-02-13T03:56:53ZImproved hidden Markov model partial tracking for additive synthesis using time-frequency analysisKereliuk, CoreyEngineering - Electronics and ElectricalAdditive synthesis models are popular for musical applications because they offer precise control over the temporal-spectral evolution of sound. The main difficulty in additive modelling lies in the estimation and tracking of model parameters for sounds with time-varying frequency content (this is especially common in music). In the work presented here, a modification is proposed to the hidden Markov model developed by Depalle et al. which improves the estimation and tracking of partial frequencies for additive synthesis. The Wigner-Ville distribution is used in conjunction with the Hough transform in order to re-estimate the frequency and chirp rate from the short-term spectrum. These estimates are then used to formulate a new objective function to score state transitions in the hidden Markov model. The system performance is evaluated using a suite of real and synthetic test signals, and compared to state-of-the-art analysis tools. A clear improvement is demonstrated for highly non-stationary monophonic and polyphonic sounds. Additionally, crossing partials are detected to a limited extent, although further work is still needed to address this problem. This work identifies many strategies to improve additive modelling and suggests several directions for future research.Les modèles de synthèse additifs sont très prisés pour le traitement de sons musicaux car ils permettent un contrôle précis de l'évolution des propriétés spectrales et temporelles du signal sonore. La principale difficulté de ce type de modélisation provient de l'estimation et du suivi temporel des paramètres du modéle--en particulier pour les sons monophoniques à contenu spectral fortement variable ainsi que pour les sons polyphoniques (cas très courants en musique). Cette thèse s'attache à améliorer l'estimation et le suivi de partiels par modèle de Markov caché initialement proposé par Depalle et al.. La transformée de Wigner-Ville ainsi que la tranformée de Hough sont considérées de manière à ré-estimer la fréquence ainsi que sa derivée du premier ordre à partir du spectre à court-terme. Ces nouvelles estimations permettent de proposer une nouvelle fonction de score, utilisées dans la description des transitions entres états au sein du modèle de Markov caché. Des études de cas synthétiques et réels démontrent une amélioration des performances par rapport à des chaînes d'analyse répresentant l'état de l'art dans le domaine. L'amélioration la plus conséquente porte sur l'analyse de sons monophoniques hautement non-stationnaires et de sons polyphoniques. De plus, les croisements entre deux partiels sont jusqu'à un certain point detectés par le système. Quelques pistes prospectives sont proposées à la fin du document pour pallier ce difficile problème.McGill UniversityPhilippe Depalle (Internal/Supervisor)2008Electronic Thesis or Dissertationapplication/pdfenElectronically-submitted theses.All items in eScholarship@McGill are protected by copyright with all rights reserved unless otherwise indicated.Master of Arts (Schulich School of Music) http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=21926
collection	NDLTD
language	en
format	Others
sources	NDLTD
topic	Engineering - Electronics and Electrical
spellingShingle	Engineering - Electronics and Electrical Kereliuk, Corey Improved hidden Markov model partial tracking for additive synthesis using time-frequency analysis
description	Additive synthesis models are popular for musical applications because they offer precise control over the temporal-spectral evolution of sound. The main difficulty in additive modelling lies in the estimation and tracking of model parameters for sounds with time-varying frequency content (this is especially common in music). In the work presented here, a modification is proposed to the hidden Markov model developed by Depalle et al. which improves the estimation and tracking of partial frequencies for additive synthesis. The Wigner-Ville distribution is used in conjunction with the Hough transform in order to re-estimate the frequency and chirp rate from the short-term spectrum. These estimates are then used to formulate a new objective function to score state transitions in the hidden Markov model. The system performance is evaluated using a suite of real and synthetic test signals, and compared to state-of-the-art analysis tools. A clear improvement is demonstrated for highly non-stationary monophonic and polyphonic sounds. Additionally, crossing partials are detected to a limited extent, although further work is still needed to address this problem. This work identifies many strategies to improve additive modelling and suggests several directions for future research. === Les modèles de synthèse additifs sont très prisés pour le traitement de sons musicaux car ils permettent un contrôle précis de l'évolution des propriétés spectrales et temporelles du signal sonore. La principale difficulté de ce type de modélisation provient de l'estimation et du suivi temporel des paramètres du modéle--en particulier pour les sons monophoniques à contenu spectral fortement variable ainsi que pour les sons polyphoniques (cas très courants en musique). Cette thèse s'attache à améliorer l'estimation et le suivi de partiels par modèle de Markov caché initialement proposé par Depalle et al.. La transformée de Wigner-Ville ainsi que la tranformée de Hough sont considérées de manière à ré-estimer la fréquence ainsi que sa derivée du premier ordre à partir du spectre à court-terme. Ces nouvelles estimations permettent de proposer une nouvelle fonction de score, utilisées dans la description des transitions entres états au sein du modèle de Markov caché. Des études de cas synthétiques et réels démontrent une amélioration des performances par rapport à des chaînes d'analyse répresentant l'état de l'art dans le domaine. L'amélioration la plus conséquente porte sur l'analyse de sons monophoniques hautement non-stationnaires et de sons polyphoniques. De plus, les croisements entre deux partiels sont jusqu'à un certain point detectés par le système. Quelques pistes prospectives sont proposées à la fin du document pour pallier ce difficile problème.
author2	Philippe Depalle (Internal/Supervisor)
author_facet	Philippe Depalle (Internal/Supervisor) Kereliuk, Corey
author	Kereliuk, Corey
author_sort	Kereliuk, Corey
title	Improved hidden Markov model partial tracking for additive synthesis using time-frequency analysis
title_short	Improved hidden Markov model partial tracking for additive synthesis using time-frequency analysis
title_full	Improved hidden Markov model partial tracking for additive synthesis using time-frequency analysis
title_fullStr	Improved hidden Markov model partial tracking for additive synthesis using time-frequency analysis
title_full_unstemmed	Improved hidden Markov model partial tracking for additive synthesis using time-frequency analysis
title_sort	improved hidden markov model partial tracking for additive synthesis using time-frequency analysis
publisher	McGill University
publishDate	2008
url	http://digitool.Library.McGill.CA:80/R/?func=dbin-jump-full&object_id=21926
work_keys_str_mv	AT kereliukcorey improvedhiddenmarkovmodelpartialtrackingforadditivesynthesisusingtimefrequencyanalysis
_version_	1716641989832736768

Improved hidden Markov model partial tracking for additive synthesis using time-frequency analysis

Similar Items