Multipitch Tracking by Factorial Hidden Markov Model Using Spectro-Temporal Modulations of Fourier Spectrogram
碩士 === 國立交通大學 === 電信工程研究所 === 102 === In recent years, pitch plays an important role in audio signal processing. Pitch tracking used in a wide range of applications. Single pitch tracking can make the error between the estimated pitch and true pitch within 5% in 90% frames, but there is a lot of roo...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2014
|
Online Access: | http://ndltd.ncl.edu.tw/handle/8d76n5 |
id |
ndltd-TW-102NCTU5435093 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-102NCTU54350932019-05-15T21:50:56Z http://ndltd.ncl.edu.tw/handle/8d76n5 Multipitch Tracking by Factorial Hidden Markov Model Using Spectro-Temporal Modulations of Fourier Spectrogram 以傅立葉轉換之時頻域調變為特徵之因子隱藏式馬可夫模型多音高追蹤法 Hsieh, Kun-Yeh 謝坤燁 碩士 國立交通大學 電信工程研究所 102 In recent years, pitch plays an important role in audio signal processing. Pitch tracking used in a wide range of applications. Single pitch tracking can make the error between the estimated pitch and true pitch within 5% in 90% frames, but there is a lot of room for improvement in multiple pitch tracking. In this thesis, we will apply Robust Algorithm Pitch Tracking (RAPT) to track the single speaker signal and to build up the prior probability and transition probability matrix of each speaker, and then we convert the spectrogram into rate-scale domain by the means which is inspired by cortical stage of auditory perceptual model. We use the value of rate-scale domain as feature vector and model the feature vector using Gaussian mixture models. Then we employ the mixture maximization model to establish the probability model for the feature vector of mixture speech. Finally, a FHMM is applied for tracking pitch over time. In the result of experiment, we found the system using rate-scale as feature vector has much capability of resisting noise than spectrum. Chi, Tai-Shih 冀泰石 2014 學位論文 ; thesis 47 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立交通大學 === 電信工程研究所 === 102 === In recent years, pitch plays an important role in audio signal processing. Pitch tracking used in a wide range of applications. Single pitch tracking can make the error between the estimated pitch and true pitch within 5% in 90% frames, but there is a lot of room for improvement in multiple pitch tracking.
In this thesis, we will apply Robust Algorithm Pitch Tracking (RAPT) to track the single speaker signal and to build up the prior probability and transition probability matrix of each speaker, and then we convert the spectrogram into rate-scale domain by the means which is inspired by cortical stage of auditory perceptual model. We use the value of rate-scale domain as feature vector and model the feature vector using Gaussian mixture models. Then we employ the mixture maximization model to establish the probability model for the feature vector of mixture speech. Finally, a FHMM is applied for tracking pitch over time. In the result of experiment, we found the system using rate-scale as feature vector has much capability of resisting noise than spectrum.
|
author2 |
Chi, Tai-Shih |
author_facet |
Chi, Tai-Shih Hsieh, Kun-Yeh 謝坤燁 |
author |
Hsieh, Kun-Yeh 謝坤燁 |
spellingShingle |
Hsieh, Kun-Yeh 謝坤燁 Multipitch Tracking by Factorial Hidden Markov Model Using Spectro-Temporal Modulations of Fourier Spectrogram |
author_sort |
Hsieh, Kun-Yeh |
title |
Multipitch Tracking by Factorial Hidden Markov Model Using Spectro-Temporal Modulations of Fourier Spectrogram |
title_short |
Multipitch Tracking by Factorial Hidden Markov Model Using Spectro-Temporal Modulations of Fourier Spectrogram |
title_full |
Multipitch Tracking by Factorial Hidden Markov Model Using Spectro-Temporal Modulations of Fourier Spectrogram |
title_fullStr |
Multipitch Tracking by Factorial Hidden Markov Model Using Spectro-Temporal Modulations of Fourier Spectrogram |
title_full_unstemmed |
Multipitch Tracking by Factorial Hidden Markov Model Using Spectro-Temporal Modulations of Fourier Spectrogram |
title_sort |
multipitch tracking by factorial hidden markov model using spectro-temporal modulations of fourier spectrogram |
publishDate |
2014 |
url |
http://ndltd.ncl.edu.tw/handle/8d76n5 |
work_keys_str_mv |
AT hsiehkunyeh multipitchtrackingbyfactorialhiddenmarkovmodelusingspectrotemporalmodulationsoffourierspectrogram AT xièkūnyè multipitchtrackingbyfactorialhiddenmarkovmodelusingspectrotemporalmodulationsoffourierspectrogram AT hsiehkunyeh yǐfùlìyèzhuǎnhuànzhīshípínyùdiàobiànwèitèzhēngzhīyīnziyǐncángshìmǎkěfūmóxíngduōyīngāozhuīzōngfǎ AT xièkūnyè yǐfùlìyèzhuǎnhuànzhīshípínyùdiàobiànwèitèzhēngzhīyīnziyǐncángshìmǎkěfūmóxíngduōyīngāozhuīzōngfǎ |
_version_ |
1719119913243639808 |