Multipitch Tracking by Factorial Hidden Markov Model Using Spectro-Temporal Modulations of Fourier Spectrogram

碩士 === 國立交通大學 === 電信工程研究所 === 102 === In recent years, pitch plays an important role in audio signal processing. Pitch tracking used in a wide range of applications. Single pitch tracking can make the error between the estimated pitch and true pitch within 5% in 90% frames, but there is a lot of roo...

Full description

Bibliographic Details
Main Authors: Hsieh, Kun-Yeh, 謝坤燁
Other Authors: Chi, Tai-Shih
Format: Others
Language:zh-TW
Published: 2014
Online Access:http://ndltd.ncl.edu.tw/handle/8d76n5
id ndltd-TW-102NCTU5435093
record_format oai_dc
spelling ndltd-TW-102NCTU54350932019-05-15T21:50:56Z http://ndltd.ncl.edu.tw/handle/8d76n5 Multipitch Tracking by Factorial Hidden Markov Model Using Spectro-Temporal Modulations of Fourier Spectrogram 以傅立葉轉換之時頻域調變為特徵之因子隱藏式馬可夫模型多音高追蹤法 Hsieh, Kun-Yeh 謝坤燁 碩士 國立交通大學 電信工程研究所 102 In recent years, pitch plays an important role in audio signal processing. Pitch tracking used in a wide range of applications. Single pitch tracking can make the error between the estimated pitch and true pitch within 5% in 90% frames, but there is a lot of room for improvement in multiple pitch tracking. In this thesis, we will apply Robust Algorithm Pitch Tracking (RAPT) to track the single speaker signal and to build up the prior probability and transition probability matrix of each speaker, and then we convert the spectrogram into rate-scale domain by the means which is inspired by cortical stage of auditory perceptual model. We use the value of rate-scale domain as feature vector and model the feature vector using Gaussian mixture models. Then we employ the mixture maximization model to establish the probability model for the feature vector of mixture speech. Finally, a FHMM is applied for tracking pitch over time. In the result of experiment, we found the system using rate-scale as feature vector has much capability of resisting noise than spectrum. Chi, Tai-Shih 冀泰石 2014 學位論文 ; thesis 47 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立交通大學 === 電信工程研究所 === 102 === In recent years, pitch plays an important role in audio signal processing. Pitch tracking used in a wide range of applications. Single pitch tracking can make the error between the estimated pitch and true pitch within 5% in 90% frames, but there is a lot of room for improvement in multiple pitch tracking. In this thesis, we will apply Robust Algorithm Pitch Tracking (RAPT) to track the single speaker signal and to build up the prior probability and transition probability matrix of each speaker, and then we convert the spectrogram into rate-scale domain by the means which is inspired by cortical stage of auditory perceptual model. We use the value of rate-scale domain as feature vector and model the feature vector using Gaussian mixture models. Then we employ the mixture maximization model to establish the probability model for the feature vector of mixture speech. Finally, a FHMM is applied for tracking pitch over time. In the result of experiment, we found the system using rate-scale as feature vector has much capability of resisting noise than spectrum.
author2 Chi, Tai-Shih
author_facet Chi, Tai-Shih
Hsieh, Kun-Yeh
謝坤燁
author Hsieh, Kun-Yeh
謝坤燁
spellingShingle Hsieh, Kun-Yeh
謝坤燁
Multipitch Tracking by Factorial Hidden Markov Model Using Spectro-Temporal Modulations of Fourier Spectrogram
author_sort Hsieh, Kun-Yeh
title Multipitch Tracking by Factorial Hidden Markov Model Using Spectro-Temporal Modulations of Fourier Spectrogram
title_short Multipitch Tracking by Factorial Hidden Markov Model Using Spectro-Temporal Modulations of Fourier Spectrogram
title_full Multipitch Tracking by Factorial Hidden Markov Model Using Spectro-Temporal Modulations of Fourier Spectrogram
title_fullStr Multipitch Tracking by Factorial Hidden Markov Model Using Spectro-Temporal Modulations of Fourier Spectrogram
title_full_unstemmed Multipitch Tracking by Factorial Hidden Markov Model Using Spectro-Temporal Modulations of Fourier Spectrogram
title_sort multipitch tracking by factorial hidden markov model using spectro-temporal modulations of fourier spectrogram
publishDate 2014
url http://ndltd.ncl.edu.tw/handle/8d76n5
work_keys_str_mv AT hsiehkunyeh multipitchtrackingbyfactorialhiddenmarkovmodelusingspectrotemporalmodulationsoffourierspectrogram
AT xièkūnyè multipitchtrackingbyfactorialhiddenmarkovmodelusingspectrotemporalmodulationsoffourierspectrogram
AT hsiehkunyeh yǐfùlìyèzhuǎnhuànzhīshípínyùdiàobiànwèitèzhēngzhīyīnziyǐncángshìmǎkěfūmóxíngduōyīngāozhuīzōngfǎ
AT xièkūnyè yǐfùlìyèzhuǎnhuànzhīshípínyùdiàobiànwèitèzhēngzhīyīnziyǐncángshìmǎkěfūmóxíngduōyīngāozhuīzōngfǎ
_version_ 1719119913243639808