Objective Assessment of Speech Quality by Perceptual Features

碩士 === 國立交通大學 === 電信工程系所 === 96 === In this study, a joint spectro-temporal auditory model was utilized to assess speech quality objectively. In this model, the first stage is to mimic early cochlear functions of the spectrum estimation and the second stage is to mimic cortical functions of the mult...

Full description

Bibliographic Details
Main Authors: Ting-Yu Yen, 顏廷宇
Other Authors: Tai-Shih Chi
Format: Others
Language:zh-TW
Published: 2008
Online Access:http://ndltd.ncl.edu.tw/handle/40465576549134580524
id ndltd-TW-096NCTU5435120
record_format oai_dc
spelling ndltd-TW-096NCTU54351202015-10-13T13:11:49Z http://ndltd.ncl.edu.tw/handle/40465576549134580524 Objective Assessment of Speech Quality by Perceptual Features 藉由感知特徵對語音品質做客觀的評量 Ting-Yu Yen 顏廷宇 碩士 國立交通大學 電信工程系所 96 In this study, a joint spectro-temporal auditory model was utilized to assess speech quality objectively. In this model, the first stage is to mimic early cochlear functions of the spectrum estimation and the second stage is to mimic cortical functions of the multi-dimensional spectrum analysis. The goal of this study is to predict subjective mean opinion score (MOS). Objective speech quality assessment can be done by two methods:intrusive and non-intrusive. In this study, firstly, we observe and analyze patterns of the clean speech, the noisy speech with different background noise, and the degraded speech through different codecs at two auditory stages. Secondly, we will derive an objective estimate of the MOS from data-driven perceptual parameters which are believed to reflect people’s judgment on speech quality. Four perceptual parameters considered are intelligibility, naturalness, and pitch distortion. Finally, we use multiple regression analysis to combine the relationship between speech quality and these perceptual parameters, and then obtain our predicted MOS. We then demonstrate the MOS can be characterized quickly and reliably by these three perceptual features. Tai-Shih Chi 冀泰石 2008 學位論文 ; thesis 64 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立交通大學 === 電信工程系所 === 96 === In this study, a joint spectro-temporal auditory model was utilized to assess speech quality objectively. In this model, the first stage is to mimic early cochlear functions of the spectrum estimation and the second stage is to mimic cortical functions of the multi-dimensional spectrum analysis. The goal of this study is to predict subjective mean opinion score (MOS). Objective speech quality assessment can be done by two methods:intrusive and non-intrusive. In this study, firstly, we observe and analyze patterns of the clean speech, the noisy speech with different background noise, and the degraded speech through different codecs at two auditory stages. Secondly, we will derive an objective estimate of the MOS from data-driven perceptual parameters which are believed to reflect people’s judgment on speech quality. Four perceptual parameters considered are intelligibility, naturalness, and pitch distortion. Finally, we use multiple regression analysis to combine the relationship between speech quality and these perceptual parameters, and then obtain our predicted MOS. We then demonstrate the MOS can be characterized quickly and reliably by these three perceptual features.
author2 Tai-Shih Chi
author_facet Tai-Shih Chi
Ting-Yu Yen
顏廷宇
author Ting-Yu Yen
顏廷宇
spellingShingle Ting-Yu Yen
顏廷宇
Objective Assessment of Speech Quality by Perceptual Features
author_sort Ting-Yu Yen
title Objective Assessment of Speech Quality by Perceptual Features
title_short Objective Assessment of Speech Quality by Perceptual Features
title_full Objective Assessment of Speech Quality by Perceptual Features
title_fullStr Objective Assessment of Speech Quality by Perceptual Features
title_full_unstemmed Objective Assessment of Speech Quality by Perceptual Features
title_sort objective assessment of speech quality by perceptual features
publishDate 2008
url http://ndltd.ncl.edu.tw/handle/40465576549134580524
work_keys_str_mv AT tingyuyen objectiveassessmentofspeechqualitybyperceptualfeatures
AT yántíngyǔ objectiveassessmentofspeechqualitybyperceptualfeatures
AT tingyuyen jíyóugǎnzhītèzhēngduìyǔyīnpǐnzhìzuòkèguāndepíngliàng
AT yántíngyǔ jíyóugǎnzhītèzhēngduìyǔyīnpǐnzhìzuòkèguāndepíngliàng
_version_ 1717732975756443648