Speech quality evaluation using digital watermarking
Speech quality evaluation is a very important research topic. The Mean Opinion Score (MOS) is reliable but the listening test is very expensive, time consuming, and even impractical for some applications. Objective quality evaluation methods require either the original speech or a complicated comput...
Main Author: | |
---|---|
Format: | Others |
Language: | en |
Published: |
University of Ottawa (Canada)
2013
|
Subjects: | |
Online Access: | http://hdl.handle.net/10393/27115 http://dx.doi.org/10.20381/ruor-11922 |
id |
ndltd-uottawa.ca-oai-ruor.uottawa.ca-10393-27115 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-uottawa.ca-oai-ruor.uottawa.ca-10393-271152018-01-05T19:07:24Z Speech quality evaluation using digital watermarking Cai, Libin Computer Science. Speech quality evaluation is a very important research topic. The Mean Opinion Score (MOS) is reliable but the listening test is very expensive, time consuming, and even impractical for some applications. Objective quality evaluation methods require either the original speech or a complicated computation model, which makes some applications of quality evaluation impossible. Different from the perceptual model used by the Perceptual Evaluation of Speech Quality (PESQ), in this thesis, we propose to use digital audio watermarking to evaluate the quality of speech. Based on quantization, watermark bits are embedded and extracted in the Discrete Wavelet Transform (DWT) domain. By comparing the original and the extracted watermark, we predict the quality of speech that has undergone MP3 compression, Gaussian noise addition, low-pass filtering, or packet loss. Our quality evaluation method does not need the original signal or a computation model. For the quality evaluation, we use the PESQ MOS as a reference. We predict the speech quality from the PCEW (Percentage of Correctly Extracted Watermark bits) based on the mapping between ITU-T P.862 PESQ MOS and the PCEW. To evaluate the performance of our objective quality evaluation method, we introduce the correlation coefficient and residual error to evaluate the correlation between the predicted MOS and the PESQ MOS. The experiments show that the method yields very promising evaluation results which are very close to the results of the PESQ. 2013-11-07T18:13:02Z 2013-11-07T18:13:02Z 2006 2006 Thesis Source: Masters Abstracts International, Volume: 44-06, page: 2834. http://hdl.handle.net/10393/27115 http://dx.doi.org/10.20381/ruor-11922 en 86 p. University of Ottawa (Canada) |
collection |
NDLTD |
language |
en |
format |
Others
|
sources |
NDLTD |
topic |
Computer Science. |
spellingShingle |
Computer Science. Cai, Libin Speech quality evaluation using digital watermarking |
description |
Speech quality evaluation is a very important research topic. The Mean Opinion Score (MOS) is reliable but the listening test is very expensive, time consuming, and even impractical for some applications. Objective quality evaluation methods require either the original speech or a complicated computation model, which makes some applications of quality evaluation impossible.
Different from the perceptual model used by the Perceptual Evaluation of Speech Quality (PESQ), in this thesis, we propose to use digital audio watermarking to evaluate the quality of speech. Based on quantization, watermark bits are embedded and extracted in the Discrete Wavelet Transform (DWT) domain. By comparing the original and the extracted watermark, we predict the quality of speech that has undergone MP3 compression, Gaussian noise addition, low-pass filtering, or packet loss. Our quality evaluation method does not need the original signal or a computation model.
For the quality evaluation, we use the PESQ MOS as a reference. We predict the speech quality from the PCEW (Percentage of Correctly Extracted Watermark bits) based on the mapping between ITU-T P.862 PESQ MOS and the PCEW. To evaluate the performance of our objective quality evaluation method, we introduce the correlation coefficient and residual error to evaluate the correlation between the predicted MOS and the PESQ MOS. The experiments show that the method yields very promising evaluation results which are very close to the results of the PESQ. |
author |
Cai, Libin |
author_facet |
Cai, Libin |
author_sort |
Cai, Libin |
title |
Speech quality evaluation using digital watermarking |
title_short |
Speech quality evaluation using digital watermarking |
title_full |
Speech quality evaluation using digital watermarking |
title_fullStr |
Speech quality evaluation using digital watermarking |
title_full_unstemmed |
Speech quality evaluation using digital watermarking |
title_sort |
speech quality evaluation using digital watermarking |
publisher |
University of Ottawa (Canada) |
publishDate |
2013 |
url |
http://hdl.handle.net/10393/27115 http://dx.doi.org/10.20381/ruor-11922 |
work_keys_str_mv |
AT cailibin speechqualityevaluationusingdigitalwatermarking |
_version_ |
1718602170601832448 |