Speech quality evaluation using digital watermarking

Speech quality evaluation is a very important research topic. The Mean Opinion Score (MOS) is reliable but the listening test is very expensive, time consuming, and even impractical for some applications. Objective quality evaluation methods require either the original speech or a complicated comput...

Full description

Bibliographic Details
Main Author: Cai, Libin
Format: Others
Language:en
Published: University of Ottawa (Canada) 2013
Subjects:
Online Access:http://hdl.handle.net/10393/27115
http://dx.doi.org/10.20381/ruor-11922
id ndltd-uottawa.ca-oai-ruor.uottawa.ca-10393-27115
record_format oai_dc
spelling ndltd-uottawa.ca-oai-ruor.uottawa.ca-10393-271152018-01-05T19:07:24Z Speech quality evaluation using digital watermarking Cai, Libin Computer Science. Speech quality evaluation is a very important research topic. The Mean Opinion Score (MOS) is reliable but the listening test is very expensive, time consuming, and even impractical for some applications. Objective quality evaluation methods require either the original speech or a complicated computation model, which makes some applications of quality evaluation impossible. Different from the perceptual model used by the Perceptual Evaluation of Speech Quality (PESQ), in this thesis, we propose to use digital audio watermarking to evaluate the quality of speech. Based on quantization, watermark bits are embedded and extracted in the Discrete Wavelet Transform (DWT) domain. By comparing the original and the extracted watermark, we predict the quality of speech that has undergone MP3 compression, Gaussian noise addition, low-pass filtering, or packet loss. Our quality evaluation method does not need the original signal or a computation model. For the quality evaluation, we use the PESQ MOS as a reference. We predict the speech quality from the PCEW (Percentage of Correctly Extracted Watermark bits) based on the mapping between ITU-T P.862 PESQ MOS and the PCEW. To evaluate the performance of our objective quality evaluation method, we introduce the correlation coefficient and residual error to evaluate the correlation between the predicted MOS and the PESQ MOS. The experiments show that the method yields very promising evaluation results which are very close to the results of the PESQ. 2013-11-07T18:13:02Z 2013-11-07T18:13:02Z 2006 2006 Thesis Source: Masters Abstracts International, Volume: 44-06, page: 2834. http://hdl.handle.net/10393/27115 http://dx.doi.org/10.20381/ruor-11922 en 86 p. University of Ottawa (Canada)
collection NDLTD
language en
format Others
sources NDLTD
topic Computer Science.
spellingShingle Computer Science.
Cai, Libin
Speech quality evaluation using digital watermarking
description Speech quality evaluation is a very important research topic. The Mean Opinion Score (MOS) is reliable but the listening test is very expensive, time consuming, and even impractical for some applications. Objective quality evaluation methods require either the original speech or a complicated computation model, which makes some applications of quality evaluation impossible. Different from the perceptual model used by the Perceptual Evaluation of Speech Quality (PESQ), in this thesis, we propose to use digital audio watermarking to evaluate the quality of speech. Based on quantization, watermark bits are embedded and extracted in the Discrete Wavelet Transform (DWT) domain. By comparing the original and the extracted watermark, we predict the quality of speech that has undergone MP3 compression, Gaussian noise addition, low-pass filtering, or packet loss. Our quality evaluation method does not need the original signal or a computation model. For the quality evaluation, we use the PESQ MOS as a reference. We predict the speech quality from the PCEW (Percentage of Correctly Extracted Watermark bits) based on the mapping between ITU-T P.862 PESQ MOS and the PCEW. To evaluate the performance of our objective quality evaluation method, we introduce the correlation coefficient and residual error to evaluate the correlation between the predicted MOS and the PESQ MOS. The experiments show that the method yields very promising evaluation results which are very close to the results of the PESQ.
author Cai, Libin
author_facet Cai, Libin
author_sort Cai, Libin
title Speech quality evaluation using digital watermarking
title_short Speech quality evaluation using digital watermarking
title_full Speech quality evaluation using digital watermarking
title_fullStr Speech quality evaluation using digital watermarking
title_full_unstemmed Speech quality evaluation using digital watermarking
title_sort speech quality evaluation using digital watermarking
publisher University of Ottawa (Canada)
publishDate 2013
url http://hdl.handle.net/10393/27115
http://dx.doi.org/10.20381/ruor-11922
work_keys_str_mv AT cailibin speechqualityevaluationusingdigitalwatermarking
_version_ 1718602170601832448