A study on voice conversion with its application to CELP coder

Master's thesis (碩士), 國立臺北科技大學 (National Taipei University of Technology), 電腦通訊與控制研究所 (Graduate Institute of Computer, Communication and Control), academic year 88 (ROC calendar). In this thesis, a voice conversion system is developed that modifies the speech signal of one speaker so that it sounds like that of another. Two acoustic parameters, generally the most important ones, are used for speaker adaptation. One is the pitch delay and the o...

Bibliographic Details
Main Author: 翁正平
Other Authors: 簡福榮
Format: Others
Language: zh-TW
Published: 2000
Online Access: http://ndltd.ncl.edu.tw/handle/64143139390334949700
id ndltd-TW-088TIT00652016
record_format oai_dc
spelling ndltd-TW-088TIT00652016 2016-01-29T04:19:17Z http://ndltd.ncl.edu.tw/handle/64143139390334949700 A study on voice conversion with its application to CELP coder 具語者轉換功能之語音編碼器 翁正平 碩士 國立臺北科技大學 電腦通訊與控制研究所 88 簡福榮 2000 學位論文 ; thesis 0 zh-TW
collection NDLTD
sources NDLTD
description Master's thesis (碩士), 國立臺北科技大學 (National Taipei University of Technology), 電腦通訊與控制研究所 (Graduate Institute of Computer, Communication and Control), academic year 88 (ROC calendar). In this thesis, a voice conversion system is developed that modifies the speech signal of one speaker so that it sounds like that of another. Two acoustic parameters, generally the most important ones, are used for speaker adaptation: the pitch delay and the short-time spectrum. For the pitch delay, we compare against the PSOLA technique and propose a new pitch-modification algorithm that performs better and does not suffer from phase discontinuity. For the short-time spectrum, the vocal tract areas are found to represent changes in short-time spectral information efficiently. In addition, the voice conversion system is built into a CELP-based speech coder to meet the speaker-security and low-bit-rate requirements of communication networks. The results show that the integrated system can easily convert one person's voice to that of another, unknown person by tuning the acoustic parameters mentioned above. It retains the benefit of noise suppression and additionally reduces the unnatural impulse train introduced by voice conversion. Finally, a subjective MOS (mean opinion score) test is performed to measure the quality.
author2 簡福榮
author_facet 簡福榮
翁正平
author 翁正平