A study on voice conversion with its application to CELP coder

碩士 === 國立臺北科技大學 === 電腦通訊與控制研究所 === 88 === In this thesis, a voice conversion system is developed to modify the speech signal of one speaker so that it sounds like that of another. Generally, two most important acoustic parameters are adopted for speaker adaptation. One is the pitch delay and the o...

Full description

Bibliographic Details
Main Author:	翁正平
Other Authors:	簡福榮
Format:	Others
Language:	zh-TW
Published:	2000
Online Access:	http://ndltd.ncl.edu.tw/handle/64143139390334949700

id	ndltd-TW-088TIT00652016
record_format	oai_dc
spelling	ndltd-TW-088TIT006520162016-01-29T04:19:17Z http://ndltd.ncl.edu.tw/handle/64143139390334949700 A study on voice conversion with its application to CELP coder 具語者轉換功能之語音編碼器翁正平碩士國立臺北科技大學電腦通訊與控制研究所 88 In this thesis, a voice conversion system is developed to modify the speech signal of one speaker so that it sounds like that of another. Generally, two most important acoustic parameters are adopted for speaker adaptation. One is the pitch delay and the other is the short-time spectrum. In the former, we compare the PSOLA technique and propose a new and much better algorithm for pitch modification without suffering the problem of phase discontinuity. In the latter, the vocal tract areas are found to be efficient in representing the change of short-time spectrum information. In addition, the voice conversion system is built inside a CELP-based speech coder for the purpose of speaker security and low bit-rate requirement in communication networks. The result shows that the integrated system can easily convert the voice speech from one person to that of another unknown person by tuning the acoustic parameters mentioned above. It still takes advantage of noise suppression, while allowing additional benefits from reducing the unnatural impulse train due to voice conversion. Finally, the subjective MOS test is performed to measure the quality. 簡福榮 2000 學位論文 ; thesis 0 zh-TW
collection	NDLTD
language	zh-TW
format	Others
sources	NDLTD
description	碩士 === 國立臺北科技大學 === 電腦通訊與控制研究所 === 88 === In this thesis, a voice conversion system is developed to modify the speech signal of one speaker so that it sounds like that of another. Generally, two most important acoustic parameters are adopted for speaker adaptation. One is the pitch delay and the other is the short-time spectrum. In the former, we compare the PSOLA technique and propose a new and much better algorithm for pitch modification without suffering the problem of phase discontinuity. In the latter, the vocal tract areas are found to be efficient in representing the change of short-time spectrum information. In addition, the voice conversion system is built inside a CELP-based speech coder for the purpose of speaker security and low bit-rate requirement in communication networks. The result shows that the integrated system can easily convert the voice speech from one person to that of another unknown person by tuning the acoustic parameters mentioned above. It still takes advantage of noise suppression, while allowing additional benefits from reducing the unnatural impulse train due to voice conversion. Finally, the subjective MOS test is performed to measure the quality.
author2	簡福榮
author_facet	簡福榮翁正平
author	翁正平
spellingShingle	翁正平 A study on voice conversion with its application to CELP coder
author_sort	翁正平
title	A study on voice conversion with its application to CELP coder
title_short	A study on voice conversion with its application to CELP coder
title_full	A study on voice conversion with its application to CELP coder
title_fullStr	A study on voice conversion with its application to CELP coder
title_full_unstemmed	A study on voice conversion with its application to CELP coder
title_sort	study on voice conversion with its application to celp coder
publishDate	2000
url	http://ndltd.ncl.edu.tw/handle/64143139390334949700
work_keys_str_mv	AT wēngzhèngpíng astudyonvoiceconversionwithitsapplicationtocelpcoder AT wēngzhèngpíng jùyǔzhězhuǎnhuàngōngnéngzhīyǔyīnbiānmǎqì AT wēngzhèngpíng studyonvoiceconversionwithitsapplicationtocelpcoder
_version_	1718168786291392512

A study on voice conversion with its application to CELP coder

Similar Items