A study on voice conversion with its application to CELP coder
碩士 === 國立臺北科技大學 === 電腦通訊與控制研究所 === 88 === In this thesis, a voice conversion system is developed to modify the speech signal of one speaker so that it sounds like that of another. Generally, two most important acoustic parameters are adopted for speaker adaptation. One is the pitch delay and the o...
Main Author: | |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2000
|
Online Access: | http://ndltd.ncl.edu.tw/handle/64143139390334949700 |
id |
ndltd-TW-088TIT00652016 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-088TIT006520162016-01-29T04:19:17Z http://ndltd.ncl.edu.tw/handle/64143139390334949700 A study on voice conversion with its application to CELP coder 具語者轉換功能之語音編碼器 翁正平 碩士 國立臺北科技大學 電腦通訊與控制研究所 88 In this thesis, a voice conversion system is developed to modify the speech signal of one speaker so that it sounds like that of another. Generally, two most important acoustic parameters are adopted for speaker adaptation. One is the pitch delay and the other is the short-time spectrum. In the former, we compare the PSOLA technique and propose a new and much better algorithm for pitch modification without suffering the problem of phase discontinuity. In the latter, the vocal tract areas are found to be efficient in representing the change of short-time spectrum information. In addition, the voice conversion system is built inside a CELP-based speech coder for the purpose of speaker security and low bit-rate requirement in communication networks. The result shows that the integrated system can easily convert the voice speech from one person to that of another unknown person by tuning the acoustic parameters mentioned above. It still takes advantage of noise suppression, while allowing additional benefits from reducing the unnatural impulse train due to voice conversion. Finally, the subjective MOS test is performed to measure the quality. 簡福榮 2000 學位論文 ; thesis 0 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺北科技大學 === 電腦通訊與控制研究所 === 88 === In this thesis, a voice conversion system is developed to modify the speech signal of one speaker so that it sounds like that of another. Generally, two most important acoustic parameters are adopted for speaker adaptation. One is the pitch delay and the other is the short-time spectrum. In the former, we compare the PSOLA technique and propose a new and much better algorithm for pitch modification without suffering the problem of phase discontinuity. In the latter, the vocal tract areas are found to be efficient in representing the change of short-time spectrum information. In addition, the voice conversion system is built inside a CELP-based speech coder for the purpose of speaker security and low bit-rate requirement in communication networks. The result shows that the integrated system can easily convert the voice speech from one person to that of another unknown person by tuning the acoustic parameters mentioned above. It still takes advantage of noise suppression, while allowing additional benefits from reducing the unnatural impulse train due to voice conversion. Finally, the subjective MOS test is performed to measure the quality.
|
author2 |
簡福榮 |
author_facet |
簡福榮 翁正平 |
author |
翁正平 |
spellingShingle |
翁正平 A study on voice conversion with its application to CELP coder |
author_sort |
翁正平 |
title |
A study on voice conversion with its application to CELP coder |
title_short |
A study on voice conversion with its application to CELP coder |
title_full |
A study on voice conversion with its application to CELP coder |
title_fullStr |
A study on voice conversion with its application to CELP coder |
title_full_unstemmed |
A study on voice conversion with its application to CELP coder |
title_sort |
study on voice conversion with its application to celp coder |
publishDate |
2000 |
url |
http://ndltd.ncl.edu.tw/handle/64143139390334949700 |
work_keys_str_mv |
AT wēngzhèngpíng astudyonvoiceconversionwithitsapplicationtocelpcoder AT wēngzhèngpíng jùyǔzhězhuǎnhuàngōngnéngzhīyǔyīnbiānmǎqì AT wēngzhèngpíng studyonvoiceconversionwithitsapplicationtocelpcoder |
_version_ |
1718168786291392512 |