Speech Enhancement Using Variable Peak-Spectrum-Holding Length Adapted by Harmonic Properties for Frame-Zero-Padding Method
碩士 === 亞洲大學 === 資訊傳播學系 === 104 === Speech and noise signals are mixed in the same channel in speech communication. A speech enhancement system is employed to remove corruption noise, enabling a listener to understand the meaning of received speech. The accuracy of noise estimation significantly affe...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2016
|
Online Access: | http://ndltd.ncl.edu.tw/handle/k63t7c |
id |
ndltd-TW-104THMU0676010 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-104THMU06760102019-06-27T05:26:00Z http://ndltd.ncl.edu.tw/handle/k63t7c Speech Enhancement Using Variable Peak-Spectrum-Holding Length Adapted by Harmonic Properties for Frame-Zero-Padding Method 在零值音框嵌入法中使用諧波特性調適峰值頻譜鎖定長度於語音增強之研究 WU, WEI-LI 吳威俐 碩士 亞洲大學 資訊傳播學系 104 Speech and noise signals are mixed in the same channel in speech communication. A speech enhancement system is employed to remove corruption noise, enabling a listener to understand the meaning of received speech. The accuracy of noise estimation significantly affects the performance of enhanced speech. This thesis proposes using a frame-zero padding and peak-spectrum-holding methods adapted by harmonic properties to improve the accuracy of noise estimation. Because speech signals are absent during the zero-padded frames, we can estimate the magnitude of noise spectrum during these periods. In order to improve the performance of frame-zero padding method, robust harmonics in a vowel frame is estimated and employed to adapt the segment length for noise estimation. In the case of a non-vowel frame, the segment length is increased to adequately over-estimate the noise magnitude by the peak-spectrum holding method. So the residual noise can be significantly reduced in enhanced speech. On the contrary, the noise estimate is updated instantaneously during a vowel period, so the noise estimate can be prevented from over-estimation obtained by the peak-spectrum holding method. Accordingly, enhanced speech does not suffer from serious speech distortion. Experimental results show that the proposed method can efficiently remove background and residual noise during speech pause regions, enabling enhanced speech to sound distinct and comfortable. LU, CHING-TA 陸清達 2016 學位論文 ; thesis 73 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 亞洲大學 === 資訊傳播學系 === 104 === Speech and noise signals are mixed in the same channel in speech communication. A speech enhancement system is employed to remove corruption noise, enabling a listener to understand the meaning of received speech. The accuracy of noise estimation significantly affects the performance of enhanced speech. This thesis proposes using a frame-zero padding and peak-spectrum-holding methods adapted by harmonic properties to improve the accuracy of noise estimation. Because speech signals are absent during the zero-padded frames, we can estimate the magnitude of noise spectrum during these periods. In order to improve the performance of frame-zero padding method, robust harmonics in a vowel frame is estimated and employed to adapt the segment length for noise estimation. In the case of a non-vowel frame, the segment length is increased to adequately over-estimate the noise magnitude by the peak-spectrum holding method. So the residual noise can be significantly reduced in enhanced speech. On the contrary, the noise estimate is updated instantaneously during a vowel period, so the noise estimate can be prevented from over-estimation obtained by the peak-spectrum holding method. Accordingly, enhanced speech does not suffer from serious speech distortion. Experimental results show that the proposed method can efficiently remove background and residual noise during speech pause regions, enabling enhanced speech to sound distinct and comfortable.
|
author2 |
LU, CHING-TA |
author_facet |
LU, CHING-TA WU, WEI-LI 吳威俐 |
author |
WU, WEI-LI 吳威俐 |
spellingShingle |
WU, WEI-LI 吳威俐 Speech Enhancement Using Variable Peak-Spectrum-Holding Length Adapted by Harmonic Properties for Frame-Zero-Padding Method |
author_sort |
WU, WEI-LI |
title |
Speech Enhancement Using Variable Peak-Spectrum-Holding Length Adapted by Harmonic Properties for Frame-Zero-Padding Method |
title_short |
Speech Enhancement Using Variable Peak-Spectrum-Holding Length Adapted by Harmonic Properties for Frame-Zero-Padding Method |
title_full |
Speech Enhancement Using Variable Peak-Spectrum-Holding Length Adapted by Harmonic Properties for Frame-Zero-Padding Method |
title_fullStr |
Speech Enhancement Using Variable Peak-Spectrum-Holding Length Adapted by Harmonic Properties for Frame-Zero-Padding Method |
title_full_unstemmed |
Speech Enhancement Using Variable Peak-Spectrum-Holding Length Adapted by Harmonic Properties for Frame-Zero-Padding Method |
title_sort |
speech enhancement using variable peak-spectrum-holding length adapted by harmonic properties for frame-zero-padding method |
publishDate |
2016 |
url |
http://ndltd.ncl.edu.tw/handle/k63t7c |
work_keys_str_mv |
AT wuweili speechenhancementusingvariablepeakspectrumholdinglengthadaptedbyharmonicpropertiesforframezeropaddingmethod AT wúwēilì speechenhancementusingvariablepeakspectrumholdinglengthadaptedbyharmonicpropertiesforframezeropaddingmethod AT wuweili zàilíngzhíyīnkuāngqiànrùfǎzhōngshǐyòngxiébōtèxìngdiàoshìfēngzhípínpǔsuǒdìngzhǎngdùyúyǔyīnzēngqiángzhīyánjiū AT wúwēilì zàilíngzhíyīnkuāngqiànrùfǎzhōngshǐyòngxiébōtèxìngdiàoshìfēngzhípínpǔsuǒdìngzhǎngdùyúyǔyīnzēngqiángzhīyánjiū |
_version_ |
1719211515665448960 |