Speech Enhancement Using Variable Peak-Spectrum-Holding Length Adapted by Harmonic Properties for Frame-Zero-Padding Method

碩士 === 亞洲大學 === 資訊傳播學系 === 104 === Speech and noise signals are mixed in the same channel in speech communication. A speech enhancement system is employed to remove corruption noise, enabling a listener to understand the meaning of received speech. The accuracy of noise estimation significantly affe...

Full description

Bibliographic Details
Main Authors: WU, WEI-LI, 吳威俐
Other Authors: LU, CHING-TA
Format: Others
Language:zh-TW
Published: 2016
Online Access:http://ndltd.ncl.edu.tw/handle/k63t7c
id ndltd-TW-104THMU0676010
record_format oai_dc
spelling ndltd-TW-104THMU06760102019-06-27T05:26:00Z http://ndltd.ncl.edu.tw/handle/k63t7c Speech Enhancement Using Variable Peak-Spectrum-Holding Length Adapted by Harmonic Properties for Frame-Zero-Padding Method 在零值音框嵌入法中使用諧波特性調適峰值頻譜鎖定長度於語音增強之研究 WU, WEI-LI 吳威俐 碩士 亞洲大學 資訊傳播學系 104 Speech and noise signals are mixed in the same channel in speech communication. A speech enhancement system is employed to remove corruption noise, enabling a listener to understand the meaning of received speech. The accuracy of noise estimation significantly affects the performance of enhanced speech. This thesis proposes using a frame-zero padding and peak-spectrum-holding methods adapted by harmonic properties to improve the accuracy of noise estimation. Because speech signals are absent during the zero-padded frames, we can estimate the magnitude of noise spectrum during these periods. In order to improve the performance of frame-zero padding method, robust harmonics in a vowel frame is estimated and employed to adapt the segment length for noise estimation. In the case of a non-vowel frame, the segment length is increased to adequately over-estimate the noise magnitude by the peak-spectrum holding method. So the residual noise can be significantly reduced in enhanced speech. On the contrary, the noise estimate is updated instantaneously during a vowel period, so the noise estimate can be prevented from over-estimation obtained by the peak-spectrum holding method. Accordingly, enhanced speech does not suffer from serious speech distortion. Experimental results show that the proposed method can efficiently remove background and residual noise during speech pause regions, enabling enhanced speech to sound distinct and comfortable. LU, CHING-TA 陸清達 2016 學位論文 ; thesis 73 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 亞洲大學 === 資訊傳播學系 === 104 === Speech and noise signals are mixed in the same channel in speech communication. A speech enhancement system is employed to remove corruption noise, enabling a listener to understand the meaning of received speech. The accuracy of noise estimation significantly affects the performance of enhanced speech. This thesis proposes using a frame-zero padding and peak-spectrum-holding methods adapted by harmonic properties to improve the accuracy of noise estimation. Because speech signals are absent during the zero-padded frames, we can estimate the magnitude of noise spectrum during these periods. In order to improve the performance of frame-zero padding method, robust harmonics in a vowel frame is estimated and employed to adapt the segment length for noise estimation. In the case of a non-vowel frame, the segment length is increased to adequately over-estimate the noise magnitude by the peak-spectrum holding method. So the residual noise can be significantly reduced in enhanced speech. On the contrary, the noise estimate is updated instantaneously during a vowel period, so the noise estimate can be prevented from over-estimation obtained by the peak-spectrum holding method. Accordingly, enhanced speech does not suffer from serious speech distortion. Experimental results show that the proposed method can efficiently remove background and residual noise during speech pause regions, enabling enhanced speech to sound distinct and comfortable.
author2 LU, CHING-TA
author_facet LU, CHING-TA
WU, WEI-LI
吳威俐
author WU, WEI-LI
吳威俐
spellingShingle WU, WEI-LI
吳威俐
Speech Enhancement Using Variable Peak-Spectrum-Holding Length Adapted by Harmonic Properties for Frame-Zero-Padding Method
author_sort WU, WEI-LI
title Speech Enhancement Using Variable Peak-Spectrum-Holding Length Adapted by Harmonic Properties for Frame-Zero-Padding Method
title_short Speech Enhancement Using Variable Peak-Spectrum-Holding Length Adapted by Harmonic Properties for Frame-Zero-Padding Method
title_full Speech Enhancement Using Variable Peak-Spectrum-Holding Length Adapted by Harmonic Properties for Frame-Zero-Padding Method
title_fullStr Speech Enhancement Using Variable Peak-Spectrum-Holding Length Adapted by Harmonic Properties for Frame-Zero-Padding Method
title_full_unstemmed Speech Enhancement Using Variable Peak-Spectrum-Holding Length Adapted by Harmonic Properties for Frame-Zero-Padding Method
title_sort speech enhancement using variable peak-spectrum-holding length adapted by harmonic properties for frame-zero-padding method
publishDate 2016
url http://ndltd.ncl.edu.tw/handle/k63t7c
work_keys_str_mv AT wuweili speechenhancementusingvariablepeakspectrumholdinglengthadaptedbyharmonicpropertiesforframezeropaddingmethod
AT wúwēilì speechenhancementusingvariablepeakspectrumholdinglengthadaptedbyharmonicpropertiesforframezeropaddingmethod
AT wuweili zàilíngzhíyīnkuāngqiànrùfǎzhōngshǐyòngxiébōtèxìngdiàoshìfēngzhípínpǔsuǒdìngzhǎngdùyúyǔyīnzēngqiángzhīyánjiū
AT wúwēilì zàilíngzhíyīnkuāngqiànrùfǎzhōngshǐyòngxiébōtèxìngdiàoshìfēngzhípínpǔsuǒdìngzhǎngdùyúyǔyīnzēngqiángzhīyánjiū
_version_ 1719211515665448960