Two - Features Voice Activity Detection and Transient Noise Classification in Low SNR Environment


Bibliographic Details
Main Authors: Chen, Szu-Hong, 陳思宏
Other Authors: Hu, Hwai-Tsu
Format: Others
Language: zh-TW
Published: 2014
Online Access: http://ndltd.ncl.edu.tw/handle/50557572767591710285
id ndltd-TW-102NIU00428009
record_format oai_dc
spelling ndltd-TW-102NIU00428009 2016-05-22T04:40:14Z http://ndltd.ncl.edu.tw/handle/50557572767591710285 Two - Features Voice Activity Detection and Transient Noise Classification in Low SNR Environment 於低訊噪比環境下之雙參數語音活動偵測與短暫噪音分類 Chen, Szu-Hong 陳思宏 Master's, National Ilan University, Department of Electronic Engineering (Master's Program), 102 Hu, Hwai-Tsu 胡懷祖 2014 thesis 75 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description Master's === National Ilan University === Department of Electronic Engineering, Master's Program === 102 === In recent years, smart home appliances and mobile devices have become increasingly prevalent. This study focuses on the smart TV architecture, in particular its front-end voice signal processing. It aims to develop an efficient voice activity detection (VAD) algorithm that discriminates voice from noise using noise cancellation and support vector machine (SVM) techniques, thereby providing cleaner voice signals for follow-up applications. The thesis has three main parts. First, two voice-related parameters, frame energy and spectral entropy, serve as the basis for the decision. Each parameter fluctuates and is unstable on its own, which leads to incorrect VAD decisions, so the two are combined into a single vector to control the fluctuation. This makes it easier to distinguish voiced frames from silence frames. Second, noise must be cancelled for voice applications to perform well. The VAD results are therefore used directly to identify silence frames, which contain only background noise, and these frames are used to derive the adaptive noise cancellation filter. Third, because conventional VAD algorithms have difficulty separating transient noise from voice, this study relies on the learning and classification capability of the SVM to distinguish the two. The experiments used white noise and babble noise as background noise, together with four types of transient noise. VAD performance was evaluated by detection accuracy and by the perceptual evaluation of speech quality (PESQ) measure. The discussion accordingly covers the effects of background noise cancellation and of SVM classification. The experimental results show that the proposed two-parameter VAD algorithm performs better when babble noise is present at low signal-to-noise ratios. Compared with the method used for comparison, the two-parameter algorithm has relatively low computational complexity and is therefore suitable for small mobile devices. The results also show that background noise significantly influences the classification results, suggesting that cancelling background noise will improve the performance of subsequent applications.
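As a rough illustration of the two features named in the abstract, the sketch below computes a frame energy and a spectral entropy for one frame of samples and pairs them into a vector. This record does not give the thesis's actual definitions, frame length, windowing, or the rule for combining the two values, so the function names, the log-energy scaling, and the use of a plain DFT power spectrum are all assumptions made for illustration only.

```python
import cmath
import math


def frame_energy(frame):
    """Sum of squared samples in one frame."""
    return sum(x * x for x in frame)


def spectral_entropy(frame):
    """Shannon entropy (bits) of the frame's normalized power spectrum."""
    n = len(frame)
    power = []
    # Naive DFT over the non-negative frequency bins; adequate for short frames.
    for k in range(n // 2 + 1):
        s = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(frame))
        power.append(abs(s) ** 2)
    total = sum(power)
    if total == 0.0:
        return 0.0  # all-zero frame: entropy defined as 0 here (assumption)
    probs = [p / total for p in power]
    return -sum(p * math.log2(p) for p in probs if p > 0.0)


def feature_vector(frame):
    """Hypothetical two-element feature: (log10 energy, spectral entropy)."""
    return (math.log10(frame_energy(frame) + 1e-12), spectral_entropy(frame))


# A pure tone concentrates power in one bin (low entropy); an impulse
# spreads power evenly over all bins (high entropy).
tone = [math.sin(2 * math.pi * 4 * t / 64) for t in range(64)]
impulse = [1.0] + [0.0] * 63
tone_features = feature_vector(tone)
impulse_features = feature_vector(impulse)
```

Under these assumptions, a frame with a peaked spectrum yields a much lower entropy than a frame with a flat spectrum, which is the kind of contrast a two-feature decision could exploit alongside energy.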
author2 Hu, Hwai-Tsu
author Chen, Szu-Hong
陳思宏
title Two - Features Voice Activity Detection and Transient Noise Classification in Low SNR Environment
publishDate 2014
url http://ndltd.ncl.edu.tw/handle/50557572767591710285