Voice Activity Detection based on Harmonic Structure in Keyword Listening Application

Bibliographic Details
Main Authors: Chien, Yu-Hsuan, 簡佑軒
Other Authors: Hu, Jwu-Sheng
Format: Others
Language: zh-TW
Published: 2014
Online Access: http://ndltd.ncl.edu.tw/handle/11510830911622734593
Description
Summary: Master's thesis === National Chiao Tung University === Institute of Electrical and Control Engineering === 103 === This thesis proposes a new voice activity detection (VAD) algorithm, based on a harmonic structure feature, for keyword listening applications. The harmonic structure feature exploits the periodicity of energy in the frequency domain. The approach searches for the prominent part of the harmonic structure in the frequency domain as the speech feature, and checks the continuity of the harmonic structure in the time domain as the VAD decision rule. The proposed algorithm is tested under different types of non-stationary noise and different SNR conditions. Experimental results demonstrate its advantages over other VADs such as G.729, long-term spectral divergence (LTSD), and Gaussian mixture model (GMM).
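
Illustrative sketch (not taken from the thesis): the summary describes two steps, a per-frame harmonic structure feature in the frequency domain and a continuity check across frames. The Python below shows one way such a pipeline could look. The function names, the comb-style scoring, the F0 search range (80 to 400 Hz), the number of harmonics, the threshold, and the minimum run length are all assumptions chosen for illustration; the thesis may define its feature and decision rule differently.

```python
import cmath
import math


def power_spectrum(frame):
    """Power at DFT bins 0 .. N/2 - 1 (naive O(N^2) DFT, fine for short frames)."""
    n = len(frame)
    spec = []
    for k in range(n // 2):
        acc = 0j
        for t, x in enumerate(frame):
            acc += x * cmath.exp(-2j * math.pi * k * t / n)
        spec.append(abs(acc) ** 2)
    return spec


def harmonic_score(frame, sample_rate, f0_min=80.0, f0_max=400.0, n_harmonics=4):
    """Hypothetical feature: largest fraction of non-DC energy that falls on a
    harmonic comb (F0, 2*F0, ...) over all candidate F0 bins. Range is 0 to 1."""
    spec = power_spectrum(frame)
    total = sum(spec[1:])  # ignore the DC bin
    if total <= 0.0:
        return 0.0  # silent frame: no harmonic structure
    hz_per_bin = sample_rate / len(frame)
    k_lo = max(1, math.ceil(f0_min / hz_per_bin))
    k_hi = math.floor(f0_max / hz_per_bin)
    best = 0.0
    for k in range(k_lo, k_hi + 1):
        comb = sum(spec[m * k] for m in range(1, n_harmonics + 1)
                   if m * k < len(spec))
        best = max(best, comb / total)
    return best


def continuity_vad(scores, threshold=0.6, min_run=3):
    """Hypothetical decision rule: a frame is speech only if it belongs to a run
    of at least min_run consecutive frames whose score exceeds threshold."""
    flags = [False] * len(scores)
    run_start = None
    # The trailing -inf sentinel closes a run that reaches the last frame.
    for i, s in enumerate(list(scores) + [float("-inf")]):
        if s > threshold:
            if run_start is None:
                run_start = i
        else:
            if run_start is not None and i - run_start >= min_run:
                for j in range(run_start, i):
                    flags[j] = True
            run_start = None
    return flags
```

In use, one would split the signal into short frames, call harmonic_score on each frame, and pass the resulting list of scores to continuity_vad. Requiring a minimum run length is what lets a rule of this kind reject brief harmonic bursts in non-stationary noise while keeping sustained voiced speech.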