A Novel Independent Component Analysis for Noisy Speech Recognition

碩士 === 國立成功大學 === 資訊工程學系碩博士班 === 95 === Independent component analysis (ICA) is a widely accepted mechanism in solving blind source separation (BSS) problem. In this study, we develop a new ICA approach for unsupervised learning and apply it for hidden Markov model (HMM) clustering and noisy speech...

Full description

Bibliographic Details
Main Authors: Chang-Kai Chao, 趙昶凱
Other Authors: Jen-Tzung Chien
Format: Others
Language:zh-TW
Published: 2007
Online Access:http://ndltd.ncl.edu.tw/handle/49923643104009055088
id ndltd-TW-095NCKU5392099
record_format oai_dc
spelling ndltd-TW-095NCKU53920992016-05-20T04:17:27Z http://ndltd.ncl.edu.tw/handle/49923643104009055088 A Novel Independent Component Analysis for Noisy Speech Recognition 新穎獨立成份分析應用於雜訊語音辨識 Chang-Kai Chao 趙昶凱 碩士 國立成功大學 資訊工程學系碩博士班 95 Independent component analysis (ICA) is a widely accepted mechanism in solving blind source separation (BSS) problem. In this study, we develop a new ICA approach for unsupervised learning and apply it for hidden Markov model (HMM) clustering and noisy speech recognition. The underlying concept of proposed ICA algorithm is to de-mix the HMM mean vectors and identify the corresponding mixture sources prior to HMM clustering. These independent sources represent the specific noise conditions embedded in speech signals or features. We focus on presenting a general unsupervised learning algorithm based on a new ICA objective function. We will apply this algorithm for BSS and different problems in speech recognition. In ICA procedure, we follow up a predefined objective function measuring the dependence among feature vectors and derive an optimal demixing matrix, which can minimize the measure of dependence or mutual information. The basic assumptions of ICA include the source signals being mutually independent and having non-Gaussian distribution. We are using the Jensen’s inequality to derive a new metric of dependence measure. The parametric and nonparametric ICA approaches are developed. The generalized Gaussian model is used to characterize the non-Gaussianity of an acoustic random vector. We exploit a parametric ICA using generalized Gaussian distribution and also a nonparametric ICA using the Parzen window based distribution. We evaluate the efficiency and effectiveness of the proposed objective function in finding ICA demixing matrix compared to the existing objection functions including maximum likelihood, minimum mutual information and maximum non-entropy, etc. In the experiments, we investigate the performance of BSS and noisy speech recognition. We are using this ICA method for HMM clustering and evaluating speech recognition performance on AURORA2 noisy speech database. The preliminary results show that the proposed ICA achieves faster convergence property and higher recognition rate. Jen-Tzung Chien 簡仁宗 2007 學位論文 ; thesis 105 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立成功大學 === 資訊工程學系碩博士班 === 95 === Independent component analysis (ICA) is a widely accepted mechanism in solving blind source separation (BSS) problem. In this study, we develop a new ICA approach for unsupervised learning and apply it for hidden Markov model (HMM) clustering and noisy speech recognition. The underlying concept of proposed ICA algorithm is to de-mix the HMM mean vectors and identify the corresponding mixture sources prior to HMM clustering. These independent sources represent the specific noise conditions embedded in speech signals or features. We focus on presenting a general unsupervised learning algorithm based on a new ICA objective function. We will apply this algorithm for BSS and different problems in speech recognition. In ICA procedure, we follow up a predefined objective function measuring the dependence among feature vectors and derive an optimal demixing matrix, which can minimize the measure of dependence or mutual information. The basic assumptions of ICA include the source signals being mutually independent and having non-Gaussian distribution. We are using the Jensen’s inequality to derive a new metric of dependence measure. The parametric and nonparametric ICA approaches are developed. The generalized Gaussian model is used to characterize the non-Gaussianity of an acoustic random vector. We exploit a parametric ICA using generalized Gaussian distribution and also a nonparametric ICA using the Parzen window based distribution. We evaluate the efficiency and effectiveness of the proposed objective function in finding ICA demixing matrix compared to the existing objection functions including maximum likelihood, minimum mutual information and maximum non-entropy, etc. In the experiments, we investigate the performance of BSS and noisy speech recognition. We are using this ICA method for HMM clustering and evaluating speech recognition performance on AURORA2 noisy speech database. The preliminary results show that the proposed ICA achieves faster convergence property and higher recognition rate.
author2 Jen-Tzung Chien
author_facet Jen-Tzung Chien
Chang-Kai Chao
趙昶凱
author Chang-Kai Chao
趙昶凱
spellingShingle Chang-Kai Chao
趙昶凱
A Novel Independent Component Analysis for Noisy Speech Recognition
author_sort Chang-Kai Chao
title A Novel Independent Component Analysis for Noisy Speech Recognition
title_short A Novel Independent Component Analysis for Noisy Speech Recognition
title_full A Novel Independent Component Analysis for Noisy Speech Recognition
title_fullStr A Novel Independent Component Analysis for Noisy Speech Recognition
title_full_unstemmed A Novel Independent Component Analysis for Noisy Speech Recognition
title_sort novel independent component analysis for noisy speech recognition
publishDate 2007
url http://ndltd.ncl.edu.tw/handle/49923643104009055088
work_keys_str_mv AT changkaichao anovelindependentcomponentanalysisfornoisyspeechrecognition
AT zhàochǎngkǎi anovelindependentcomponentanalysisfornoisyspeechrecognition
AT changkaichao xīnyǐngdúlìchéngfènfēnxīyīngyòngyúzáxùnyǔyīnbiànshí
AT zhàochǎngkǎi xīnyǐngdúlìchéngfènfēnxīyīngyòngyúzáxùnyǔyīnbiànshí
AT changkaichao novelindependentcomponentanalysisfornoisyspeechrecognition
AT zhàochǎngkǎi novelindependentcomponentanalysisfornoisyspeechrecognition
_version_ 1718272153530400768