Application of Deep Belief Network on Binaural Speech Separation and Dereverberation

碩士 === 國立交通大學 === 電信工程研究所 === 103 === Binaural speech separation and de-reverberation are popular research topics and we have developed an unsupervised clustering method for these purposes. In this thesis, we adopt a supervised classification method for binaural speech separation and de-reverberatio...

Full description

Bibliographic Details
Main Authors: Chen, Yi-Ting, 陳奕廷
Other Authors: Chi, Tai-Shih
Format: Others
Language:zh-TW
Published: 2015
Online Access:http://ndltd.ncl.edu.tw/handle/34100240484476159416
id ndltd-TW-103NCTU5435138
record_format oai_dc
spelling ndltd-TW-103NCTU54351382016-08-12T04:14:06Z http://ndltd.ncl.edu.tw/handle/34100240484476159416 Application of Deep Belief Network on Binaural Speech Separation and Dereverberation 深層信念網路在雙耳語音分離及消除迴響上的應用 Chen, Yi-Ting 陳奕廷 碩士 國立交通大學 電信工程研究所 103 Binaural speech separation and de-reverberation are popular research topics and we have developed an unsupervised clustering method for these purposes. In this thesis, we adopt a supervised classification method for binaural speech separation and de-reverberation using the ideal binary mask (IBM) as the training target and a deep belief network (DBN) as the classifier. We extract the interaural time difference (ITD) and the interaural level difference (ILD) of each T-F unit as the binaural features. To boost the performance of de-reverberation, the interaural coherence (IC) is considered when building the target IBM. We propose three different DBN architectures, the side-by-side training (monaural training), the joint training (binaural training) and the multitask learning, and compare their binaural de-reverberation performance with the performance of our previously developed unsupervised clustering method in terms of many objective criteria. Chi, Tai-Shih 冀泰石 2015 學位論文 ; thesis 48 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立交通大學 === 電信工程研究所 === 103 === Binaural speech separation and de-reverberation are popular research topics and we have developed an unsupervised clustering method for these purposes. In this thesis, we adopt a supervised classification method for binaural speech separation and de-reverberation using the ideal binary mask (IBM) as the training target and a deep belief network (DBN) as the classifier. We extract the interaural time difference (ITD) and the interaural level difference (ILD) of each T-F unit as the binaural features. To boost the performance of de-reverberation, the interaural coherence (IC) is considered when building the target IBM. We propose three different DBN architectures, the side-by-side training (monaural training), the joint training (binaural training) and the multitask learning, and compare their binaural de-reverberation performance with the performance of our previously developed unsupervised clustering method in terms of many objective criteria.
author2 Chi, Tai-Shih
author_facet Chi, Tai-Shih
Chen, Yi-Ting
陳奕廷
author Chen, Yi-Ting
陳奕廷
spellingShingle Chen, Yi-Ting
陳奕廷
Application of Deep Belief Network on Binaural Speech Separation and Dereverberation
author_sort Chen, Yi-Ting
title Application of Deep Belief Network on Binaural Speech Separation and Dereverberation
title_short Application of Deep Belief Network on Binaural Speech Separation and Dereverberation
title_full Application of Deep Belief Network on Binaural Speech Separation and Dereverberation
title_fullStr Application of Deep Belief Network on Binaural Speech Separation and Dereverberation
title_full_unstemmed Application of Deep Belief Network on Binaural Speech Separation and Dereverberation
title_sort application of deep belief network on binaural speech separation and dereverberation
publishDate 2015
url http://ndltd.ncl.edu.tw/handle/34100240484476159416
work_keys_str_mv AT chenyiting applicationofdeepbeliefnetworkonbinauralspeechseparationanddereverberation
AT chényìtíng applicationofdeepbeliefnetworkonbinauralspeechseparationanddereverberation
AT chenyiting shēncéngxìnniànwǎnglùzàishuāngěryǔyīnfēnlíjíxiāochúhuíxiǎngshàngdeyīngyòng
AT chényìtíng shēncéngxìnniànwǎnglùzàishuāngěryǔyīnfēnlíjíxiāochúhuíxiǎngshàngdeyīngyòng
_version_ 1718374473519857664