Phoneme and Sentence-Level Ensembles for Speech Recognition

<p/> <p>We address the question of whether and how boosting and bagging can be used for speech recognition. In order to do this, we compare two different boosting schemes, one at the phoneme level and one at the utterance level, with a phoneme-level bagging scheme. We control for many pa...

Full description

Bibliographic Details
Main Authors: Bengio Samy, Dimitrakakis Christos
Format: Article
Language:English
Published: SpringerOpen 2011-01-01
Series:EURASIP Journal on Audio, Speech, and Music Processing
Online Access:http://asmp.eurasipjournals.com/content/2011/426792
id doaj-bec7c6036de54422b16ffc66fd1a6ae8
record_format Article
spelling doaj-bec7c6036de54422b16ffc66fd1a6ae82020-11-25T01:49:11ZengSpringerOpenEURASIP Journal on Audio, Speech, and Music Processing1687-47141687-47222011-01-0120111426792Phoneme and Sentence-Level Ensembles for Speech RecognitionBengio SamyDimitrakakis Christos<p/> <p>We address the question of whether and how boosting and bagging can be used for speech recognition. In order to do this, we compare two different boosting schemes, one at the phoneme level and one at the utterance level, with a phoneme-level bagging scheme. We control for many parameters and other choices, such as the state inference scheme used. In an unbiased experiment, we clearly show that the gain of boosting methods compared to a single hidden Markov model is in all cases only marginal, while bagging significantly outperforms all other methods. We thus conclude that bagging methods, which have so far been overlooked in favour of boosting, should be examined more closely as a potentially useful ensemble learning technique for speech recognition.</p>http://asmp.eurasipjournals.com/content/2011/426792
collection DOAJ
language English
format Article
sources DOAJ
author Bengio Samy
Dimitrakakis Christos
spellingShingle Bengio Samy
Dimitrakakis Christos
Phoneme and Sentence-Level Ensembles for Speech Recognition
EURASIP Journal on Audio, Speech, and Music Processing
author_facet Bengio Samy
Dimitrakakis Christos
author_sort Bengio Samy
title Phoneme and Sentence-Level Ensembles for Speech Recognition
title_short Phoneme and Sentence-Level Ensembles for Speech Recognition
title_full Phoneme and Sentence-Level Ensembles for Speech Recognition
title_fullStr Phoneme and Sentence-Level Ensembles for Speech Recognition
title_full_unstemmed Phoneme and Sentence-Level Ensembles for Speech Recognition
title_sort phoneme and sentence-level ensembles for speech recognition
publisher SpringerOpen
series EURASIP Journal on Audio, Speech, and Music Processing
issn 1687-4714
1687-4722
publishDate 2011-01-01
description <p/> <p>We address the question of whether and how boosting and bagging can be used for speech recognition. In order to do this, we compare two different boosting schemes, one at the phoneme level and one at the utterance level, with a phoneme-level bagging scheme. We control for many parameters and other choices, such as the state inference scheme used. In an unbiased experiment, we clearly show that the gain of boosting methods compared to a single hidden Markov model is in all cases only marginal, while bagging significantly outperforms all other methods. We thus conclude that bagging methods, which have so far been overlooked in favour of boosting, should be examined more closely as a potentially useful ensemble learning technique for speech recognition.</p>
url http://asmp.eurasipjournals.com/content/2011/426792
work_keys_str_mv AT bengiosamy phonemeandsentencelevelensemblesforspeechrecognition
AT dimitrakakischristos phonemeandsentencelevelensemblesforspeechrecognition
_version_ 1725008173986742272