Improving the Slovak LVCSR Performance by Cluster-Sensitive Acoustic Model Retraining

In this paper, we present a cluster-dependent adaptation approach for HMM-based acoustic models. The proposed approach employs clustering techniques to group the original training utterances into clusters with predefined number. The clustered speech data are intended to adapt an initially pre-traine...

Full description

Bibliographic Details
Main Authors: Peter Viszlay, Marek Ecegi, Josef Juhar
Format: Article
Language:English
Published: VSB-Technical University of Ostrava 2015-01-01
Series:Advances in Electrical and Electronic Engineering
Subjects:
Online Access:http://advances.utc.sk/index.php/AEEE/article/view/1448
id doaj-aa729e7a68514e7792c52b0c39647ac4
record_format Article
spelling doaj-aa729e7a68514e7792c52b0c39647ac42021-10-11T08:03:05ZengVSB-Technical University of OstravaAdvances in Electrical and Electronic Engineering1336-13761804-31192015-01-0113429530210.15598/aeee.v13i4.1448749Improving the Slovak LVCSR Performance by Cluster-Sensitive Acoustic Model RetrainingPeter Viszlay0Marek EcegiJosef JuharTechnical University of KošiceIn this paper, we present a cluster-dependent adaptation approach for HMM-based acoustic models. The proposed approach employs clustering techniques to group the original training utterances into clusters with predefined number. The clustered speech data are intended to adapt an initially pre-trained acoustic model to the specific cluster by reestimation based on the standard Baum-Welch procedure. The resulting model, adapted to the homogeneous data may markedly improve the baseline recognition rate, whereas the model complexity may be reduced. In the recognition step, the test samples are scored by each adapted model and the most accurate one is chosen. The proposed approach is thoroughly evaluated in Slovak triphone-based large vocabulary continuous speech recognition (LVCSR) system. The results prove that the cluster-sensitive retraining leads to significant improvements over the baseline reference system trained according to the conventional training procedure.http://advances.utc.sk/index.php/AEEE/article/view/1448acoustic modeladaptationcluster analysisreestimationweighted mean vector.
collection DOAJ
language English
format Article
sources DOAJ
author Peter Viszlay
Marek Ecegi
Josef Juhar
spellingShingle Peter Viszlay
Marek Ecegi
Josef Juhar
Improving the Slovak LVCSR Performance by Cluster-Sensitive Acoustic Model Retraining
Advances in Electrical and Electronic Engineering
acoustic model
adaptation
cluster analysis
reestimation
weighted mean vector.
author_facet Peter Viszlay
Marek Ecegi
Josef Juhar
author_sort Peter Viszlay
title Improving the Slovak LVCSR Performance by Cluster-Sensitive Acoustic Model Retraining
title_short Improving the Slovak LVCSR Performance by Cluster-Sensitive Acoustic Model Retraining
title_full Improving the Slovak LVCSR Performance by Cluster-Sensitive Acoustic Model Retraining
title_fullStr Improving the Slovak LVCSR Performance by Cluster-Sensitive Acoustic Model Retraining
title_full_unstemmed Improving the Slovak LVCSR Performance by Cluster-Sensitive Acoustic Model Retraining
title_sort improving the slovak lvcsr performance by cluster-sensitive acoustic model retraining
publisher VSB-Technical University of Ostrava
series Advances in Electrical and Electronic Engineering
issn 1336-1376
1804-3119
publishDate 2015-01-01
description In this paper, we present a cluster-dependent adaptation approach for HMM-based acoustic models. The proposed approach employs clustering techniques to group the original training utterances into clusters with predefined number. The clustered speech data are intended to adapt an initially pre-trained acoustic model to the specific cluster by reestimation based on the standard Baum-Welch procedure. The resulting model, adapted to the homogeneous data may markedly improve the baseline recognition rate, whereas the model complexity may be reduced. In the recognition step, the test samples are scored by each adapted model and the most accurate one is chosen. The proposed approach is thoroughly evaluated in Slovak triphone-based large vocabulary continuous speech recognition (LVCSR) system. The results prove that the cluster-sensitive retraining leads to significant improvements over the baseline reference system trained according to the conventional training procedure.
topic acoustic model
adaptation
cluster analysis
reestimation
weighted mean vector.
url http://advances.utc.sk/index.php/AEEE/article/view/1448
work_keys_str_mv AT peterviszlay improvingtheslovaklvcsrperformancebyclustersensitiveacousticmodelretraining
AT marekecegi improvingtheslovaklvcsrperformancebyclustersensitiveacousticmodelretraining
AT josefjuhar improvingtheslovaklvcsrperformancebyclustersensitiveacousticmodelretraining
_version_ 1716828049659396096