Improving the Slovak LVCSR Performance by Cluster-Sensitive Acoustic Model Retraining
In this paper, we present a cluster-dependent adaptation approach for HMM-based acoustic models. The proposed approach employs clustering techniques to group the original training utterances into clusters with predefined number. The clustered speech data are intended to adapt an initially pre-traine...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
VSB-Technical University of Ostrava
2015-01-01
|
Series: | Advances in Electrical and Electronic Engineering |
Subjects: | |
Online Access: | http://advances.utc.sk/index.php/AEEE/article/view/1448 |
id |
doaj-aa729e7a68514e7792c52b0c39647ac4 |
---|---|
record_format |
Article |
spelling |
doaj-aa729e7a68514e7792c52b0c39647ac42021-10-11T08:03:05ZengVSB-Technical University of OstravaAdvances in Electrical and Electronic Engineering1336-13761804-31192015-01-0113429530210.15598/aeee.v13i4.1448749Improving the Slovak LVCSR Performance by Cluster-Sensitive Acoustic Model RetrainingPeter Viszlay0Marek EcegiJosef JuharTechnical University of KošiceIn this paper, we present a cluster-dependent adaptation approach for HMM-based acoustic models. The proposed approach employs clustering techniques to group the original training utterances into clusters with predefined number. The clustered speech data are intended to adapt an initially pre-trained acoustic model to the specific cluster by reestimation based on the standard Baum-Welch procedure. The resulting model, adapted to the homogeneous data may markedly improve the baseline recognition rate, whereas the model complexity may be reduced. In the recognition step, the test samples are scored by each adapted model and the most accurate one is chosen. The proposed approach is thoroughly evaluated in Slovak triphone-based large vocabulary continuous speech recognition (LVCSR) system. The results prove that the cluster-sensitive retraining leads to significant improvements over the baseline reference system trained according to the conventional training procedure.http://advances.utc.sk/index.php/AEEE/article/view/1448acoustic modeladaptationcluster analysisreestimationweighted mean vector. |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Peter Viszlay Marek Ecegi Josef Juhar |
spellingShingle |
Peter Viszlay Marek Ecegi Josef Juhar Improving the Slovak LVCSR Performance by Cluster-Sensitive Acoustic Model Retraining Advances in Electrical and Electronic Engineering acoustic model adaptation cluster analysis reestimation weighted mean vector. |
author_facet |
Peter Viszlay Marek Ecegi Josef Juhar |
author_sort |
Peter Viszlay |
title |
Improving the Slovak LVCSR Performance by Cluster-Sensitive Acoustic Model Retraining |
title_short |
Improving the Slovak LVCSR Performance by Cluster-Sensitive Acoustic Model Retraining |
title_full |
Improving the Slovak LVCSR Performance by Cluster-Sensitive Acoustic Model Retraining |
title_fullStr |
Improving the Slovak LVCSR Performance by Cluster-Sensitive Acoustic Model Retraining |
title_full_unstemmed |
Improving the Slovak LVCSR Performance by Cluster-Sensitive Acoustic Model Retraining |
title_sort |
improving the slovak lvcsr performance by cluster-sensitive acoustic model retraining |
publisher |
VSB-Technical University of Ostrava |
series |
Advances in Electrical and Electronic Engineering |
issn |
1336-1376 1804-3119 |
publishDate |
2015-01-01 |
description |
In this paper, we present a cluster-dependent adaptation approach for HMM-based acoustic models. The proposed approach employs clustering techniques to group the original training utterances into clusters with predefined number. The clustered speech data are intended to adapt an initially pre-trained acoustic model to the specific cluster by reestimation based on the standard Baum-Welch procedure. The resulting model, adapted to the homogeneous data may markedly improve the baseline recognition rate, whereas the model complexity may be reduced. In the recognition step, the test samples are scored by each adapted model and the most accurate one is chosen. The proposed approach is thoroughly evaluated in Slovak triphone-based large vocabulary continuous speech recognition (LVCSR) system. The results prove that the cluster-sensitive retraining leads to significant improvements over the baseline reference system trained according to the conventional training procedure. |
topic |
acoustic model adaptation cluster analysis reestimation weighted mean vector. |
url |
http://advances.utc.sk/index.php/AEEE/article/view/1448 |
work_keys_str_mv |
AT peterviszlay improvingtheslovaklvcsrperformancebyclustersensitiveacousticmodelretraining AT marekecegi improvingtheslovaklvcsrperformancebyclustersensitiveacousticmodelretraining AT josefjuhar improvingtheslovaklvcsrperformancebyclustersensitiveacousticmodelretraining |
_version_ |
1716828049659396096 |