SIMULATED-DATA ADAPTATION BASED PIECEWISE LINEAR TRANSFORMATION FOR ROBUST SPEECH RECOGNITION
This paper proposes an efficient method of simulated-data adaptation for robust speech recognition. The method is applied to tree-structured piecewise linear transformation (PLT). The original PLT selects an acoustic model using tree-structured HMMs and the acoustic model is adapted by input speech...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Universitas Gadjah Mada
2017-11-01
|
Series: | ASEAN Journal on Science and Technology for Development |
Subjects: | |
Online Access: | http://www.ajstd.org/index.php/ajstd/article/view/209 |
id |
doaj-3ab89abf66b8405fa81433ba0736bb4e |
---|---|
record_format |
Article |
spelling |
doaj-3ab89abf66b8405fa81433ba0736bb4e2020-11-24T23:49:23ZengUniversitas Gadjah MadaASEAN Journal on Science and Technology for Development0217-54602224-90282017-11-0124433935210.29037/ajstd.209204SIMULATED-DATA ADAPTATION BASED PIECEWISE LINEAR TRANSFORMATION FOR ROBUST SPEECH RECOGNITIONNattanun Thatphithakkul0Boontee Kruatrachue1Chai Wutiwiwatchai2Sanparith Marukatat3Vataya Boonpiam4King Mongkut’s Institute of Technology Ladkrabang, Bangkok, 10520King Mongkut’s Institute of Technology Ladkrabang, Bangkok, 10520National Electronics and Computer Technology Center, Bangkok, 12120National Electronics and Computer Technology Center, Bangkok, 12120National Electronics and Computer Technology Center, Bangkok, 12120This paper proposes an efficient method of simulated-data adaptation for robust speech recognition. The method is applied to tree-structured piecewise linear transformation (PLT). The original PLT selects an acoustic model using tree-structured HMMs and the acoustic model is adapted by input speech in an unsupervised scheme. This adaptation can degrade the acoustic model if the input speech is incorrectly transcribed during the adaptation process. Moreover, adaptation may not be effective if only the input speech is used. Our proposed method increases the size of adaptation data by adding noise portions from the input speech to a set of prerecorded clean speech, of which correct transcriptions are known. We investigate various configurations of the proposed method. Evaluations are performed with both additive and real noisy speech. The experimental results show that the proposed system reaches higher recognition rate than MLLR, HMM-based model selection and PLT.http://www.ajstd.org/index.php/ajstd/article/view/209Robust speech recognitionpiecewise linear transformationsimulated-data adaptation |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Nattanun Thatphithakkul Boontee Kruatrachue Chai Wutiwiwatchai Sanparith Marukatat Vataya Boonpiam |
spellingShingle |
Nattanun Thatphithakkul Boontee Kruatrachue Chai Wutiwiwatchai Sanparith Marukatat Vataya Boonpiam SIMULATED-DATA ADAPTATION BASED PIECEWISE LINEAR TRANSFORMATION FOR ROBUST SPEECH RECOGNITION ASEAN Journal on Science and Technology for Development Robust speech recognition piecewise linear transformation simulated-data adaptation |
author_facet |
Nattanun Thatphithakkul Boontee Kruatrachue Chai Wutiwiwatchai Sanparith Marukatat Vataya Boonpiam |
author_sort |
Nattanun Thatphithakkul |
title |
SIMULATED-DATA ADAPTATION BASED PIECEWISE LINEAR TRANSFORMATION FOR ROBUST SPEECH RECOGNITION |
title_short |
SIMULATED-DATA ADAPTATION BASED PIECEWISE LINEAR TRANSFORMATION FOR ROBUST SPEECH RECOGNITION |
title_full |
SIMULATED-DATA ADAPTATION BASED PIECEWISE LINEAR TRANSFORMATION FOR ROBUST SPEECH RECOGNITION |
title_fullStr |
SIMULATED-DATA ADAPTATION BASED PIECEWISE LINEAR TRANSFORMATION FOR ROBUST SPEECH RECOGNITION |
title_full_unstemmed |
SIMULATED-DATA ADAPTATION BASED PIECEWISE LINEAR TRANSFORMATION FOR ROBUST SPEECH RECOGNITION |
title_sort |
simulated-data adaptation based piecewise linear transformation for robust speech recognition |
publisher |
Universitas Gadjah Mada |
series |
ASEAN Journal on Science and Technology for Development |
issn |
0217-5460 2224-9028 |
publishDate |
2017-11-01 |
description |
This paper proposes an efficient method of simulated-data adaptation for robust speech recognition. The method is applied to tree-structured piecewise linear transformation (PLT). The original PLT selects an acoustic model using tree-structured HMMs and the acoustic model is adapted by input speech in an unsupervised scheme. This adaptation can degrade the acoustic model if the input speech is incorrectly transcribed during the adaptation process. Moreover, adaptation may not be effective if only the input speech is used. Our proposed method increases the size of adaptation data by adding noise portions from the input speech to a set of prerecorded clean speech, of which correct transcriptions are known. We investigate various configurations of the proposed method. Evaluations are performed with both additive and real noisy speech. The experimental results show that the proposed system reaches higher recognition rate than MLLR, HMM-based model selection and PLT. |
topic |
Robust speech recognition piecewise linear transformation simulated-data adaptation |
url |
http://www.ajstd.org/index.php/ajstd/article/view/209 |
work_keys_str_mv |
AT nattanunthatphithakkul simulateddataadaptationbasedpiecewiselineartransformationforrobustspeechrecognition AT boonteekruatrachue simulateddataadaptationbasedpiecewiselineartransformationforrobustspeechrecognition AT chaiwutiwiwatchai simulateddataadaptationbasedpiecewiselineartransformationforrobustspeechrecognition AT sanparithmarukatat simulateddataadaptationbasedpiecewiselineartransformationforrobustspeechrecognition AT vatayaboonpiam simulateddataadaptationbasedpiecewiselineartransformationforrobustspeechrecognition |
_version_ |
1725482515250020352 |