SIMULATED-DATA ADAPTATION BASED PIECEWISE LINEAR TRANSFORMATION FOR ROBUST SPEECH RECOGNITION

This paper proposes an efficient method of simulated-data adaptation for robust speech recognition. The method is applied to tree-structured piecewise linear transformation (PLT). The original PLT selects an acoustic model using tree-structured HMMs and the acoustic model is adapted by input speech...

Full description

Bibliographic Details
Main Authors: Nattanun Thatphithakkul, Boontee Kruatrachue, Chai Wutiwiwatchai, Sanparith Marukatat, Vataya Boonpiam
Format: Article
Language:English
Published: Universitas Gadjah Mada 2017-11-01
Series:ASEAN Journal on Science and Technology for Development
Subjects:
Online Access:http://www.ajstd.org/index.php/ajstd/article/view/209
id doaj-3ab89abf66b8405fa81433ba0736bb4e
record_format Article
spelling doaj-3ab89abf66b8405fa81433ba0736bb4e2020-11-24T23:49:23ZengUniversitas Gadjah MadaASEAN Journal on Science and Technology for Development0217-54602224-90282017-11-0124433935210.29037/ajstd.209204SIMULATED-DATA ADAPTATION BASED PIECEWISE LINEAR TRANSFORMATION FOR ROBUST SPEECH RECOGNITIONNattanun Thatphithakkul0Boontee Kruatrachue1Chai Wutiwiwatchai2Sanparith Marukatat3Vataya Boonpiam4King Mongkut’s Institute of Technology Ladkrabang, Bangkok, 10520King Mongkut’s Institute of Technology Ladkrabang, Bangkok, 10520National Electronics and Computer Technology Center, Bangkok, 12120National Electronics and Computer Technology Center, Bangkok, 12120National Electronics and Computer Technology Center, Bangkok, 12120This paper proposes an efficient method of simulated-data adaptation for robust speech recognition. The method is applied to tree-structured piecewise linear transformation (PLT). The original PLT selects an acoustic model using tree-structured HMMs and the acoustic model is adapted by input speech in an unsupervised scheme. This adaptation can degrade the acoustic model if the input speech is incorrectly transcribed during the adaptation process. Moreover, adaptation may not be effective if only the input speech is used. Our proposed method increases the size of adaptation data by adding noise portions from the input speech to a set of prerecorded clean speech, of which correct transcriptions are known. We investigate various configurations of the proposed method. Evaluations are performed with both additive and real noisy speech. The experimental results show that the proposed system reaches higher recognition rate than MLLR, HMM-based model selection and PLT.http://www.ajstd.org/index.php/ajstd/article/view/209Robust speech recognitionpiecewise linear transformationsimulated-data adaptation
collection DOAJ
language English
format Article
sources DOAJ
author Nattanun Thatphithakkul
Boontee Kruatrachue
Chai Wutiwiwatchai
Sanparith Marukatat
Vataya Boonpiam
spellingShingle Nattanun Thatphithakkul
Boontee Kruatrachue
Chai Wutiwiwatchai
Sanparith Marukatat
Vataya Boonpiam
SIMULATED-DATA ADAPTATION BASED PIECEWISE LINEAR TRANSFORMATION FOR ROBUST SPEECH RECOGNITION
ASEAN Journal on Science and Technology for Development
Robust speech recognition
piecewise linear transformation
simulated-data adaptation
author_facet Nattanun Thatphithakkul
Boontee Kruatrachue
Chai Wutiwiwatchai
Sanparith Marukatat
Vataya Boonpiam
author_sort Nattanun Thatphithakkul
title SIMULATED-DATA ADAPTATION BASED PIECEWISE LINEAR TRANSFORMATION FOR ROBUST SPEECH RECOGNITION
title_short SIMULATED-DATA ADAPTATION BASED PIECEWISE LINEAR TRANSFORMATION FOR ROBUST SPEECH RECOGNITION
title_full SIMULATED-DATA ADAPTATION BASED PIECEWISE LINEAR TRANSFORMATION FOR ROBUST SPEECH RECOGNITION
title_fullStr SIMULATED-DATA ADAPTATION BASED PIECEWISE LINEAR TRANSFORMATION FOR ROBUST SPEECH RECOGNITION
title_full_unstemmed SIMULATED-DATA ADAPTATION BASED PIECEWISE LINEAR TRANSFORMATION FOR ROBUST SPEECH RECOGNITION
title_sort simulated-data adaptation based piecewise linear transformation for robust speech recognition
publisher Universitas Gadjah Mada
series ASEAN Journal on Science and Technology for Development
issn 0217-5460
2224-9028
publishDate 2017-11-01
description This paper proposes an efficient method of simulated-data adaptation for robust speech recognition. The method is applied to tree-structured piecewise linear transformation (PLT). The original PLT selects an acoustic model using tree-structured HMMs and the acoustic model is adapted by input speech in an unsupervised scheme. This adaptation can degrade the acoustic model if the input speech is incorrectly transcribed during the adaptation process. Moreover, adaptation may not be effective if only the input speech is used. Our proposed method increases the size of adaptation data by adding noise portions from the input speech to a set of prerecorded clean speech, of which correct transcriptions are known. We investigate various configurations of the proposed method. Evaluations are performed with both additive and real noisy speech. The experimental results show that the proposed system reaches higher recognition rate than MLLR, HMM-based model selection and PLT.
topic Robust speech recognition
piecewise linear transformation
simulated-data adaptation
url http://www.ajstd.org/index.php/ajstd/article/view/209
work_keys_str_mv AT nattanunthatphithakkul simulateddataadaptationbasedpiecewiselineartransformationforrobustspeechrecognition
AT boonteekruatrachue simulateddataadaptationbasedpiecewiselineartransformationforrobustspeechrecognition
AT chaiwutiwiwatchai simulateddataadaptationbasedpiecewiselineartransformationforrobustspeechrecognition
AT sanparithmarukatat simulateddataadaptationbasedpiecewiselineartransformationforrobustspeechrecognition
AT vatayaboonpiam simulateddataadaptationbasedpiecewiselineartransformationforrobustspeechrecognition
_version_ 1725482515250020352