Compact Acoustic Models for Embedded Speech Recognition

<p/> <p>Speech recognition applications are known to require a significant amount of resources. However, embedded speech recognition only authorizes few KB of memory, few MIPS, and small amount of training data. In order to fit the resource constraints of embedded applications, an approa...

Full description

Bibliographic Details
Main Authors: L&#233;vy Christophe, Linar&#232;s Georges, Bonastre Jean-Fran&#231;ois
Format: Article
Language:English
Published: SpringerOpen 2009-01-01
Series:EURASIP Journal on Audio, Speech, and Music Processing
Online Access:http://asmp.eurasipjournals.com/content/2009/806186
id doaj-0b329273063b482fa6887a2d793c59bd
record_format Article
spelling doaj-0b329273063b482fa6887a2d793c59bd2020-11-25T02:23:49ZengSpringerOpenEURASIP Journal on Audio, Speech, and Music Processing1687-47141687-47222009-01-0120091806186Compact Acoustic Models for Embedded Speech RecognitionL&#233;vy ChristopheLinar&#232;s GeorgesBonastre Jean-Fran&#231;ois<p/> <p>Speech recognition applications are known to require a significant amount of resources. However, embedded speech recognition only authorizes few KB of memory, few MIPS, and small amount of training data. In order to fit the resource constraints of embedded applications, an approach based on a semicontinuous HMM system using state-independent acoustic modelling is proposed. A transformation is computed and applied to the global model in order to obtain each HMM state-dependent probability density functions, authorizing to store only the transformation parameters. This approach is evaluated on two tasks: digit and voice-command recognition. A fast adaptation technique of acoustic models is also proposed. In order to significantly reduce computational costs, the adaptation is performed only on the global model (using related speaker recognition adaptation techniques) with no need for state-dependent data. The whole approach results in a relative gain of more than 20% compared to a basic HMM-based system fitting the constraints.</p>http://asmp.eurasipjournals.com/content/2009/806186
collection DOAJ
language English
format Article
sources DOAJ
author L&#233;vy Christophe
Linar&#232;s Georges
Bonastre Jean-Fran&#231;ois
spellingShingle L&#233;vy Christophe
Linar&#232;s Georges
Bonastre Jean-Fran&#231;ois
Compact Acoustic Models for Embedded Speech Recognition
EURASIP Journal on Audio, Speech, and Music Processing
author_facet L&#233;vy Christophe
Linar&#232;s Georges
Bonastre Jean-Fran&#231;ois
author_sort L&#233;vy Christophe
title Compact Acoustic Models for Embedded Speech Recognition
title_short Compact Acoustic Models for Embedded Speech Recognition
title_full Compact Acoustic Models for Embedded Speech Recognition
title_fullStr Compact Acoustic Models for Embedded Speech Recognition
title_full_unstemmed Compact Acoustic Models for Embedded Speech Recognition
title_sort compact acoustic models for embedded speech recognition
publisher SpringerOpen
series EURASIP Journal on Audio, Speech, and Music Processing
issn 1687-4714
1687-4722
publishDate 2009-01-01
description <p/> <p>Speech recognition applications are known to require a significant amount of resources. However, embedded speech recognition only authorizes few KB of memory, few MIPS, and small amount of training data. In order to fit the resource constraints of embedded applications, an approach based on a semicontinuous HMM system using state-independent acoustic modelling is proposed. A transformation is computed and applied to the global model in order to obtain each HMM state-dependent probability density functions, authorizing to store only the transformation parameters. This approach is evaluated on two tasks: digit and voice-command recognition. A fast adaptation technique of acoustic models is also proposed. In order to significantly reduce computational costs, the adaptation is performed only on the global model (using related speaker recognition adaptation techniques) with no need for state-dependent data. The whole approach results in a relative gain of more than 20% compared to a basic HMM-based system fitting the constraints.</p>
url http://asmp.eurasipjournals.com/content/2009/806186
work_keys_str_mv AT l233vychristophe compactacousticmodelsforembeddedspeechrecognition
AT linar232sgeorges compactacousticmodelsforembeddedspeechrecognition
AT bonastrejeanfran231ois compactacousticmodelsforembeddedspeechrecognition
_version_ 1724857058859155456