Voice-to-Phoneme Conversion Algorithms for Voice-Tag Applications in Embedded Platforms

<p/> <p>We describe two voice-to-phoneme conversion algorithms for speaker-independent voice-tag creation specifically targeted at applications on embedded platforms. These algorithms (<it>batch mode</it> and <it>sequential</it>) are compared in speech recognition...

Full description

Bibliographic Details
Main Authors:	Cheng YanMing, Ma Changxue, Melnar Lynette
Format:	Article
Language:	English
Published:	SpringerOpen 2008-01-01
Series:	EURASIP Journal on Audio, Speech, and Music Processing
Online Access:	http://asmp.eurasipjournals.com/content/2008/568737

Description
Summary:	<p/> <p>We describe two voice-to-phoneme conversion algorithms for speaker-independent voice-tag creation specifically targeted at applications on embedded platforms. These algorithms (<it>batch mode</it> and <it>sequential</it>) are compared in speech recognition experiments where they are first applied in a same-language context in which both acoustic model training and voice-tag creation and application are performed on the same language. Then, their performance is tested in a cross-language setting where the acoustic models are trained on a particular source language while the voice-tags are created and applied on a different target language. In the same-language environment, both algorithms either perform comparably to or significantly better than the baseline where utterances are manually transcribed by a phonetician. In the cross-language context, the voice-tag performances vary depending on the source-target language pair, with the variation reflecting predicted phonological similarity between the source and target languages. Among the most similar languages, performance nears that of the native-trained models and surpasses the native reference baseline.</p>
ISSN:	1687-4714 1687-4722

Voice-to-Phoneme Conversion Algorithms for Voice-Tag Applications in Embedded Platforms

Similar Items