Rule based learning of word pronunciations from training corpora

Thesis (M.Eng. and S.B.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 1998. === Includes bibliographical references (leaves 83-85). === This paper describes a text-to-pronunciation system using transformation-based error-driven learning for speech-rec...

Full description

Bibliographic Details
Main Author:	Molnár, Lajos, 1975-
Other Authors:	Christopher M. Schmandt.
Format:	Others
Language:	English
Published:	Massachusetts Institute of Technology 2009
Subjects:	Electrical Engineering and Computer Science.
Online Access:	http://hdl.handle.net/1721.1/47906

id	ndltd-MIT-oai-dspace.mit.edu-1721.1-47906
record_format	oai_dc
spelling	ndltd-MIT-oai-dspace.mit.edu-1721.1-479062019-05-02T16:03:15Z Rule based learning of word pronunciations from training corpora Molnár, Lajos, 1975- Christopher M. Schmandt. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis (M.Eng. and S.B.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 1998. Includes bibliographical references (leaves 83-85). This paper describes a text-to-pronunciation system using transformation-based error-driven learning for speech-recognition purposes. Efforts have been made to make the system language independent, automatic, robust and able to generate multiple pronunciations. The learner proposes initial pronunciations for the words and finds transformations that bring the pronunciations closer to the correct pronunciations. The pronunciation generator works by applying the transformations to a similar initial pronunciation. A dynamic aligner is used for the necessary alignment of phonemes and graphemes. The pronunciations are scored using a weighed string edit distance. Optimizations were made to make the learner and the rule applier fast. The system achieves 73.9% exact word accuracy with multiple pronunciations, 82.3% word accuracy with one correct pronunciation, and 95.3% phoneme accuracy for English words. For proper names, it achieves 50.5% exact word accuracy, 69.2% word accuracy, and 92.0% phoneme accuracy, which outperforms the compared neural network approach. Lajos Molnár. M.Eng.and S.B. 2009-10-01T16:01:42Z 2009-10-01T16:01:42Z 1998 1998 Thesis http://hdl.handle.net/1721.1/47906 48205509 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 85 leaves application/pdf Massachusetts Institute of Technology
collection	NDLTD
language	English
format	Others
sources	NDLTD
topic	Electrical Engineering and Computer Science.
spellingShingle	Electrical Engineering and Computer Science. Molnár, Lajos, 1975- Rule based learning of word pronunciations from training corpora
description	Thesis (M.Eng. and S.B.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 1998. === Includes bibliographical references (leaves 83-85). === This paper describes a text-to-pronunciation system using transformation-based error-driven learning for speech-recognition purposes. Efforts have been made to make the system language independent, automatic, robust and able to generate multiple pronunciations. The learner proposes initial pronunciations for the words and finds transformations that bring the pronunciations closer to the correct pronunciations. The pronunciation generator works by applying the transformations to a similar initial pronunciation. A dynamic aligner is used for the necessary alignment of phonemes and graphemes. The pronunciations are scored using a weighed string edit distance. Optimizations were made to make the learner and the rule applier fast. The system achieves 73.9% exact word accuracy with multiple pronunciations, 82.3% word accuracy with one correct pronunciation, and 95.3% phoneme accuracy for English words. For proper names, it achieves 50.5% exact word accuracy, 69.2% word accuracy, and 92.0% phoneme accuracy, which outperforms the compared neural network approach. === Lajos Molnár. === M.Eng.and S.B.
author2	Christopher M. Schmandt.
author_facet	Christopher M. Schmandt. Molnár, Lajos, 1975-
author	Molnár, Lajos, 1975-
author_sort	Molnár, Lajos, 1975-
title	Rule based learning of word pronunciations from training corpora
title_short	Rule based learning of word pronunciations from training corpora
title_full	Rule based learning of word pronunciations from training corpora
title_fullStr	Rule based learning of word pronunciations from training corpora
title_full_unstemmed	Rule based learning of word pronunciations from training corpora
title_sort	rule based learning of word pronunciations from training corpora
publisher	Massachusetts Institute of Technology
publishDate	2009
url	http://hdl.handle.net/1721.1/47906
work_keys_str_mv	AT molnarlajos1975 rulebasedlearningofwordpronunciationsfromtrainingcorpora
_version_	1719033916337160192

Rule based learning of word pronunciations from training corpora

Similar Items