Experiments on Automatic Recognition of Nonnative Arabic Speech

<p/> <p>The automatic recognition of foreign-accented Arabic speech is a challenging task since it involves a large number of nonnative accents. As well, the nonnative speech data available for training are generally insufficient. Moreover, as compared to other languages, the Arabic lang...

Full description

Bibliographic Details
Main Authors: Selouani Sid-Ahmed, O&apos;Shaughnessy Douglas, Alotaibi YousefAjami
Format: Article
Language:English
Published: SpringerOpen 2008-01-01
Series:EURASIP Journal on Audio, Speech, and Music Processing
Online Access:http://asmp.eurasipjournals.com/content/2008/679831
id doaj-73903594d0cd42848769deea889ca14b
record_format Article
spelling doaj-73903594d0cd42848769deea889ca14b2020-11-25T00:29:31ZengSpringerOpenEURASIP Journal on Audio, Speech, and Music Processing1687-47141687-47222008-01-0120081679831Experiments on Automatic Recognition of Nonnative Arabic SpeechSelouani Sid-AhmedO&apos;Shaughnessy DouglasAlotaibi YousefAjami<p/> <p>The automatic recognition of foreign-accented Arabic speech is a challenging task since it involves a large number of nonnative accents. As well, the nonnative speech data available for training are generally insufficient. Moreover, as compared to other languages, the Arabic language has sparked a relatively small number of research efforts. In this paper, we are concerned with the problem of nonnative speech in a speaker independent, large-vocabulary speech recognition system for modern standard Arabic (MSA). We analyze some major differences at the phonetic level in order to determine which phonemes have a significant part in the recognition performance for both native and nonnative speakers. Special attention is given to specific Arabic phonemes. The performance of an HMM-based Arabic speech recognition system is analyzed with respect to speaker gender and its native origin. The WestPoint modern standard Arabic database from the language data consortium (LDC) and the hidden Markov Model Toolkit (HTK) are used throughout all experiments. Our study shows that the best performance in the overall phoneme recognition is obtained when nonnative speakers are involved in both training and testing phases. This is not the case when a language model and phonetic lattice networks are incorporated in the system. At the phonetic level, the results show that female nonnative speakers perform better than nonnative male speakers, and that emphatic phonemes yield a significant decrease in performance when they are uttered by both male and female nonnative speakers.</p>http://asmp.eurasipjournals.com/content/2008/679831
collection DOAJ
language English
format Article
sources DOAJ
author Selouani Sid-Ahmed
O&apos;Shaughnessy Douglas
Alotaibi YousefAjami
spellingShingle Selouani Sid-Ahmed
O&apos;Shaughnessy Douglas
Alotaibi YousefAjami
Experiments on Automatic Recognition of Nonnative Arabic Speech
EURASIP Journal on Audio, Speech, and Music Processing
author_facet Selouani Sid-Ahmed
O&apos;Shaughnessy Douglas
Alotaibi YousefAjami
author_sort Selouani Sid-Ahmed
title Experiments on Automatic Recognition of Nonnative Arabic Speech
title_short Experiments on Automatic Recognition of Nonnative Arabic Speech
title_full Experiments on Automatic Recognition of Nonnative Arabic Speech
title_fullStr Experiments on Automatic Recognition of Nonnative Arabic Speech
title_full_unstemmed Experiments on Automatic Recognition of Nonnative Arabic Speech
title_sort experiments on automatic recognition of nonnative arabic speech
publisher SpringerOpen
series EURASIP Journal on Audio, Speech, and Music Processing
issn 1687-4714
1687-4722
publishDate 2008-01-01
description <p/> <p>The automatic recognition of foreign-accented Arabic speech is a challenging task since it involves a large number of nonnative accents. As well, the nonnative speech data available for training are generally insufficient. Moreover, as compared to other languages, the Arabic language has sparked a relatively small number of research efforts. In this paper, we are concerned with the problem of nonnative speech in a speaker independent, large-vocabulary speech recognition system for modern standard Arabic (MSA). We analyze some major differences at the phonetic level in order to determine which phonemes have a significant part in the recognition performance for both native and nonnative speakers. Special attention is given to specific Arabic phonemes. The performance of an HMM-based Arabic speech recognition system is analyzed with respect to speaker gender and its native origin. The WestPoint modern standard Arabic database from the language data consortium (LDC) and the hidden Markov Model Toolkit (HTK) are used throughout all experiments. Our study shows that the best performance in the overall phoneme recognition is obtained when nonnative speakers are involved in both training and testing phases. This is not the case when a language model and phonetic lattice networks are incorporated in the system. At the phonetic level, the results show that female nonnative speakers perform better than nonnative male speakers, and that emphatic phonemes yield a significant decrease in performance when they are uttered by both male and female nonnative speakers.</p>
url http://asmp.eurasipjournals.com/content/2008/679831
work_keys_str_mv AT selouanisidahmed experimentsonautomaticrecognitionofnonnativearabicspeech
AT oaposshaughnessydouglas experimentsonautomaticrecognitionofnonnativearabicspeech
AT alotaibiyousefajami experimentsonautomaticrecognitionofnonnativearabicspeech
_version_ 1725330736116924416