Arabic Speech Classification Method Based on Padding and Deep Learning Neural Network

Deep learning convolution neural network has been widely used to recognize or classify voice. Various techniques have been used together with convolution neural network to prepare voice data before the training process in developing the classification model. However, not all model can produce good...

Full description

Bibliographic Details
Main Authors: Asroni Asroni, Ku Ruhana Ku-Mahamud, Cahya Damarjati, Hasan Basri Slamat
Format: Article
Language:Arabic
Published: College of Science for Women, University of Baghdad 2021-06-01
Series:Baghdad Science Journal
Subjects:
Online Access:https://bsj.uobaghdad.edu.iq/index.php/BSJ/article/view/6213
id doaj-406a7117fbd64c5c807e8ad8c2d91e5e
record_format Article
spelling doaj-406a7117fbd64c5c807e8ad8c2d91e5e2021-06-20T15:51:20ZaraCollege of Science for Women, University of BaghdadBaghdad Science Journal2078-86652411-79862021-06-01182(Suppl.)10.21123/bsj.2021.18.2(Suppl.).0925Arabic Speech Classification Method Based on Padding and Deep Learning Neural NetworkAsroni Asroni0Ku Ruhana Ku-Mahamud 1Cahya Damarjati2Hasan Basri Slamat3Universitas Muhammadiyah Yogyakarta, IndonesiaUniversiti Utara MalaysiaUniversitas Muhammadiyah Yogyakarta, Indonesia. Universitas Muhammadiyah Yogyakarta, Indonesia Deep learning convolution neural network has been widely used to recognize or classify voice. Various techniques have been used together with convolution neural network to prepare voice data before the training process in developing the classification model. However, not all model can produce good classification accuracy as there are many types of voice or speech. Classification of Arabic alphabet pronunciation is a one of the types of voice and accurate pronunciation is required in the learning of the Qur’an reading. Thus, the technique to process the pronunciation and training of the processed data requires specific approach. To overcome this issue, a method based on padding and deep learning convolution neural network is proposed to evaluate the pronunciation of the Arabic alphabet. Voice data from six school children are recorded and used to test the performance of the proposed method. The padding technique has been used to augment the voice data before feeding the data to the CNN structure to developed the classification model. In addition, three other feature extraction techniques have been introduced to enable the comparison of the proposed method which employs padding technique. The performance of the proposed method with padding technique is at par with the spectrogram but better than mel-spectrogram and mel-frequency cepstral coefficients. Results also show that the proposed method was able to distinguish the Arabic alphabets that are difficult to pronounce. The proposed method with padding technique may be extended to address other voice pronunciation ability other than the Arabic alphabets. https://bsj.uobaghdad.edu.iq/index.php/BSJ/article/view/6213Arabic alphabetdeep learningspeech classificationCOVID-19spectrogram
collection DOAJ
language Arabic
format Article
sources DOAJ
author Asroni Asroni
Ku Ruhana Ku-Mahamud
Cahya Damarjati
Hasan Basri Slamat
spellingShingle Asroni Asroni
Ku Ruhana Ku-Mahamud
Cahya Damarjati
Hasan Basri Slamat
Arabic Speech Classification Method Based on Padding and Deep Learning Neural Network
Baghdad Science Journal
Arabic alphabet
deep learning
speech classification
COVID-19
spectrogram
author_facet Asroni Asroni
Ku Ruhana Ku-Mahamud
Cahya Damarjati
Hasan Basri Slamat
author_sort Asroni Asroni
title Arabic Speech Classification Method Based on Padding and Deep Learning Neural Network
title_short Arabic Speech Classification Method Based on Padding and Deep Learning Neural Network
title_full Arabic Speech Classification Method Based on Padding and Deep Learning Neural Network
title_fullStr Arabic Speech Classification Method Based on Padding and Deep Learning Neural Network
title_full_unstemmed Arabic Speech Classification Method Based on Padding and Deep Learning Neural Network
title_sort arabic speech classification method based on padding and deep learning neural network
publisher College of Science for Women, University of Baghdad
series Baghdad Science Journal
issn 2078-8665
2411-7986
publishDate 2021-06-01
description Deep learning convolution neural network has been widely used to recognize or classify voice. Various techniques have been used together with convolution neural network to prepare voice data before the training process in developing the classification model. However, not all model can produce good classification accuracy as there are many types of voice or speech. Classification of Arabic alphabet pronunciation is a one of the types of voice and accurate pronunciation is required in the learning of the Qur’an reading. Thus, the technique to process the pronunciation and training of the processed data requires specific approach. To overcome this issue, a method based on padding and deep learning convolution neural network is proposed to evaluate the pronunciation of the Arabic alphabet. Voice data from six school children are recorded and used to test the performance of the proposed method. The padding technique has been used to augment the voice data before feeding the data to the CNN structure to developed the classification model. In addition, three other feature extraction techniques have been introduced to enable the comparison of the proposed method which employs padding technique. The performance of the proposed method with padding technique is at par with the spectrogram but better than mel-spectrogram and mel-frequency cepstral coefficients. Results also show that the proposed method was able to distinguish the Arabic alphabets that are difficult to pronounce. The proposed method with padding technique may be extended to address other voice pronunciation ability other than the Arabic alphabets.
topic Arabic alphabet
deep learning
speech classification
COVID-19
spectrogram
url https://bsj.uobaghdad.edu.iq/index.php/BSJ/article/view/6213
work_keys_str_mv AT asroniasroni arabicspeechclassificationmethodbasedonpaddinganddeeplearningneuralnetwork
AT kuruhanakumahamud arabicspeechclassificationmethodbasedonpaddinganddeeplearningneuralnetwork
AT cahyadamarjati arabicspeechclassificationmethodbasedonpaddinganddeeplearningneuralnetwork
AT hasanbasrislamat arabicspeechclassificationmethodbasedonpaddinganddeeplearningneuralnetwork
_version_ 1721369473626144768