DGR: Gender Recognition of Human Speech Using One-Dimensional Conventional Neural Network

The speech entailed in human voice comprises essentially paralinguistic information used in many voice-recognition applications. Gender voice is considered one of the pivotal parts to be detected from a given voice, a task that involves certain complications. In order to distinguish gender from a vo...

Full description

Bibliographic Details
Main Author: Rami S. Alkhawaldeh
Format: Article
Language:English
Published: Hindawi Limited 2019-01-01
Series:Scientific Programming
Online Access:http://dx.doi.org/10.1155/2019/7213717
id doaj-316af90abb4f4afe91af7b163cbe0681
record_format Article
spelling doaj-316af90abb4f4afe91af7b163cbe06812021-07-02T05:17:29ZengHindawi LimitedScientific Programming1058-92441875-919X2019-01-01201910.1155/2019/72137177213717DGR: Gender Recognition of Human Speech Using One-Dimensional Conventional Neural NetworkRami S. Alkhawaldeh0Department of Computer Information Systems, The University of Jordan, Aqaba 77110, JordanThe speech entailed in human voice comprises essentially paralinguistic information used in many voice-recognition applications. Gender voice is considered one of the pivotal parts to be detected from a given voice, a task that involves certain complications. In order to distinguish gender from a voice signal, a set of techniques have been employed to determine relevant features to be utilized for building a model from a training set. This model is useful for determining the gender (i.e., male or female) from a voice signal. The contributions are three-fold including (i) providing analysis information about well-known voice signal features using a prominent dataset, (ii) studying various machine learning models of different theoretical families to classify the voice gender, and (iii) using three prominent feature selection algorithms to find promisingly optimal features for improving classification models. The experimental results show the importance of subfeatures over others, which are vital for enhancing the efficiency of classification models’ performance. Experimentation reveals that the best recall value is equal to 99.97%; the best recall value is 99.7% for two models of deep learning (DL) and support vector machine (SVM), and with feature selection, the best recall value is 100% for SVM techniques.http://dx.doi.org/10.1155/2019/7213717
collection DOAJ
language English
format Article
sources DOAJ
author Rami S. Alkhawaldeh
spellingShingle Rami S. Alkhawaldeh
DGR: Gender Recognition of Human Speech Using One-Dimensional Conventional Neural Network
Scientific Programming
author_facet Rami S. Alkhawaldeh
author_sort Rami S. Alkhawaldeh
title DGR: Gender Recognition of Human Speech Using One-Dimensional Conventional Neural Network
title_short DGR: Gender Recognition of Human Speech Using One-Dimensional Conventional Neural Network
title_full DGR: Gender Recognition of Human Speech Using One-Dimensional Conventional Neural Network
title_fullStr DGR: Gender Recognition of Human Speech Using One-Dimensional Conventional Neural Network
title_full_unstemmed DGR: Gender Recognition of Human Speech Using One-Dimensional Conventional Neural Network
title_sort dgr: gender recognition of human speech using one-dimensional conventional neural network
publisher Hindawi Limited
series Scientific Programming
issn 1058-9244
1875-919X
publishDate 2019-01-01
description The speech entailed in human voice comprises essentially paralinguistic information used in many voice-recognition applications. Gender voice is considered one of the pivotal parts to be detected from a given voice, a task that involves certain complications. In order to distinguish gender from a voice signal, a set of techniques have been employed to determine relevant features to be utilized for building a model from a training set. This model is useful for determining the gender (i.e., male or female) from a voice signal. The contributions are three-fold including (i) providing analysis information about well-known voice signal features using a prominent dataset, (ii) studying various machine learning models of different theoretical families to classify the voice gender, and (iii) using three prominent feature selection algorithms to find promisingly optimal features for improving classification models. The experimental results show the importance of subfeatures over others, which are vital for enhancing the efficiency of classification models’ performance. Experimentation reveals that the best recall value is equal to 99.97%; the best recall value is 99.7% for two models of deep learning (DL) and support vector machine (SVM), and with feature selection, the best recall value is 100% for SVM techniques.
url http://dx.doi.org/10.1155/2019/7213717
work_keys_str_mv AT ramisalkhawaldeh dgrgenderrecognitionofhumanspeechusingonedimensionalconventionalneuralnetwork
_version_ 1721338841676120064