Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals

The purpose of this study was to investigate the feasibility of using neck-surface acceleration signals to discriminate between modal, breathy and pressed voice. Voice data for five English single vowels were collected from 31 female native Canadian English speakers using a portable Neck Surface Acc...

Full description

Bibliographic Details
Main Authors: Zhengdong Lei, Evan Kennedy, Laura Fasanella, Nicole Yee-Key Li-Jessen, Luc Mongeau
Format: Article
Language:English
Published: MDPI AG 2019-04-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/9/7/1505
id doaj-3dfb699bc372457a93fbfe5589f415cd
record_format Article
spelling doaj-3dfb699bc372457a93fbfe5589f415cd2020-11-24T22:19:07ZengMDPI AGApplied Sciences2076-34172019-04-0197150510.3390/app9071505app9071505Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration SignalsZhengdong Lei0Evan Kennedy1Laura Fasanella2Nicole Yee-Key Li-Jessen3Luc Mongeau4Department of Mechanical Engineering, McGill University, Montreal, QC H3A 0G4, CanadaSchool of Communication Sciences and Disorders, McGill University, Montreal, QC H3A 0G4, CanadaDepartment of Mechanical Engineering, McGill University, Montreal, QC H3A 0G4, CanadaSchool of Communication Sciences and Disorders, McGill University, Montreal, QC H3A 0G4, CanadaDepartment of Mechanical Engineering, McGill University, Montreal, QC H3A 0G4, CanadaThe purpose of this study was to investigate the feasibility of using neck-surface acceleration signals to discriminate between modal, breathy and pressed voice. Voice data for five English single vowels were collected from 31 female native Canadian English speakers using a portable Neck Surface Accelerometer (NSA) and a condenser microphone. Firstly, auditory-perceptual ratings were conducted by five clinically-certificated Speech Language Pathologists (SLPs) to categorize voice type using the audio recordings. Intra- and inter-rater analyses were used to determine the SLPs’ reliability for the perceptual categorization task. Mixed-type samples were screened out, and congruent samples were kept for the subsequent classification task. Secondly, features such as spectral harmonics, jitter, shimmer and spectral entropy were extracted from the NSA data. Supervised learning algorithms were used to map feature vectors to voice type categories. A feature wrapper strategy was used to evaluate the contribution of each feature or feature combinations to the classification between different voice types. The results showed that the highest classification accuracy on a full set was 82.5%. The breathy voice classification accuracy was notably greater (approximately 12%) than those of the other two voice types. Shimmer and spectral entropy were the best correlated metrics for the classification accuracy.https://www.mdpi.com/2076-3417/9/7/1505neck-surface vibrationmachine learningvoice type discrimination
collection DOAJ
language English
format Article
sources DOAJ
author Zhengdong Lei
Evan Kennedy
Laura Fasanella
Nicole Yee-Key Li-Jessen
Luc Mongeau
spellingShingle Zhengdong Lei
Evan Kennedy
Laura Fasanella
Nicole Yee-Key Li-Jessen
Luc Mongeau
Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals
Applied Sciences
neck-surface vibration
machine learning
voice type discrimination
author_facet Zhengdong Lei
Evan Kennedy
Laura Fasanella
Nicole Yee-Key Li-Jessen
Luc Mongeau
author_sort Zhengdong Lei
title Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals
title_short Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals
title_full Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals
title_fullStr Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals
title_full_unstemmed Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals
title_sort discrimination between modal, breathy and pressed voice for single vowels using neck-surface vibration signals
publisher MDPI AG
series Applied Sciences
issn 2076-3417
publishDate 2019-04-01
description The purpose of this study was to investigate the feasibility of using neck-surface acceleration signals to discriminate between modal, breathy and pressed voice. Voice data for five English single vowels were collected from 31 female native Canadian English speakers using a portable Neck Surface Accelerometer (NSA) and a condenser microphone. Firstly, auditory-perceptual ratings were conducted by five clinically-certificated Speech Language Pathologists (SLPs) to categorize voice type using the audio recordings. Intra- and inter-rater analyses were used to determine the SLPs’ reliability for the perceptual categorization task. Mixed-type samples were screened out, and congruent samples were kept for the subsequent classification task. Secondly, features such as spectral harmonics, jitter, shimmer and spectral entropy were extracted from the NSA data. Supervised learning algorithms were used to map feature vectors to voice type categories. A feature wrapper strategy was used to evaluate the contribution of each feature or feature combinations to the classification between different voice types. The results showed that the highest classification accuracy on a full set was 82.5%. The breathy voice classification accuracy was notably greater (approximately 12%) than those of the other two voice types. Shimmer and spectral entropy were the best correlated metrics for the classification accuracy.
topic neck-surface vibration
machine learning
voice type discrimination
url https://www.mdpi.com/2076-3417/9/7/1505
work_keys_str_mv AT zhengdonglei discriminationbetweenmodalbreathyandpressedvoiceforsinglevowelsusingnecksurfacevibrationsignals
AT evankennedy discriminationbetweenmodalbreathyandpressedvoiceforsinglevowelsusingnecksurfacevibrationsignals
AT laurafasanella discriminationbetweenmodalbreathyandpressedvoiceforsinglevowelsusingnecksurfacevibrationsignals
AT nicoleyeekeylijessen discriminationbetweenmodalbreathyandpressedvoiceforsinglevowelsusingnecksurfacevibrationsignals
AT lucmongeau discriminationbetweenmodalbreathyandpressedvoiceforsinglevowelsusingnecksurfacevibrationsignals
_version_ 1725779943858634752