How to Design a Relevant Corpus for Sleepiness Detection Through Voice?

This article presents research on the detection of pathologies affecting speech through automatic analysis. Voice processing has indeed been used for evaluating several diseases such as Parkinson, Alzheimer, or depression. If some studies present results that seem sufficient for clinical application...

Full description

Bibliographic Details
Main Authors: Vincent P. Martin, Jean-Luc Rouas, Jean-Arthur Micoulaud-Franchi, Pierre Philip, Jarek Krajewski
Format: Article
Language:English
Published: Frontiers Media S.A. 2021-09-01
Series:Frontiers in Digital Health
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fdgth.2021.686068/full
id doaj-c295d9f1710543bd828d9b7963cbb5e1
record_format Article
spelling doaj-c295d9f1710543bd828d9b7963cbb5e12021-09-22T05:26:28ZengFrontiers Media S.A.Frontiers in Digital Health2673-253X2021-09-01310.3389/fdgth.2021.686068686068How to Design a Relevant Corpus for Sleepiness Detection Through Voice?Vincent P. Martin0Jean-Luc Rouas1Jean-Arthur Micoulaud-Franchi2Pierre Philip3Jarek Krajewski4Laboratoire Bordelais de Recherche en Informatique, University of Bordeaux, CNRS–UMR 5800, Bordeaux INP, Talence, FranceLaboratoire Bordelais de Recherche en Informatique, University of Bordeaux, CNRS–UMR 5800, Bordeaux INP, Talence, FranceSommeil, Addiction et Neuropsychiatrie, University of Bordeaux, CNRS–USR 3413, CHU Pellegrin, Bordeaux, FranceSommeil, Addiction et Neuropsychiatrie, University of Bordeaux, CNRS–USR 3413, CHU Pellegrin, Bordeaux, FranceEngineering Psychology, Rhenish University of Applied Science, Cologne, GermanyThis article presents research on the detection of pathologies affecting speech through automatic analysis. Voice processing has indeed been used for evaluating several diseases such as Parkinson, Alzheimer, or depression. If some studies present results that seem sufficient for clinical applications, this is not the case for the detection of sleepiness. Even two international challenges and the recent advent of deep learning techniques have still not managed to change this situation. This article explores the hypothesis that the observed average performances of automatic processing find their cause in the design of the corpora. To this aim, we first discuss and refine the concept of sleepiness related to the ground-truth labels. Second, we present an in-depth study of four corpora, bringing to light the methodological choices that have been made and the underlying biases they may have induced. Finally, in light of this information, we propose guidelines for the design of new corpora.https://www.frontiersin.org/articles/10.3389/fdgth.2021.686068/fullsleepinessspeech processingcorpus designmethodological issueguidelines
collection DOAJ
language English
format Article
sources DOAJ
author Vincent P. Martin
Jean-Luc Rouas
Jean-Arthur Micoulaud-Franchi
Pierre Philip
Jarek Krajewski
spellingShingle Vincent P. Martin
Jean-Luc Rouas
Jean-Arthur Micoulaud-Franchi
Pierre Philip
Jarek Krajewski
How to Design a Relevant Corpus for Sleepiness Detection Through Voice?
Frontiers in Digital Health
sleepiness
speech processing
corpus design
methodological issue
guidelines
author_facet Vincent P. Martin
Jean-Luc Rouas
Jean-Arthur Micoulaud-Franchi
Pierre Philip
Jarek Krajewski
author_sort Vincent P. Martin
title How to Design a Relevant Corpus for Sleepiness Detection Through Voice?
title_short How to Design a Relevant Corpus for Sleepiness Detection Through Voice?
title_full How to Design a Relevant Corpus for Sleepiness Detection Through Voice?
title_fullStr How to Design a Relevant Corpus for Sleepiness Detection Through Voice?
title_full_unstemmed How to Design a Relevant Corpus for Sleepiness Detection Through Voice?
title_sort how to design a relevant corpus for sleepiness detection through voice?
publisher Frontiers Media S.A.
series Frontiers in Digital Health
issn 2673-253X
publishDate 2021-09-01
description This article presents research on the detection of pathologies affecting speech through automatic analysis. Voice processing has indeed been used for evaluating several diseases such as Parkinson, Alzheimer, or depression. If some studies present results that seem sufficient for clinical applications, this is not the case for the detection of sleepiness. Even two international challenges and the recent advent of deep learning techniques have still not managed to change this situation. This article explores the hypothesis that the observed average performances of automatic processing find their cause in the design of the corpora. To this aim, we first discuss and refine the concept of sleepiness related to the ground-truth labels. Second, we present an in-depth study of four corpora, bringing to light the methodological choices that have been made and the underlying biases they may have induced. Finally, in light of this information, we propose guidelines for the design of new corpora.
topic sleepiness
speech processing
corpus design
methodological issue
guidelines
url https://www.frontiersin.org/articles/10.3389/fdgth.2021.686068/full
work_keys_str_mv AT vincentpmartin howtodesignarelevantcorpusforsleepinessdetectionthroughvoice
AT jeanlucrouas howtodesignarelevantcorpusforsleepinessdetectionthroughvoice
AT jeanarthurmicoulaudfranchi howtodesignarelevantcorpusforsleepinessdetectionthroughvoice
AT pierrephilip howtodesignarelevantcorpusforsleepinessdetectionthroughvoice
AT jarekkrajewski howtodesignarelevantcorpusforsleepinessdetectionthroughvoice
_version_ 1717371961209782272