Acoustic and Language Based Deep Learning Approaches for Alzheimer's Dementia Detection From Spontaneous Speech

Current methods for early diagnosis of Alzheimer's Dementia include structured questionnaires, structured interviews, and various cognitive tests. Language difficulties are a major problem in dementia as linguistic skills break down. Current methods do not provide robust tools to capture the tr...

Full description

Bibliographic Details
Main Authors: Pranav Mahajan, Veeky Baths
Format: Article
Language:English
Published: Frontiers Media S.A. 2021-02-01
Series:Frontiers in Aging Neuroscience
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fnagi.2021.623607/full
id doaj-55b3815f7b394535ae80b7769b970d2e
record_format Article
spelling doaj-55b3815f7b394535ae80b7769b970d2e2021-02-05T07:46:22ZengFrontiers Media S.A.Frontiers in Aging Neuroscience1663-43652021-02-011310.3389/fnagi.2021.623607623607Acoustic and Language Based Deep Learning Approaches for Alzheimer's Dementia Detection From Spontaneous SpeechPranav Mahajan0Veeky Baths1Cognitive Neuroscience Lab, Department of Electrical and Electronics Engineering, BITS Pilani University K. K. Birla Goa Campus, Pilani, IndiaCognitive Neuroscience Lab, Department of Biological Sciences, BITS Pilani University K. K. Birla Goa Campus, Pilani, IndiaCurrent methods for early diagnosis of Alzheimer's Dementia include structured questionnaires, structured interviews, and various cognitive tests. Language difficulties are a major problem in dementia as linguistic skills break down. Current methods do not provide robust tools to capture the true nature of language deficits in spontaneous speech. Early detection of Alzheimer's Dementia (AD) from spontaneous speech overcomes the limitations of earlier approaches as it is less time consuming, can be done at home, and is relatively inexpensive. In this work, we re-implement the existing NLP methods, which used CNN-LSTM architectures and targeted features from conversational transcripts. Our work sheds light on why the accuracy of these models drops to 72.92% on the ADReSS dataset, whereas, they gave state of the art results on the DementiaBank dataset. Further, we build upon these language input-based recurrent neural networks by devising an end-to-end deep learning-based solution that performs a binary classification of Alzheimer's Dementia from the spontaneous speech of the patients. We utilize the ADReSS dataset for all our implementations and explore the deep learning-based methods of combining acoustic features into a common vector using recurrent units. Our approach of combining acoustic features using the Speech-GRU improves the accuracy by 2% in comparison to acoustic baselines. When further enriched by targeted features, the Speech-GRU performs better than acoustic baselines by 6.25%. We propose a bi-modal approach for AD classification and discuss the merits and opportunities of our approach.https://www.frontiersin.org/articles/10.3389/fnagi.2021.623607/fullaffective computingcognitive decline detectionnatural language processingdeep learningcomputational paralinguistics
collection DOAJ
language English
format Article
sources DOAJ
author Pranav Mahajan
Veeky Baths
spellingShingle Pranav Mahajan
Veeky Baths
Acoustic and Language Based Deep Learning Approaches for Alzheimer's Dementia Detection From Spontaneous Speech
Frontiers in Aging Neuroscience
affective computing
cognitive decline detection
natural language processing
deep learning
computational paralinguistics
author_facet Pranav Mahajan
Veeky Baths
author_sort Pranav Mahajan
title Acoustic and Language Based Deep Learning Approaches for Alzheimer's Dementia Detection From Spontaneous Speech
title_short Acoustic and Language Based Deep Learning Approaches for Alzheimer's Dementia Detection From Spontaneous Speech
title_full Acoustic and Language Based Deep Learning Approaches for Alzheimer's Dementia Detection From Spontaneous Speech
title_fullStr Acoustic and Language Based Deep Learning Approaches for Alzheimer's Dementia Detection From Spontaneous Speech
title_full_unstemmed Acoustic and Language Based Deep Learning Approaches for Alzheimer's Dementia Detection From Spontaneous Speech
title_sort acoustic and language based deep learning approaches for alzheimer's dementia detection from spontaneous speech
publisher Frontiers Media S.A.
series Frontiers in Aging Neuroscience
issn 1663-4365
publishDate 2021-02-01
description Current methods for early diagnosis of Alzheimer's Dementia include structured questionnaires, structured interviews, and various cognitive tests. Language difficulties are a major problem in dementia as linguistic skills break down. Current methods do not provide robust tools to capture the true nature of language deficits in spontaneous speech. Early detection of Alzheimer's Dementia (AD) from spontaneous speech overcomes the limitations of earlier approaches as it is less time consuming, can be done at home, and is relatively inexpensive. In this work, we re-implement the existing NLP methods, which used CNN-LSTM architectures and targeted features from conversational transcripts. Our work sheds light on why the accuracy of these models drops to 72.92% on the ADReSS dataset, whereas, they gave state of the art results on the DementiaBank dataset. Further, we build upon these language input-based recurrent neural networks by devising an end-to-end deep learning-based solution that performs a binary classification of Alzheimer's Dementia from the spontaneous speech of the patients. We utilize the ADReSS dataset for all our implementations and explore the deep learning-based methods of combining acoustic features into a common vector using recurrent units. Our approach of combining acoustic features using the Speech-GRU improves the accuracy by 2% in comparison to acoustic baselines. When further enriched by targeted features, the Speech-GRU performs better than acoustic baselines by 6.25%. We propose a bi-modal approach for AD classification and discuss the merits and opportunities of our approach.
topic affective computing
cognitive decline detection
natural language processing
deep learning
computational paralinguistics
url https://www.frontiersin.org/articles/10.3389/fnagi.2021.623607/full
work_keys_str_mv AT pranavmahajan acousticandlanguagebaseddeeplearningapproachesforalzheimersdementiadetectionfromspontaneousspeech
AT veekybaths acousticandlanguagebaseddeeplearningapproachesforalzheimersdementiadetectionfromspontaneousspeech
_version_ 1724284072280195072