Acoustic and Language Based Deep Learning Approaches for Alzheimer's Dementia Detection From Spontaneous Speech
Current methods for early diagnosis of Alzheimer's Dementia include structured questionnaires, structured interviews, and various cognitive tests. Language difficulties are a major problem in dementia as linguistic skills break down. Current methods do not provide robust tools to capture the tr...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2021-02-01
|
Series: | Frontiers in Aging Neuroscience |
Subjects: | |
Online Access: | https://www.frontiersin.org/articles/10.3389/fnagi.2021.623607/full |
id |
doaj-55b3815f7b394535ae80b7769b970d2e |
---|---|
record_format |
Article |
spelling |
doaj-55b3815f7b394535ae80b7769b970d2e2021-02-05T07:46:22ZengFrontiers Media S.A.Frontiers in Aging Neuroscience1663-43652021-02-011310.3389/fnagi.2021.623607623607Acoustic and Language Based Deep Learning Approaches for Alzheimer's Dementia Detection From Spontaneous SpeechPranav Mahajan0Veeky Baths1Cognitive Neuroscience Lab, Department of Electrical and Electronics Engineering, BITS Pilani University K. K. Birla Goa Campus, Pilani, IndiaCognitive Neuroscience Lab, Department of Biological Sciences, BITS Pilani University K. K. Birla Goa Campus, Pilani, IndiaCurrent methods for early diagnosis of Alzheimer's Dementia include structured questionnaires, structured interviews, and various cognitive tests. Language difficulties are a major problem in dementia as linguistic skills break down. Current methods do not provide robust tools to capture the true nature of language deficits in spontaneous speech. Early detection of Alzheimer's Dementia (AD) from spontaneous speech overcomes the limitations of earlier approaches as it is less time consuming, can be done at home, and is relatively inexpensive. In this work, we re-implement the existing NLP methods, which used CNN-LSTM architectures and targeted features from conversational transcripts. Our work sheds light on why the accuracy of these models drops to 72.92% on the ADReSS dataset, whereas, they gave state of the art results on the DementiaBank dataset. Further, we build upon these language input-based recurrent neural networks by devising an end-to-end deep learning-based solution that performs a binary classification of Alzheimer's Dementia from the spontaneous speech of the patients. We utilize the ADReSS dataset for all our implementations and explore the deep learning-based methods of combining acoustic features into a common vector using recurrent units. Our approach of combining acoustic features using the Speech-GRU improves the accuracy by 2% in comparison to acoustic baselines. When further enriched by targeted features, the Speech-GRU performs better than acoustic baselines by 6.25%. We propose a bi-modal approach for AD classification and discuss the merits and opportunities of our approach.https://www.frontiersin.org/articles/10.3389/fnagi.2021.623607/fullaffective computingcognitive decline detectionnatural language processingdeep learningcomputational paralinguistics |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Pranav Mahajan Veeky Baths |
spellingShingle |
Pranav Mahajan Veeky Baths Acoustic and Language Based Deep Learning Approaches for Alzheimer's Dementia Detection From Spontaneous Speech Frontiers in Aging Neuroscience affective computing cognitive decline detection natural language processing deep learning computational paralinguistics |
author_facet |
Pranav Mahajan Veeky Baths |
author_sort |
Pranav Mahajan |
title |
Acoustic and Language Based Deep Learning Approaches for Alzheimer's Dementia Detection From Spontaneous Speech |
title_short |
Acoustic and Language Based Deep Learning Approaches for Alzheimer's Dementia Detection From Spontaneous Speech |
title_full |
Acoustic and Language Based Deep Learning Approaches for Alzheimer's Dementia Detection From Spontaneous Speech |
title_fullStr |
Acoustic and Language Based Deep Learning Approaches for Alzheimer's Dementia Detection From Spontaneous Speech |
title_full_unstemmed |
Acoustic and Language Based Deep Learning Approaches for Alzheimer's Dementia Detection From Spontaneous Speech |
title_sort |
acoustic and language based deep learning approaches for alzheimer's dementia detection from spontaneous speech |
publisher |
Frontiers Media S.A. |
series |
Frontiers in Aging Neuroscience |
issn |
1663-4365 |
publishDate |
2021-02-01 |
description |
Current methods for early diagnosis of Alzheimer's Dementia include structured questionnaires, structured interviews, and various cognitive tests. Language difficulties are a major problem in dementia as linguistic skills break down. Current methods do not provide robust tools to capture the true nature of language deficits in spontaneous speech. Early detection of Alzheimer's Dementia (AD) from spontaneous speech overcomes the limitations of earlier approaches as it is less time consuming, can be done at home, and is relatively inexpensive. In this work, we re-implement the existing NLP methods, which used CNN-LSTM architectures and targeted features from conversational transcripts. Our work sheds light on why the accuracy of these models drops to 72.92% on the ADReSS dataset, whereas, they gave state of the art results on the DementiaBank dataset. Further, we build upon these language input-based recurrent neural networks by devising an end-to-end deep learning-based solution that performs a binary classification of Alzheimer's Dementia from the spontaneous speech of the patients. We utilize the ADReSS dataset for all our implementations and explore the deep learning-based methods of combining acoustic features into a common vector using recurrent units. Our approach of combining acoustic features using the Speech-GRU improves the accuracy by 2% in comparison to acoustic baselines. When further enriched by targeted features, the Speech-GRU performs better than acoustic baselines by 6.25%. We propose a bi-modal approach for AD classification and discuss the merits and opportunities of our approach. |
topic |
affective computing cognitive decline detection natural language processing deep learning computational paralinguistics |
url |
https://www.frontiersin.org/articles/10.3389/fnagi.2021.623607/full |
work_keys_str_mv |
AT pranavmahajan acousticandlanguagebaseddeeplearningapproachesforalzheimersdementiadetectionfromspontaneousspeech AT veekybaths acousticandlanguagebaseddeeplearningapproachesforalzheimersdementiadetectionfromspontaneousspeech |
_version_ |
1724284072280195072 |