Multimodal inductive transfer learning for detection of Alzheimer's dementia and its severity

Copyright © 2020 ISCA Alzheimer's disease is estimated to affect around 50 million people worldwide and is rising rapidly, with a global economic burden of nearly a trillion dollars. This calls for scalable, cost-effective, and robust methods for detection of Alzheimer's dementia (AD). We...

Full description

Bibliographic Details
Main Authors: Sarawgi, U (Author), Zulfikar, W (Author), Soliman, N (Author), Maes, P (Author)
Format: Article
Language:English
Published: ISCA, 2021-11-02T14:29:17Z.
Subjects:
Online Access:Get fulltext
LEADER 02118 am a22001933u 4500
001 137095
042 |a dc 
100 1 0 |a Sarawgi, U  |e author 
700 1 0 |a Zulfikar, W  |e author 
700 1 0 |a Soliman, N  |e author 
700 1 0 |a Maes, P  |e author 
245 0 0 |a Multimodal inductive transfer learning for detection of Alzheimer's dementia and its severity 
260 |b ISCA,   |c 2021-11-02T14:29:17Z. 
856 |z Get fulltext  |u https://hdl.handle.net/1721.1/137095 
520 |a Copyright © 2020 ISCA Alzheimer's disease is estimated to affect around 50 million people worldwide and is rising rapidly, with a global economic burden of nearly a trillion dollars. This calls for scalable, cost-effective, and robust methods for detection of Alzheimer's dementia (AD). We present a novel architecture that leverages acoustic, cognitive, and linguistic features to form a multimodal ensemble system. It uses specialized artificial neural networks with temporal characteristics to detect AD and its severity, which is reflected through Mini-Mental State Exam (MMSE) scores. We first evaluate it on the ADReSS challenge dataset, which is a subject-independent and balanced dataset matched for age and gender to mitigate biases, and is available through DementiaBank. Our system achieves state-of-the-art test accuracy, precision, recall, and F1-score of 83.3% each for AD classification, and state-of-the-art test root mean squared error (RMSE) of 4.60 for MMSE score regression. To the best of our knowledge, the system further achieves state-of-the-art AD classification accuracy of 88.0% when evaluated on the full benchmark DementiaBank Pitt database. Our work highlights the applicability and transferability of spontaneous speech to produce a robust inductive transfer learning model, and demonstrates generalizability through a task-agnostic feature-space. The source code is available at https://github.com/wazeerzulfikar/alzheimers-dementia. 
546 |a en 
655 7 |a Article 
773 |t 10.21437/Interspeech.2020-3137 
773 |t Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH