Longitudinal Speech Biomarkers for Automated Alzheimer's Detection
We introduce a novel audio processing architecture, the Open Voice Brain Model (OVBM), improving detection accuracy for Alzheimer's (AD) longitudinal discrimination from spontaneous speech. We also outline the OVBM design methodology leading us to such architecture, which in general can incorpo...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2021-04-01
|
Series: | Frontiers in Computer Science |
Subjects: | |
Online Access: | https://www.frontiersin.org/articles/10.3389/fcomp.2021.624694/full |
id |
doaj-f38dce7d324844d888ebf9633700fc3b |
---|---|
record_format |
Article |
spelling |
doaj-f38dce7d324844d888ebf9633700fc3b2021-04-08T13:11:13ZengFrontiers Media S.A.Frontiers in Computer Science2624-98982021-04-01310.3389/fcomp.2021.624694624694Longitudinal Speech Biomarkers for Automated Alzheimer's DetectionJordi Laguarta0Brian Subirana1Brian Subirana2MIT AutoID Laboratory, Cambridge, MA, United StatesMIT AutoID Laboratory, Cambridge, MA, United StatesFaculty of Arts and Sciences, Harvard University, Cambridge, MA, United StatesWe introduce a novel audio processing architecture, the Open Voice Brain Model (OVBM), improving detection accuracy for Alzheimer's (AD) longitudinal discrimination from spontaneous speech. We also outline the OVBM design methodology leading us to such architecture, which in general can incorporate multimodal biomarkers and target simultaneously several diseases and other AI tasks. Key in our methodology is the use of multiple biomarkers complementing each other, and when two of them uniquely identify different subjects in a target disease we say they are orthogonal. We illustrate the OBVM design methodology by introducing sixteen biomarkers, three of which are orthogonal, demonstrating simultaneous above state-of-the-art discrimination for two apparently unrelated diseases such as AD and COVID-19. Depending on the context, throughout the paper we use OVBM indistinctly to refer to the specific architecture or to the broader design methodology. Inspired by research conducted at the MIT Center for Brain Minds and Machines (CBMM), OVBM combines biomarker implementations of the four modules of intelligence: The brain OS chunks and overlaps audio samples and aggregates biomarker features from the sensory stream and cognitive core creating a multi-modal graph neural network of symbolic compositional models for the target task. In this paper we apply the OVBM design methodology to the automated diagnostic of Alzheimer's Dementia (AD) patients, achieving above state-of-the-art accuracy of 93.8% using only raw audio, while extracting a personalized subject saliency map designed to longitudinally track relative disease progression using multiple biomarkers, 16 in the reported AD task. The ultimate aim is to help medical practice by detecting onset and treatment impact so that intervention options can be longitudinally tested. Using the OBVM design methodology, we introduce a novel lung and respiratory tract biomarker created using 200,000+ cough samples to pre-train a model discriminating cough cultural origin. Transfer Learning is subsequently used to incorporate features from this model into various other biomarker-based OVBM architectures. This biomarker yields consistent improvements in AD detection in all the starting OBVM biomarker architecture combinations we tried. This cough dataset sets a new benchmark as the largest audio health dataset with 30,000+ subjects participating in April 2020, demonstrating for the first time cough cultural bias.https://www.frontiersin.org/articles/10.3389/fcomp.2021.624694/fullmultimodal deep learningtransfer learningexplainable speech recognitionbrain modelgraph neural-networksAI diagnostics |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jordi Laguarta Brian Subirana Brian Subirana |
spellingShingle |
Jordi Laguarta Brian Subirana Brian Subirana Longitudinal Speech Biomarkers for Automated Alzheimer's Detection Frontiers in Computer Science multimodal deep learning transfer learning explainable speech recognition brain model graph neural-networks AI diagnostics |
author_facet |
Jordi Laguarta Brian Subirana Brian Subirana |
author_sort |
Jordi Laguarta |
title |
Longitudinal Speech Biomarkers for Automated Alzheimer's Detection |
title_short |
Longitudinal Speech Biomarkers for Automated Alzheimer's Detection |
title_full |
Longitudinal Speech Biomarkers for Automated Alzheimer's Detection |
title_fullStr |
Longitudinal Speech Biomarkers for Automated Alzheimer's Detection |
title_full_unstemmed |
Longitudinal Speech Biomarkers for Automated Alzheimer's Detection |
title_sort |
longitudinal speech biomarkers for automated alzheimer's detection |
publisher |
Frontiers Media S.A. |
series |
Frontiers in Computer Science |
issn |
2624-9898 |
publishDate |
2021-04-01 |
description |
We introduce a novel audio processing architecture, the Open Voice Brain Model (OVBM), improving detection accuracy for Alzheimer's (AD) longitudinal discrimination from spontaneous speech. We also outline the OVBM design methodology leading us to such architecture, which in general can incorporate multimodal biomarkers and target simultaneously several diseases and other AI tasks. Key in our methodology is the use of multiple biomarkers complementing each other, and when two of them uniquely identify different subjects in a target disease we say they are orthogonal. We illustrate the OBVM design methodology by introducing sixteen biomarkers, three of which are orthogonal, demonstrating simultaneous above state-of-the-art discrimination for two apparently unrelated diseases such as AD and COVID-19. Depending on the context, throughout the paper we use OVBM indistinctly to refer to the specific architecture or to the broader design methodology. Inspired by research conducted at the MIT Center for Brain Minds and Machines (CBMM), OVBM combines biomarker implementations of the four modules of intelligence: The brain OS chunks and overlaps audio samples and aggregates biomarker features from the sensory stream and cognitive core creating a multi-modal graph neural network of symbolic compositional models for the target task. In this paper we apply the OVBM design methodology to the automated diagnostic of Alzheimer's Dementia (AD) patients, achieving above state-of-the-art accuracy of 93.8% using only raw audio, while extracting a personalized subject saliency map designed to longitudinally track relative disease progression using multiple biomarkers, 16 in the reported AD task. The ultimate aim is to help medical practice by detecting onset and treatment impact so that intervention options can be longitudinally tested. Using the OBVM design methodology, we introduce a novel lung and respiratory tract biomarker created using 200,000+ cough samples to pre-train a model discriminating cough cultural origin. Transfer Learning is subsequently used to incorporate features from this model into various other biomarker-based OVBM architectures. This biomarker yields consistent improvements in AD detection in all the starting OBVM biomarker architecture combinations we tried. This cough dataset sets a new benchmark as the largest audio health dataset with 30,000+ subjects participating in April 2020, demonstrating for the first time cough cultural bias. |
topic |
multimodal deep learning transfer learning explainable speech recognition brain model graph neural-networks AI diagnostics |
url |
https://www.frontiersin.org/articles/10.3389/fcomp.2021.624694/full |
work_keys_str_mv |
AT jordilaguarta longitudinalspeechbiomarkersforautomatedalzheimersdetection AT briansubirana longitudinalspeechbiomarkersforautomatedalzheimersdetection AT briansubirana longitudinalspeechbiomarkersforautomatedalzheimersdetection |
_version_ |
1721535123549061120 |