Longitudinal Speech Biomarkers for Automated Alzheimer's Detection

We introduce a novel audio processing architecture, the Open Voice Brain Model (OVBM), improving detection accuracy for Alzheimer's (AD) longitudinal discrimination from spontaneous speech. We also outline the OVBM design methodology leading us to such architecture, which in general can incorpo...

Full description

Bibliographic Details
Main Authors: Jordi Laguarta, Brian Subirana
Format: Article
Language:English
Published: Frontiers Media S.A. 2021-04-01
Series:Frontiers in Computer Science
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fcomp.2021.624694/full
id doaj-f38dce7d324844d888ebf9633700fc3b
record_format Article
spelling doaj-f38dce7d324844d888ebf9633700fc3b2021-04-08T13:11:13ZengFrontiers Media S.A.Frontiers in Computer Science2624-98982021-04-01310.3389/fcomp.2021.624694624694Longitudinal Speech Biomarkers for Automated Alzheimer's DetectionJordi Laguarta0Brian Subirana1Brian Subirana2MIT AutoID Laboratory, Cambridge, MA, United StatesMIT AutoID Laboratory, Cambridge, MA, United StatesFaculty of Arts and Sciences, Harvard University, Cambridge, MA, United StatesWe introduce a novel audio processing architecture, the Open Voice Brain Model (OVBM), improving detection accuracy for Alzheimer's (AD) longitudinal discrimination from spontaneous speech. We also outline the OVBM design methodology leading us to such architecture, which in general can incorporate multimodal biomarkers and target simultaneously several diseases and other AI tasks. Key in our methodology is the use of multiple biomarkers complementing each other, and when two of them uniquely identify different subjects in a target disease we say they are orthogonal. We illustrate the OBVM design methodology by introducing sixteen biomarkers, three of which are orthogonal, demonstrating simultaneous above state-of-the-art discrimination for two apparently unrelated diseases such as AD and COVID-19. Depending on the context, throughout the paper we use OVBM indistinctly to refer to the specific architecture or to the broader design methodology. Inspired by research conducted at the MIT Center for Brain Minds and Machines (CBMM), OVBM combines biomarker implementations of the four modules of intelligence: The brain OS chunks and overlaps audio samples and aggregates biomarker features from the sensory stream and cognitive core creating a multi-modal graph neural network of symbolic compositional models for the target task. In this paper we apply the OVBM design methodology to the automated diagnostic of Alzheimer's Dementia (AD) patients, achieving above state-of-the-art accuracy of 93.8% using only raw audio, while extracting a personalized subject saliency map designed to longitudinally track relative disease progression using multiple biomarkers, 16 in the reported AD task. The ultimate aim is to help medical practice by detecting onset and treatment impact so that intervention options can be longitudinally tested. Using the OBVM design methodology, we introduce a novel lung and respiratory tract biomarker created using 200,000+ cough samples to pre-train a model discriminating cough cultural origin. Transfer Learning is subsequently used to incorporate features from this model into various other biomarker-based OVBM architectures. This biomarker yields consistent improvements in AD detection in all the starting OBVM biomarker architecture combinations we tried. This cough dataset sets a new benchmark as the largest audio health dataset with 30,000+ subjects participating in April 2020, demonstrating for the first time cough cultural bias.https://www.frontiersin.org/articles/10.3389/fcomp.2021.624694/fullmultimodal deep learningtransfer learningexplainable speech recognitionbrain modelgraph neural-networksAI diagnostics
collection DOAJ
language English
format Article
sources DOAJ
author Jordi Laguarta
Brian Subirana
Brian Subirana
spellingShingle Jordi Laguarta
Brian Subirana
Brian Subirana
Longitudinal Speech Biomarkers for Automated Alzheimer's Detection
Frontiers in Computer Science
multimodal deep learning
transfer learning
explainable speech recognition
brain model
graph neural-networks
AI diagnostics
author_facet Jordi Laguarta
Brian Subirana
Brian Subirana
author_sort Jordi Laguarta
title Longitudinal Speech Biomarkers for Automated Alzheimer's Detection
title_short Longitudinal Speech Biomarkers for Automated Alzheimer's Detection
title_full Longitudinal Speech Biomarkers for Automated Alzheimer's Detection
title_fullStr Longitudinal Speech Biomarkers for Automated Alzheimer's Detection
title_full_unstemmed Longitudinal Speech Biomarkers for Automated Alzheimer's Detection
title_sort longitudinal speech biomarkers for automated alzheimer's detection
publisher Frontiers Media S.A.
series Frontiers in Computer Science
issn 2624-9898
publishDate 2021-04-01
description We introduce a novel audio processing architecture, the Open Voice Brain Model (OVBM), improving detection accuracy for Alzheimer's (AD) longitudinal discrimination from spontaneous speech. We also outline the OVBM design methodology leading us to such architecture, which in general can incorporate multimodal biomarkers and target simultaneously several diseases and other AI tasks. Key in our methodology is the use of multiple biomarkers complementing each other, and when two of them uniquely identify different subjects in a target disease we say they are orthogonal. We illustrate the OBVM design methodology by introducing sixteen biomarkers, three of which are orthogonal, demonstrating simultaneous above state-of-the-art discrimination for two apparently unrelated diseases such as AD and COVID-19. Depending on the context, throughout the paper we use OVBM indistinctly to refer to the specific architecture or to the broader design methodology. Inspired by research conducted at the MIT Center for Brain Minds and Machines (CBMM), OVBM combines biomarker implementations of the four modules of intelligence: The brain OS chunks and overlaps audio samples and aggregates biomarker features from the sensory stream and cognitive core creating a multi-modal graph neural network of symbolic compositional models for the target task. In this paper we apply the OVBM design methodology to the automated diagnostic of Alzheimer's Dementia (AD) patients, achieving above state-of-the-art accuracy of 93.8% using only raw audio, while extracting a personalized subject saliency map designed to longitudinally track relative disease progression using multiple biomarkers, 16 in the reported AD task. The ultimate aim is to help medical practice by detecting onset and treatment impact so that intervention options can be longitudinally tested. Using the OBVM design methodology, we introduce a novel lung and respiratory tract biomarker created using 200,000+ cough samples to pre-train a model discriminating cough cultural origin. Transfer Learning is subsequently used to incorporate features from this model into various other biomarker-based OVBM architectures. This biomarker yields consistent improvements in AD detection in all the starting OBVM biomarker architecture combinations we tried. This cough dataset sets a new benchmark as the largest audio health dataset with 30,000+ subjects participating in April 2020, demonstrating for the first time cough cultural bias.
topic multimodal deep learning
transfer learning
explainable speech recognition
brain model
graph neural-networks
AI diagnostics
url https://www.frontiersin.org/articles/10.3389/fcomp.2021.624694/full
work_keys_str_mv AT jordilaguarta longitudinalspeechbiomarkersforautomatedalzheimersdetection
AT briansubirana longitudinalspeechbiomarkersforautomatedalzheimersdetection
AT briansubirana longitudinalspeechbiomarkersforautomatedalzheimersdetection
_version_ 1721535123549061120