Development, validation, and proof-of-concept implementation of a two-year risk prediction model for undiagnosed atrial fibrillation using common electronic health data (UNAFIED)

Abstract Background Many patients with atrial fibrillation (AF) remain undiagnosed despite availability of interventions to reduce stroke risk. Predictive models to date are limited by data requirements and theoretical usage. We aimed to develop a model for predicting the 2-year probability of AF di...

Full description

Bibliographic Details
Main Authors: Randall W. Grout, Siu L. Hui, Timothy D. Imler, Sarah El-Azab, Jarod Baker, George H. Sands, Mohammad Ateya, Francis Pike
Format: Article
Language:English
Published: BMC 2021-04-01
Series:BMC Medical Informatics and Decision Making
Subjects:
Online Access:https://doi.org/10.1186/s12911-021-01482-1
id doaj-06ecc851ca00477f966a363311362b9b
record_format Article
spelling doaj-06ecc851ca00477f966a363311362b9b2021-04-04T11:39:26ZengBMCBMC Medical Informatics and Decision Making1472-69472021-04-012111910.1186/s12911-021-01482-1Development, validation, and proof-of-concept implementation of a two-year risk prediction model for undiagnosed atrial fibrillation using common electronic health data (UNAFIED)Randall W. Grout0Siu L. Hui1Timothy D. Imler2Sarah El-Azab3Jarod Baker4George H. Sands5Mohammad Ateya6Francis Pike7Center for Biomedical Informatics, Regenstrief InstituteCenter for Biomedical Informatics, Regenstrief InstituteCenter for Biomedical Informatics, Regenstrief InstituteResearch Services, Regenstrief InstituteCenter for Biomedical Informatics, Regenstrief InstitutePfizer Inc, US Medical AffairsPfizer Inc, US Medical AffairsDepartment of Biostatistics, Indiana University School of MedicineAbstract Background Many patients with atrial fibrillation (AF) remain undiagnosed despite availability of interventions to reduce stroke risk. Predictive models to date are limited by data requirements and theoretical usage. We aimed to develop a model for predicting the 2-year probability of AF diagnosis and implement it as proof-of-concept (POC) in a production electronic health record (EHR). Methods We used a nested case–control design using data from the Indiana Network for Patient Care. The development cohort came from 2016 to 2017 (outcome period) and 2014 to 2015 (baseline). A separate validation cohort used outcome and baseline periods shifted 2 years before respective development cohort times. Machine learning approaches were used to build predictive model. Patients ≥ 18 years, later restricted to age ≥ 40 years, with at least two encounters and no AF during baseline, were included. In the 6-week EHR prospective pilot, the model was silently implemented in the production system at a large safety-net urban hospital. Three new and two previous logistic regression models were evaluated using receiver-operating characteristics. Number, characteristics, and CHA2DS2-VASc scores of patients identified by the model in the pilot are presented. Results After restricting age to ≥ 40 years, 31,474 AF cases (mean age, 71.5 years; female 49%) and 22,078 controls (mean age, 59.5 years; female 61%) comprised the development cohort. A 10-variable model using age, acute heart disease, albumin, body mass index, chronic obstructive pulmonary disease, gender, heart failure, insurance, kidney disease, and shock yielded the best performance (C-statistic, 0.80 [95% CI 0.79–0.80]). The model performed well in the validation cohort (C-statistic, 0.81 [95% CI 0.8–0.81]). In the EHR pilot, 7916/22,272 (35.5%; mean age, 66 years; female 50%) were identified as higher risk for AF; 5582 (70%) had CHA2DS2-VASc score ≥ 2. Conclusions Using variables commonly available in the EHR, we created a predictive model to identify 2-year risk of developing AF in those previously without diagnosed AF. Successful POC implementation of the model in an EHR provided a practical strategy to identify patients who may benefit from interventions to reduce their stroke risk.https://doi.org/10.1186/s12911-021-01482-1Atrial fibrillationScreeningMachine learningPredictive modelElectronic health recordDecision support
collection DOAJ
language English
format Article
sources DOAJ
author Randall W. Grout
Siu L. Hui
Timothy D. Imler
Sarah El-Azab
Jarod Baker
George H. Sands
Mohammad Ateya
Francis Pike
spellingShingle Randall W. Grout
Siu L. Hui
Timothy D. Imler
Sarah El-Azab
Jarod Baker
George H. Sands
Mohammad Ateya
Francis Pike
Development, validation, and proof-of-concept implementation of a two-year risk prediction model for undiagnosed atrial fibrillation using common electronic health data (UNAFIED)
BMC Medical Informatics and Decision Making
Atrial fibrillation
Screening
Machine learning
Predictive model
Electronic health record
Decision support
author_facet Randall W. Grout
Siu L. Hui
Timothy D. Imler
Sarah El-Azab
Jarod Baker
George H. Sands
Mohammad Ateya
Francis Pike
author_sort Randall W. Grout
title Development, validation, and proof-of-concept implementation of a two-year risk prediction model for undiagnosed atrial fibrillation using common electronic health data (UNAFIED)
title_short Development, validation, and proof-of-concept implementation of a two-year risk prediction model for undiagnosed atrial fibrillation using common electronic health data (UNAFIED)
title_full Development, validation, and proof-of-concept implementation of a two-year risk prediction model for undiagnosed atrial fibrillation using common electronic health data (UNAFIED)
title_fullStr Development, validation, and proof-of-concept implementation of a two-year risk prediction model for undiagnosed atrial fibrillation using common electronic health data (UNAFIED)
title_full_unstemmed Development, validation, and proof-of-concept implementation of a two-year risk prediction model for undiagnosed atrial fibrillation using common electronic health data (UNAFIED)
title_sort development, validation, and proof-of-concept implementation of a two-year risk prediction model for undiagnosed atrial fibrillation using common electronic health data (unafied)
publisher BMC
series BMC Medical Informatics and Decision Making
issn 1472-6947
publishDate 2021-04-01
description Abstract Background Many patients with atrial fibrillation (AF) remain undiagnosed despite availability of interventions to reduce stroke risk. Predictive models to date are limited by data requirements and theoretical usage. We aimed to develop a model for predicting the 2-year probability of AF diagnosis and implement it as proof-of-concept (POC) in a production electronic health record (EHR). Methods We used a nested case–control design using data from the Indiana Network for Patient Care. The development cohort came from 2016 to 2017 (outcome period) and 2014 to 2015 (baseline). A separate validation cohort used outcome and baseline periods shifted 2 years before respective development cohort times. Machine learning approaches were used to build predictive model. Patients ≥ 18 years, later restricted to age ≥ 40 years, with at least two encounters and no AF during baseline, were included. In the 6-week EHR prospective pilot, the model was silently implemented in the production system at a large safety-net urban hospital. Three new and two previous logistic regression models were evaluated using receiver-operating characteristics. Number, characteristics, and CHA2DS2-VASc scores of patients identified by the model in the pilot are presented. Results After restricting age to ≥ 40 years, 31,474 AF cases (mean age, 71.5 years; female 49%) and 22,078 controls (mean age, 59.5 years; female 61%) comprised the development cohort. A 10-variable model using age, acute heart disease, albumin, body mass index, chronic obstructive pulmonary disease, gender, heart failure, insurance, kidney disease, and shock yielded the best performance (C-statistic, 0.80 [95% CI 0.79–0.80]). The model performed well in the validation cohort (C-statistic, 0.81 [95% CI 0.8–0.81]). In the EHR pilot, 7916/22,272 (35.5%; mean age, 66 years; female 50%) were identified as higher risk for AF; 5582 (70%) had CHA2DS2-VASc score ≥ 2. Conclusions Using variables commonly available in the EHR, we created a predictive model to identify 2-year risk of developing AF in those previously without diagnosed AF. Successful POC implementation of the model in an EHR provided a practical strategy to identify patients who may benefit from interventions to reduce their stroke risk.
topic Atrial fibrillation
Screening
Machine learning
Predictive model
Electronic health record
Decision support
url https://doi.org/10.1186/s12911-021-01482-1
work_keys_str_mv AT randallwgrout developmentvalidationandproofofconceptimplementationofatwoyearriskpredictionmodelforundiagnosedatrialfibrillationusingcommonelectronichealthdataunafied
AT siulhui developmentvalidationandproofofconceptimplementationofatwoyearriskpredictionmodelforundiagnosedatrialfibrillationusingcommonelectronichealthdataunafied
AT timothydimler developmentvalidationandproofofconceptimplementationofatwoyearriskpredictionmodelforundiagnosedatrialfibrillationusingcommonelectronichealthdataunafied
AT sarahelazab developmentvalidationandproofofconceptimplementationofatwoyearriskpredictionmodelforundiagnosedatrialfibrillationusingcommonelectronichealthdataunafied
AT jarodbaker developmentvalidationandproofofconceptimplementationofatwoyearriskpredictionmodelforundiagnosedatrialfibrillationusingcommonelectronichealthdataunafied
AT georgehsands developmentvalidationandproofofconceptimplementationofatwoyearriskpredictionmodelforundiagnosedatrialfibrillationusingcommonelectronichealthdataunafied
AT mohammadateya developmentvalidationandproofofconceptimplementationofatwoyearriskpredictionmodelforundiagnosedatrialfibrillationusingcommonelectronichealthdataunafied
AT francispike developmentvalidationandproofofconceptimplementationofatwoyearriskpredictionmodelforundiagnosedatrialfibrillationusingcommonelectronichealthdataunafied
_version_ 1721542453912141824