Development, validation, and proof-of-concept implementation of a two-year risk prediction model for undiagnosed atrial fibrillation using common electronic health data (UNAFIED)
Abstract Background Many patients with atrial fibrillation (AF) remain undiagnosed despite availability of interventions to reduce stroke risk. Predictive models to date are limited by data requirements and theoretical usage. We aimed to develop a model for predicting the 2-year probability of AF di...
Main Authors: | , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
BMC
2021-04-01
|
Series: | BMC Medical Informatics and Decision Making |
Subjects: | |
Online Access: | https://doi.org/10.1186/s12911-021-01482-1 |
id |
doaj-06ecc851ca00477f966a363311362b9b |
---|---|
record_format |
Article |
spelling |
doaj-06ecc851ca00477f966a363311362b9b2021-04-04T11:39:26ZengBMCBMC Medical Informatics and Decision Making1472-69472021-04-012111910.1186/s12911-021-01482-1Development, validation, and proof-of-concept implementation of a two-year risk prediction model for undiagnosed atrial fibrillation using common electronic health data (UNAFIED)Randall W. Grout0Siu L. Hui1Timothy D. Imler2Sarah El-Azab3Jarod Baker4George H. Sands5Mohammad Ateya6Francis Pike7Center for Biomedical Informatics, Regenstrief InstituteCenter for Biomedical Informatics, Regenstrief InstituteCenter for Biomedical Informatics, Regenstrief InstituteResearch Services, Regenstrief InstituteCenter for Biomedical Informatics, Regenstrief InstitutePfizer Inc, US Medical AffairsPfizer Inc, US Medical AffairsDepartment of Biostatistics, Indiana University School of MedicineAbstract Background Many patients with atrial fibrillation (AF) remain undiagnosed despite availability of interventions to reduce stroke risk. Predictive models to date are limited by data requirements and theoretical usage. We aimed to develop a model for predicting the 2-year probability of AF diagnosis and implement it as proof-of-concept (POC) in a production electronic health record (EHR). Methods We used a nested case–control design using data from the Indiana Network for Patient Care. The development cohort came from 2016 to 2017 (outcome period) and 2014 to 2015 (baseline). A separate validation cohort used outcome and baseline periods shifted 2 years before respective development cohort times. Machine learning approaches were used to build predictive model. Patients ≥ 18 years, later restricted to age ≥ 40 years, with at least two encounters and no AF during baseline, were included. In the 6-week EHR prospective pilot, the model was silently implemented in the production system at a large safety-net urban hospital. Three new and two previous logistic regression models were evaluated using receiver-operating characteristics. Number, characteristics, and CHA2DS2-VASc scores of patients identified by the model in the pilot are presented. Results After restricting age to ≥ 40 years, 31,474 AF cases (mean age, 71.5 years; female 49%) and 22,078 controls (mean age, 59.5 years; female 61%) comprised the development cohort. A 10-variable model using age, acute heart disease, albumin, body mass index, chronic obstructive pulmonary disease, gender, heart failure, insurance, kidney disease, and shock yielded the best performance (C-statistic, 0.80 [95% CI 0.79–0.80]). The model performed well in the validation cohort (C-statistic, 0.81 [95% CI 0.8–0.81]). In the EHR pilot, 7916/22,272 (35.5%; mean age, 66 years; female 50%) were identified as higher risk for AF; 5582 (70%) had CHA2DS2-VASc score ≥ 2. Conclusions Using variables commonly available in the EHR, we created a predictive model to identify 2-year risk of developing AF in those previously without diagnosed AF. Successful POC implementation of the model in an EHR provided a practical strategy to identify patients who may benefit from interventions to reduce their stroke risk.https://doi.org/10.1186/s12911-021-01482-1Atrial fibrillationScreeningMachine learningPredictive modelElectronic health recordDecision support |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Randall W. Grout Siu L. Hui Timothy D. Imler Sarah El-Azab Jarod Baker George H. Sands Mohammad Ateya Francis Pike |
spellingShingle |
Randall W. Grout Siu L. Hui Timothy D. Imler Sarah El-Azab Jarod Baker George H. Sands Mohammad Ateya Francis Pike Development, validation, and proof-of-concept implementation of a two-year risk prediction model for undiagnosed atrial fibrillation using common electronic health data (UNAFIED) BMC Medical Informatics and Decision Making Atrial fibrillation Screening Machine learning Predictive model Electronic health record Decision support |
author_facet |
Randall W. Grout Siu L. Hui Timothy D. Imler Sarah El-Azab Jarod Baker George H. Sands Mohammad Ateya Francis Pike |
author_sort |
Randall W. Grout |
title |
Development, validation, and proof-of-concept implementation of a two-year risk prediction model for undiagnosed atrial fibrillation using common electronic health data (UNAFIED) |
title_short |
Development, validation, and proof-of-concept implementation of a two-year risk prediction model for undiagnosed atrial fibrillation using common electronic health data (UNAFIED) |
title_full |
Development, validation, and proof-of-concept implementation of a two-year risk prediction model for undiagnosed atrial fibrillation using common electronic health data (UNAFIED) |
title_fullStr |
Development, validation, and proof-of-concept implementation of a two-year risk prediction model for undiagnosed atrial fibrillation using common electronic health data (UNAFIED) |
title_full_unstemmed |
Development, validation, and proof-of-concept implementation of a two-year risk prediction model for undiagnosed atrial fibrillation using common electronic health data (UNAFIED) |
title_sort |
development, validation, and proof-of-concept implementation of a two-year risk prediction model for undiagnosed atrial fibrillation using common electronic health data (unafied) |
publisher |
BMC |
series |
BMC Medical Informatics and Decision Making |
issn |
1472-6947 |
publishDate |
2021-04-01 |
description |
Abstract Background Many patients with atrial fibrillation (AF) remain undiagnosed despite availability of interventions to reduce stroke risk. Predictive models to date are limited by data requirements and theoretical usage. We aimed to develop a model for predicting the 2-year probability of AF diagnosis and implement it as proof-of-concept (POC) in a production electronic health record (EHR). Methods We used a nested case–control design using data from the Indiana Network for Patient Care. The development cohort came from 2016 to 2017 (outcome period) and 2014 to 2015 (baseline). A separate validation cohort used outcome and baseline periods shifted 2 years before respective development cohort times. Machine learning approaches were used to build predictive model. Patients ≥ 18 years, later restricted to age ≥ 40 years, with at least two encounters and no AF during baseline, were included. In the 6-week EHR prospective pilot, the model was silently implemented in the production system at a large safety-net urban hospital. Three new and two previous logistic regression models were evaluated using receiver-operating characteristics. Number, characteristics, and CHA2DS2-VASc scores of patients identified by the model in the pilot are presented. Results After restricting age to ≥ 40 years, 31,474 AF cases (mean age, 71.5 years; female 49%) and 22,078 controls (mean age, 59.5 years; female 61%) comprised the development cohort. A 10-variable model using age, acute heart disease, albumin, body mass index, chronic obstructive pulmonary disease, gender, heart failure, insurance, kidney disease, and shock yielded the best performance (C-statistic, 0.80 [95% CI 0.79–0.80]). The model performed well in the validation cohort (C-statistic, 0.81 [95% CI 0.8–0.81]). In the EHR pilot, 7916/22,272 (35.5%; mean age, 66 years; female 50%) were identified as higher risk for AF; 5582 (70%) had CHA2DS2-VASc score ≥ 2. Conclusions Using variables commonly available in the EHR, we created a predictive model to identify 2-year risk of developing AF in those previously without diagnosed AF. Successful POC implementation of the model in an EHR provided a practical strategy to identify patients who may benefit from interventions to reduce their stroke risk. |
topic |
Atrial fibrillation Screening Machine learning Predictive model Electronic health record Decision support |
url |
https://doi.org/10.1186/s12911-021-01482-1 |
work_keys_str_mv |
AT randallwgrout developmentvalidationandproofofconceptimplementationofatwoyearriskpredictionmodelforundiagnosedatrialfibrillationusingcommonelectronichealthdataunafied AT siulhui developmentvalidationandproofofconceptimplementationofatwoyearriskpredictionmodelforundiagnosedatrialfibrillationusingcommonelectronichealthdataunafied AT timothydimler developmentvalidationandproofofconceptimplementationofatwoyearriskpredictionmodelforundiagnosedatrialfibrillationusingcommonelectronichealthdataunafied AT sarahelazab developmentvalidationandproofofconceptimplementationofatwoyearriskpredictionmodelforundiagnosedatrialfibrillationusingcommonelectronichealthdataunafied AT jarodbaker developmentvalidationandproofofconceptimplementationofatwoyearriskpredictionmodelforundiagnosedatrialfibrillationusingcommonelectronichealthdataunafied AT georgehsands developmentvalidationandproofofconceptimplementationofatwoyearriskpredictionmodelforundiagnosedatrialfibrillationusingcommonelectronichealthdataunafied AT mohammadateya developmentvalidationandproofofconceptimplementationofatwoyearriskpredictionmodelforundiagnosedatrialfibrillationusingcommonelectronichealthdataunafied AT francispike developmentvalidationandproofofconceptimplementationofatwoyearriskpredictionmodelforundiagnosedatrialfibrillationusingcommonelectronichealthdataunafied |
_version_ |
1721542453912141824 |