AutoScore: A Machine Learning–Based Automatic Clinical Score Generator and Its Application to Mortality Prediction Using Electronic Health Records

BackgroundRisk scores can be useful in clinical risk stratification and accurate allocations of medical resources, helping health providers improve patient care. Point-based scores are more understandable and explainable than other complex models and are now widely used in cl...

Full description

Bibliographic Details
Main Authors: Xie, Feng, Chakraborty, Bibhas, Ong, Marcus Eng Hock, Goldstein, Benjamin Alan, Liu, Nan
Format: Article
Language:English
Published: JMIR Publications 2020-10-01
Series:JMIR Medical Informatics
Online Access:http://medinform.jmir.org/2020/10/e21798/
id doaj-46bc7b61d3a142bb9416ad217e1e2467
record_format Article
spelling doaj-46bc7b61d3a142bb9416ad217e1e24672021-05-03T01:42:56ZengJMIR PublicationsJMIR Medical Informatics2291-96942020-10-01810e2179810.2196/21798AutoScore: A Machine Learning–Based Automatic Clinical Score Generator and Its Application to Mortality Prediction Using Electronic Health RecordsXie, FengChakraborty, BibhasOng, Marcus Eng HockGoldstein, Benjamin AlanLiu, Nan BackgroundRisk scores can be useful in clinical risk stratification and accurate allocations of medical resources, helping health providers improve patient care. Point-based scores are more understandable and explainable than other complex models and are now widely used in clinical decision making. However, the development of the risk scoring model is nontrivial and has not yet been systematically presented, with few studies investigating methods of clinical score generation using electronic health records. ObjectiveThis study aims to propose AutoScore, a machine learning–based automatic clinical score generator consisting of 6 modules for developing interpretable point-based scores. Future users can employ the AutoScore framework to create clinical scores effortlessly in various clinical applications. MethodsWe proposed the AutoScore framework comprising 6 modules that included variable ranking, variable transformation, score derivation, model selection, score fine-tuning, and model evaluation. To demonstrate the performance of AutoScore, we used data from the Beth Israel Deaconess Medical Center to build a scoring model for mortality prediction and then compared the data with other baseline models using the receiver operating characteristic analysis. A software package in R 3.5.3 (R Foundation) was also developed to demonstrate the implementation of AutoScore. ResultsImplemented on the data set with 44,918 individual admission episodes of intensive care, the AutoScore-created scoring models performed comparably well as other standard methods (ie, logistic regression, stepwise regression, least absolute shrinkage and selection operator, and random forest) in terms of predictive accuracy and model calibration but required fewer predictors and presented high interpretability and accessibility. The nine-variable, AutoScore-created, point-based scoring model achieved an area under the curve (AUC) of 0.780 (95% CI 0.764-0.798), whereas the model of logistic regression with 24 variables had an AUC of 0.778 (95% CI 0.760-0.795). Moreover, the AutoScore framework also drives the clinical research continuum and automation with its integration of all necessary modules. ConclusionsWe developed an easy-to-use, machine learning–based automatic clinical score generator, AutoScore; systematically presented its structure; and demonstrated its superiority (predictive performance and interpretability) over other conventional methods using a benchmark database. AutoScore will emerge as a potential scoring tool in various medical applications.http://medinform.jmir.org/2020/10/e21798/
collection DOAJ
language English
format Article
sources DOAJ
author Xie, Feng
Chakraborty, Bibhas
Ong, Marcus Eng Hock
Goldstein, Benjamin Alan
Liu, Nan
spellingShingle Xie, Feng
Chakraborty, Bibhas
Ong, Marcus Eng Hock
Goldstein, Benjamin Alan
Liu, Nan
AutoScore: A Machine Learning–Based Automatic Clinical Score Generator and Its Application to Mortality Prediction Using Electronic Health Records
JMIR Medical Informatics
author_facet Xie, Feng
Chakraborty, Bibhas
Ong, Marcus Eng Hock
Goldstein, Benjamin Alan
Liu, Nan
author_sort Xie, Feng
title AutoScore: A Machine Learning–Based Automatic Clinical Score Generator and Its Application to Mortality Prediction Using Electronic Health Records
title_short AutoScore: A Machine Learning–Based Automatic Clinical Score Generator and Its Application to Mortality Prediction Using Electronic Health Records
title_full AutoScore: A Machine Learning–Based Automatic Clinical Score Generator and Its Application to Mortality Prediction Using Electronic Health Records
title_fullStr AutoScore: A Machine Learning–Based Automatic Clinical Score Generator and Its Application to Mortality Prediction Using Electronic Health Records
title_full_unstemmed AutoScore: A Machine Learning–Based Automatic Clinical Score Generator and Its Application to Mortality Prediction Using Electronic Health Records
title_sort autoscore: a machine learning–based automatic clinical score generator and its application to mortality prediction using electronic health records
publisher JMIR Publications
series JMIR Medical Informatics
issn 2291-9694
publishDate 2020-10-01
description BackgroundRisk scores can be useful in clinical risk stratification and accurate allocations of medical resources, helping health providers improve patient care. Point-based scores are more understandable and explainable than other complex models and are now widely used in clinical decision making. However, the development of the risk scoring model is nontrivial and has not yet been systematically presented, with few studies investigating methods of clinical score generation using electronic health records. ObjectiveThis study aims to propose AutoScore, a machine learning–based automatic clinical score generator consisting of 6 modules for developing interpretable point-based scores. Future users can employ the AutoScore framework to create clinical scores effortlessly in various clinical applications. MethodsWe proposed the AutoScore framework comprising 6 modules that included variable ranking, variable transformation, score derivation, model selection, score fine-tuning, and model evaluation. To demonstrate the performance of AutoScore, we used data from the Beth Israel Deaconess Medical Center to build a scoring model for mortality prediction and then compared the data with other baseline models using the receiver operating characteristic analysis. A software package in R 3.5.3 (R Foundation) was also developed to demonstrate the implementation of AutoScore. ResultsImplemented on the data set with 44,918 individual admission episodes of intensive care, the AutoScore-created scoring models performed comparably well as other standard methods (ie, logistic regression, stepwise regression, least absolute shrinkage and selection operator, and random forest) in terms of predictive accuracy and model calibration but required fewer predictors and presented high interpretability and accessibility. The nine-variable, AutoScore-created, point-based scoring model achieved an area under the curve (AUC) of 0.780 (95% CI 0.764-0.798), whereas the model of logistic regression with 24 variables had an AUC of 0.778 (95% CI 0.760-0.795). Moreover, the AutoScore framework also drives the clinical research continuum and automation with its integration of all necessary modules. ConclusionsWe developed an easy-to-use, machine learning–based automatic clinical score generator, AutoScore; systematically presented its structure; and demonstrated its superiority (predictive performance and interpretability) over other conventional methods using a benchmark database. AutoScore will emerge as a potential scoring tool in various medical applications.
url http://medinform.jmir.org/2020/10/e21798/
work_keys_str_mv AT xiefeng autoscoreamachinelearningbasedautomaticclinicalscoregeneratoranditsapplicationtomortalitypredictionusingelectronichealthrecords
AT chakrabortybibhas autoscoreamachinelearningbasedautomaticclinicalscoregeneratoranditsapplicationtomortalitypredictionusingelectronichealthrecords
AT ongmarcusenghock autoscoreamachinelearningbasedautomaticclinicalscoregeneratoranditsapplicationtomortalitypredictionusingelectronichealthrecords
AT goldsteinbenjaminalan autoscoreamachinelearningbasedautomaticclinicalscoregeneratoranditsapplicationtomortalitypredictionusingelectronichealthrecords
AT liunan autoscoreamachinelearningbasedautomaticclinicalscoregeneratoranditsapplicationtomortalitypredictionusingelectronichealthrecords
_version_ 1721485565248929792