Machine learning based predictors for COVID-19 disease severity

Abstract Predictors of the need for intensive care and mechanical ventilation can help healthcare systems in planning for surge capacity for COVID-19. We used socio-demographic data, clinical data, and blood panel profile data at the time of initial presentation to develop machine learning algorithm...

Full description

Bibliographic Details
Main Authors: Dhruv Patel, Vikram Kher, Bhushan Desai, Xiaomeng Lei, Steven Cen, Neha Nanda, Ali Gholamrezanezhad, Vinay Duddalwar, Bino Varghese, Assad A Oberai
Format: Article
Language:English
Published: Nature Publishing Group 2021-02-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-021-83967-7
id doaj-98e212396cce4319a5ca65c1b2b5d377
record_format Article
spelling doaj-98e212396cce4319a5ca65c1b2b5d3772021-03-11T12:13:14ZengNature Publishing GroupScientific Reports2045-23222021-02-011111710.1038/s41598-021-83967-7Machine learning based predictors for COVID-19 disease severityDhruv Patel0Vikram Kher1Bhushan Desai2Xiaomeng Lei3Steven Cen4Neha Nanda5Ali Gholamrezanezhad6Vinay Duddalwar7Bino Varghese8Assad A Oberai9Viterbi School of Engineering, University of Southern CaliforniaViterbi School of Engineering, University of Southern CaliforniaKeck School of Medicine, University of Southern CaliforniaKeck School of Medicine, University of Southern CaliforniaKeck School of Medicine, University of Southern CaliforniaKeck School of Medicine, University of Southern CaliforniaKeck School of Medicine, University of Southern CaliforniaKeck School of Medicine, University of Southern CaliforniaKeck School of Medicine, University of Southern CaliforniaViterbi School of Engineering, University of Southern CaliforniaAbstract Predictors of the need for intensive care and mechanical ventilation can help healthcare systems in planning for surge capacity for COVID-19. We used socio-demographic data, clinical data, and blood panel profile data at the time of initial presentation to develop machine learning algorithms for predicting the need for intensive care and mechanical ventilation. Among the algorithms considered, the Random Forest classifier performed the best with $$\text {AUC} = 0.80$$ AUC = 0.80 for predicting ICU need and $$\text {AUC} = 0.82$$ AUC = 0.82 for predicting the need for mechanical ventilation. We also determined the most influential features in making this prediction, and concluded that all three categories of data are important. We determined the relative importance of blood panel profile data and noted that the AUC dropped by 0.12 units when this data was not included, thus indicating that it provided valuable information in predicting disease severity. Finally, we generated RF predictors with a reduced set of five features that retained the performance of the predictors trained on all features. These predictors, which rely only on quantitative data, are less prone to errors and subjectivity.https://doi.org/10.1038/s41598-021-83967-7
collection DOAJ
language English
format Article
sources DOAJ
author Dhruv Patel
Vikram Kher
Bhushan Desai
Xiaomeng Lei
Steven Cen
Neha Nanda
Ali Gholamrezanezhad
Vinay Duddalwar
Bino Varghese
Assad A Oberai
spellingShingle Dhruv Patel
Vikram Kher
Bhushan Desai
Xiaomeng Lei
Steven Cen
Neha Nanda
Ali Gholamrezanezhad
Vinay Duddalwar
Bino Varghese
Assad A Oberai
Machine learning based predictors for COVID-19 disease severity
Scientific Reports
author_facet Dhruv Patel
Vikram Kher
Bhushan Desai
Xiaomeng Lei
Steven Cen
Neha Nanda
Ali Gholamrezanezhad
Vinay Duddalwar
Bino Varghese
Assad A Oberai
author_sort Dhruv Patel
title Machine learning based predictors for COVID-19 disease severity
title_short Machine learning based predictors for COVID-19 disease severity
title_full Machine learning based predictors for COVID-19 disease severity
title_fullStr Machine learning based predictors for COVID-19 disease severity
title_full_unstemmed Machine learning based predictors for COVID-19 disease severity
title_sort machine learning based predictors for covid-19 disease severity
publisher Nature Publishing Group
series Scientific Reports
issn 2045-2322
publishDate 2021-02-01
description Abstract Predictors of the need for intensive care and mechanical ventilation can help healthcare systems in planning for surge capacity for COVID-19. We used socio-demographic data, clinical data, and blood panel profile data at the time of initial presentation to develop machine learning algorithms for predicting the need for intensive care and mechanical ventilation. Among the algorithms considered, the Random Forest classifier performed the best with $$\text {AUC} = 0.80$$ AUC = 0.80 for predicting ICU need and $$\text {AUC} = 0.82$$ AUC = 0.82 for predicting the need for mechanical ventilation. We also determined the most influential features in making this prediction, and concluded that all three categories of data are important. We determined the relative importance of blood panel profile data and noted that the AUC dropped by 0.12 units when this data was not included, thus indicating that it provided valuable information in predicting disease severity. Finally, we generated RF predictors with a reduced set of five features that retained the performance of the predictors trained on all features. These predictors, which rely only on quantitative data, are less prone to errors and subjectivity.
url https://doi.org/10.1038/s41598-021-83967-7
work_keys_str_mv AT dhruvpatel machinelearningbasedpredictorsforcovid19diseaseseverity
AT vikramkher machinelearningbasedpredictorsforcovid19diseaseseverity
AT bhushandesai machinelearningbasedpredictorsforcovid19diseaseseverity
AT xiaomenglei machinelearningbasedpredictorsforcovid19diseaseseverity
AT stevencen machinelearningbasedpredictorsforcovid19diseaseseverity
AT nehananda machinelearningbasedpredictorsforcovid19diseaseseverity
AT aligholamrezanezhad machinelearningbasedpredictorsforcovid19diseaseseverity
AT vinayduddalwar machinelearningbasedpredictorsforcovid19diseaseseverity
AT binovarghese machinelearningbasedpredictorsforcovid19diseaseseverity
AT assadaoberai machinelearningbasedpredictorsforcovid19diseaseseverity
_version_ 1724224612305207296