Establishing Classifiers With Clinical Laboratory Indicators to Distinguish COVID-19 From Community-Acquired Pneumonia: Retrospective Cohort Study

BackgroundThe initial symptoms of patients with COVID-19 are very much like those of patients with community-acquired pneumonia (CAP); it is difficult to distinguish COVID-19 from CAP with clinical symptoms and imaging examination. ObjectiveThe objective of our st...

Full description

Bibliographic Details
Main Authors: Dai, Wanfa, Ke, Pei-Feng, Li, Zhen-Zhen, Zhuang, Qi-Zhen, Huang, Wei, Wang, Yi, Xiong, Yujuan, Huang, Xian-Zhang
Format: Article
Language:English
Published: JMIR Publications 2021-02-01
Series:Journal of Medical Internet Research
Online Access:https://www.jmir.org/2021/2/e23390
id doaj-3ed4b9d414c64a7fa610d6f2728248c6
record_format Article
spelling doaj-3ed4b9d414c64a7fa610d6f2728248c62021-04-02T18:40:24ZengJMIR PublicationsJournal of Medical Internet Research1438-88712021-02-01232e2339010.2196/23390Establishing Classifiers With Clinical Laboratory Indicators to Distinguish COVID-19 From Community-Acquired Pneumonia: Retrospective Cohort StudyDai, WanfaKe, Pei-FengLi, Zhen-ZhenZhuang, Qi-ZhenHuang, WeiWang, YiXiong, YujuanHuang, Xian-Zhang BackgroundThe initial symptoms of patients with COVID-19 are very much like those of patients with community-acquired pneumonia (CAP); it is difficult to distinguish COVID-19 from CAP with clinical symptoms and imaging examination. ObjectiveThe objective of our study was to construct an effective model for the early identification of COVID-19 that would also distinguish it from CAP. MethodsThe clinical laboratory indicators (CLIs) of 61 COVID-19 patients and 60 CAP patients were analyzed retrospectively. Random combinations of various CLIs (ie, CLI combinations) were utilized to establish COVID-19 versus CAP classifiers with machine learning algorithms, including random forest classifier (RFC), logistic regression classifier, and gradient boosting classifier (GBC). The performance of the classifiers was assessed by calculating the area under the receiver operating characteristic curve (AUROC) and recall rate in COVID-19 prediction using the test data set. ResultsThe classifiers that were constructed with three algorithms from 43 CLI combinations showed high performance (recall rate >0.9 and AUROC >0.85) in COVID-19 prediction for the test data set. Among the high-performance classifiers, several CLIs showed a high usage rate; these included procalcitonin (PCT), mean corpuscular hemoglobin concentration (MCHC), uric acid, albumin, albumin to globulin ratio (AGR), neutrophil count, red blood cell (RBC) count, monocyte count, basophil count, and white blood cell (WBC) count. They also had high feature importance except for basophil count. The feature combination (FC) of PCT, AGR, uric acid, WBC count, neutrophil count, basophil count, RBC count, and MCHC was the representative one among the nine FCs used to construct the classifiers with an AUROC equal to 1.0 when using the RFC or GBC algorithms. Replacing any CLI in these FCs would lead to a significant reduction in the performance of the classifiers that were built with them. ConclusionsThe classifiers constructed with only a few specific CLIs could efficiently distinguish COVID-19 from CAP, which could help clinicians perform early isolation and centralized management of COVID-19 patients.https://www.jmir.org/2021/2/e23390
collection DOAJ
language English
format Article
sources DOAJ
author Dai, Wanfa
Ke, Pei-Feng
Li, Zhen-Zhen
Zhuang, Qi-Zhen
Huang, Wei
Wang, Yi
Xiong, Yujuan
Huang, Xian-Zhang
spellingShingle Dai, Wanfa
Ke, Pei-Feng
Li, Zhen-Zhen
Zhuang, Qi-Zhen
Huang, Wei
Wang, Yi
Xiong, Yujuan
Huang, Xian-Zhang
Establishing Classifiers With Clinical Laboratory Indicators to Distinguish COVID-19 From Community-Acquired Pneumonia: Retrospective Cohort Study
Journal of Medical Internet Research
author_facet Dai, Wanfa
Ke, Pei-Feng
Li, Zhen-Zhen
Zhuang, Qi-Zhen
Huang, Wei
Wang, Yi
Xiong, Yujuan
Huang, Xian-Zhang
author_sort Dai, Wanfa
title Establishing Classifiers With Clinical Laboratory Indicators to Distinguish COVID-19 From Community-Acquired Pneumonia: Retrospective Cohort Study
title_short Establishing Classifiers With Clinical Laboratory Indicators to Distinguish COVID-19 From Community-Acquired Pneumonia: Retrospective Cohort Study
title_full Establishing Classifiers With Clinical Laboratory Indicators to Distinguish COVID-19 From Community-Acquired Pneumonia: Retrospective Cohort Study
title_fullStr Establishing Classifiers With Clinical Laboratory Indicators to Distinguish COVID-19 From Community-Acquired Pneumonia: Retrospective Cohort Study
title_full_unstemmed Establishing Classifiers With Clinical Laboratory Indicators to Distinguish COVID-19 From Community-Acquired Pneumonia: Retrospective Cohort Study
title_sort establishing classifiers with clinical laboratory indicators to distinguish covid-19 from community-acquired pneumonia: retrospective cohort study
publisher JMIR Publications
series Journal of Medical Internet Research
issn 1438-8871
publishDate 2021-02-01
description BackgroundThe initial symptoms of patients with COVID-19 are very much like those of patients with community-acquired pneumonia (CAP); it is difficult to distinguish COVID-19 from CAP with clinical symptoms and imaging examination. ObjectiveThe objective of our study was to construct an effective model for the early identification of COVID-19 that would also distinguish it from CAP. MethodsThe clinical laboratory indicators (CLIs) of 61 COVID-19 patients and 60 CAP patients were analyzed retrospectively. Random combinations of various CLIs (ie, CLI combinations) were utilized to establish COVID-19 versus CAP classifiers with machine learning algorithms, including random forest classifier (RFC), logistic regression classifier, and gradient boosting classifier (GBC). The performance of the classifiers was assessed by calculating the area under the receiver operating characteristic curve (AUROC) and recall rate in COVID-19 prediction using the test data set. ResultsThe classifiers that were constructed with three algorithms from 43 CLI combinations showed high performance (recall rate >0.9 and AUROC >0.85) in COVID-19 prediction for the test data set. Among the high-performance classifiers, several CLIs showed a high usage rate; these included procalcitonin (PCT), mean corpuscular hemoglobin concentration (MCHC), uric acid, albumin, albumin to globulin ratio (AGR), neutrophil count, red blood cell (RBC) count, monocyte count, basophil count, and white blood cell (WBC) count. They also had high feature importance except for basophil count. The feature combination (FC) of PCT, AGR, uric acid, WBC count, neutrophil count, basophil count, RBC count, and MCHC was the representative one among the nine FCs used to construct the classifiers with an AUROC equal to 1.0 when using the RFC or GBC algorithms. Replacing any CLI in these FCs would lead to a significant reduction in the performance of the classifiers that were built with them. ConclusionsThe classifiers constructed with only a few specific CLIs could efficiently distinguish COVID-19 from CAP, which could help clinicians perform early isolation and centralized management of COVID-19 patients.
url https://www.jmir.org/2021/2/e23390
work_keys_str_mv AT daiwanfa establishingclassifierswithclinicallaboratoryindicatorstodistinguishcovid19fromcommunityacquiredpneumoniaretrospectivecohortstudy
AT kepeifeng establishingclassifierswithclinicallaboratoryindicatorstodistinguishcovid19fromcommunityacquiredpneumoniaretrospectivecohortstudy
AT lizhenzhen establishingclassifierswithclinicallaboratoryindicatorstodistinguishcovid19fromcommunityacquiredpneumoniaretrospectivecohortstudy
AT zhuangqizhen establishingclassifierswithclinicallaboratoryindicatorstodistinguishcovid19fromcommunityacquiredpneumoniaretrospectivecohortstudy
AT huangwei establishingclassifierswithclinicallaboratoryindicatorstodistinguishcovid19fromcommunityacquiredpneumoniaretrospectivecohortstudy
AT wangyi establishingclassifierswithclinicallaboratoryindicatorstodistinguishcovid19fromcommunityacquiredpneumoniaretrospectivecohortstudy
AT xiongyujuan establishingclassifierswithclinicallaboratoryindicatorstodistinguishcovid19fromcommunityacquiredpneumoniaretrospectivecohortstudy
AT huangxianzhang establishingclassifierswithclinicallaboratoryindicatorstodistinguishcovid19fromcommunityacquiredpneumoniaretrospectivecohortstudy
_version_ 1721551250098487296