Machine learning applied to near-infrared spectra for clinical pleural effusion classification

Abstract Lung cancer patients with malignant pleural effusions (MPE) have a particular poor prognosis. It is crucial to distinguish MPE from benign pleural effusion (BPE). The present study aims to develop a rapid, convenient and economical diagnostic method based on FTIR near-infrared spectroscopy...

Full description

Bibliographic Details
Main Authors: Zhongjian Chen, Keke Chen, Yan Lou, Jing Zhu, Weimin Mao, Zhengbo Song
Format: Article
Language:English
Published: Nature Publishing Group 2021-05-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-021-87736-4
id doaj-3b349c99e6ef40e2a6dfbd1c8cdf50b5
record_format Article
spelling doaj-3b349c99e6ef40e2a6dfbd1c8cdf50b52021-05-09T11:33:44ZengNature Publishing GroupScientific Reports2045-23222021-05-011111810.1038/s41598-021-87736-4Machine learning applied to near-infrared spectra for clinical pleural effusion classificationZhongjian Chen0Keke Chen1Yan Lou2Jing Zhu3Weimin Mao4Zhengbo Song5Cancer Hospital of the University of Chinese Academy of Sciences, Chinese Academy of SciencesCancer Hospital of the University of Chinese Academy of Sciences, Chinese Academy of SciencesIntensive Care Unit, Zhejiang Medical & Health Group Hangzhou HospitalCancer Hospital of the University of Chinese Academy of Sciences, Chinese Academy of SciencesCancer Hospital of the University of Chinese Academy of Sciences, Chinese Academy of SciencesCancer Hospital of the University of Chinese Academy of Sciences, Chinese Academy of SciencesAbstract Lung cancer patients with malignant pleural effusions (MPE) have a particular poor prognosis. It is crucial to distinguish MPE from benign pleural effusion (BPE). The present study aims to develop a rapid, convenient and economical diagnostic method based on FTIR near-infrared spectroscopy (NIRS) combined with machine learning strategy for clinical pleural effusion classification. NIRS spectra were recorded for 47 MPE samples and 35 BPE samples. The sample data were randomly divided into train set (n = 62) and test set (n = 20). Partial least squares, random forest, support vector machine (SVM), and gradient boosting machine models were trained, and subsequent predictive performance were predicted on the test set. Besides the whole spectra used in modeling, selected features using SVM recursive feature elimination algorithm were also investigated in modeling. Among those models, NIRS combined with SVM showed the best predictive performance (accuracy: 1.0, kappa: 1.0, and AUCROC: 1.0). SVM with the top 50 feature wavenumbers also displayed a high predictive performance (accuracy: 0.95, kappa: 0.89, AUCROC: 0.99). Our study revealed that the combination of NIRS and machine learning is an innovative, rapid, and convenient method for clinical pleural effusion classification, and worth further evaluation.https://doi.org/10.1038/s41598-021-87736-4
collection DOAJ
language English
format Article
sources DOAJ
author Zhongjian Chen
Keke Chen
Yan Lou
Jing Zhu
Weimin Mao
Zhengbo Song
spellingShingle Zhongjian Chen
Keke Chen
Yan Lou
Jing Zhu
Weimin Mao
Zhengbo Song
Machine learning applied to near-infrared spectra for clinical pleural effusion classification
Scientific Reports
author_facet Zhongjian Chen
Keke Chen
Yan Lou
Jing Zhu
Weimin Mao
Zhengbo Song
author_sort Zhongjian Chen
title Machine learning applied to near-infrared spectra for clinical pleural effusion classification
title_short Machine learning applied to near-infrared spectra for clinical pleural effusion classification
title_full Machine learning applied to near-infrared spectra for clinical pleural effusion classification
title_fullStr Machine learning applied to near-infrared spectra for clinical pleural effusion classification
title_full_unstemmed Machine learning applied to near-infrared spectra for clinical pleural effusion classification
title_sort machine learning applied to near-infrared spectra for clinical pleural effusion classification
publisher Nature Publishing Group
series Scientific Reports
issn 2045-2322
publishDate 2021-05-01
description Abstract Lung cancer patients with malignant pleural effusions (MPE) have a particular poor prognosis. It is crucial to distinguish MPE from benign pleural effusion (BPE). The present study aims to develop a rapid, convenient and economical diagnostic method based on FTIR near-infrared spectroscopy (NIRS) combined with machine learning strategy for clinical pleural effusion classification. NIRS spectra were recorded for 47 MPE samples and 35 BPE samples. The sample data were randomly divided into train set (n = 62) and test set (n = 20). Partial least squares, random forest, support vector machine (SVM), and gradient boosting machine models were trained, and subsequent predictive performance were predicted on the test set. Besides the whole spectra used in modeling, selected features using SVM recursive feature elimination algorithm were also investigated in modeling. Among those models, NIRS combined with SVM showed the best predictive performance (accuracy: 1.0, kappa: 1.0, and AUCROC: 1.0). SVM with the top 50 feature wavenumbers also displayed a high predictive performance (accuracy: 0.95, kappa: 0.89, AUCROC: 0.99). Our study revealed that the combination of NIRS and machine learning is an innovative, rapid, and convenient method for clinical pleural effusion classification, and worth further evaluation.
url https://doi.org/10.1038/s41598-021-87736-4
work_keys_str_mv AT zhongjianchen machinelearningappliedtonearinfraredspectraforclinicalpleuraleffusionclassification
AT kekechen machinelearningappliedtonearinfraredspectraforclinicalpleuraleffusionclassification
AT yanlou machinelearningappliedtonearinfraredspectraforclinicalpleuraleffusionclassification
AT jingzhu machinelearningappliedtonearinfraredspectraforclinicalpleuraleffusionclassification
AT weiminmao machinelearningappliedtonearinfraredspectraforclinicalpleuraleffusionclassification
AT zhengbosong machinelearningappliedtonearinfraredspectraforclinicalpleuraleffusionclassification
_version_ 1721454249912565760