Exploring Volatile Organic Compounds in Breath for High-Accuracy Prediction of Lung Cancer

(1) Background: Lung cancer is silent in its early stages and fatal in its advanced stages. The current examinations for lung cancer are usually based on imaging. Conventional chest X-rays lack accuracy, and chest computed tomography (CT) is associated with radiation exposure and cost, limiting scre...

Full description

Bibliographic Details
Main Authors: Ping-Hsien Tsou, Zong-Lin Lin, Yu-Chiang Pan, Hui-Chen Yang, Chien-Jen Chang, Sheng-Kai Liang, Yueh-Feng Wen, Chia-Hao Chang, Lih-Yu Chang, Kai-Lun Yu, Chia-Jung Liu, Li-Ta Keng, Meng-Rui Lee, Jen-Chung Ko, Guan-Hua Huang, Yaw-Kuen Li
Format: Article
Language:English
Published: MDPI AG 2021-03-01
Series:Cancers
Subjects:
Online Access:https://www.mdpi.com/2072-6694/13/6/1431
id doaj-8ea0ef8a6f7f4c708bdcebedddcedf9f
record_format Article
spelling doaj-8ea0ef8a6f7f4c708bdcebedddcedf9f2021-03-22T00:00:43ZengMDPI AGCancers2072-66942021-03-01131431143110.3390/cancers13061431Exploring Volatile Organic Compounds in Breath for High-Accuracy Prediction of Lung CancerPing-Hsien Tsou0Zong-Lin Lin1Yu-Chiang Pan2Hui-Chen Yang3Chien-Jen Chang4Sheng-Kai Liang5Yueh-Feng Wen6Chia-Hao Chang7Lih-Yu Chang8Kai-Lun Yu9Chia-Jung Liu10Li-Ta Keng11Meng-Rui Lee12Jen-Chung Ko13Guan-Hua Huang14Yaw-Kuen Li15Department of Internal Medicine, National Taiwan University Hospital, Hsin-Chu Branch, Hsin-Chu 30059, TaiwanInstitute of Statistics, National Chiao Tung University, Hsin-Chu 30010, TaiwanCenter for Emergent Functional Matter Science, National Chiao Tung University, Hsin-Chu 30010, TaiwanDepartment of Internal Medicine, National Taiwan University Hospital, Hsin-Chu Branch, Hsin-Chu 30059, TaiwanDepartment of Internal Medicine, National Taiwan University Hospital, Hsin-Chu Branch, Hsin-Chu 30059, TaiwanDepartment of Internal Medicine, National Taiwan University Hospital, Hsin-Chu Branch, Hsin-Chu 30059, TaiwanDepartment of Internal Medicine, National Taiwan University Hospital, Hsin-Chu Branch, Hsin-Chu 30059, TaiwanDepartment of Internal Medicine, National Taiwan University Hospital, Hsin-Chu Branch, Hsin-Chu 30059, TaiwanDepartment of Internal Medicine, National Taiwan University Hospital, Hsin-Chu Branch, Hsin-Chu 30059, TaiwanDepartment of Internal Medicine, National Taiwan University Hospital, Hsin-Chu Branch, Hsin-Chu 30059, TaiwanDepartment of Internal Medicine, National Taiwan University Hospital, Hsin-Chu Branch, Hsin-Chu 30059, TaiwanDepartment of Internal Medicine, National Taiwan University Hospital, Hsin-Chu Branch, Hsin-Chu 30059, TaiwanDepartment of Internal Medicine, National Taiwan University Hospital, Hsin-Chu Branch, Hsin-Chu 30059, TaiwanDepartment of Internal Medicine, National Taiwan University Hospital, Hsin-Chu Branch, Hsin-Chu 30059, TaiwanInstitute of Statistics, National Chiao Tung University, Hsin-Chu 30010, TaiwanCenter for Emergent Functional Matter Science, National Chiao Tung University, Hsin-Chu 30010, Taiwan(1) Background: Lung cancer is silent in its early stages and fatal in its advanced stages. The current examinations for lung cancer are usually based on imaging. Conventional chest X-rays lack accuracy, and chest computed tomography (CT) is associated with radiation exposure and cost, limiting screening effectiveness. Breathomics, a noninvasive strategy, has recently been studied extensively. Volatile organic compounds (VOCs) derived from human breath can reflect metabolic changes caused by diseases and possibly serve as biomarkers of lung cancer. (2) Methods: The selected ion flow tube mass spectrometry (SIFT-MS) technique was used to quantitatively analyze 116 VOCs in breath samples from 148 patients with histologically confirmed lung cancers and 168 healthy volunteers. We used eXtreme Gradient Boosting (XGBoost), a machine learning method, to build a model for predicting lung cancer occurrence based on quantitative VOC measurements. (3) Results: The proposed prediction model achieved better performance than other previous approaches, with an accuracy, sensitivity, specificity, and area under the curve (AUC) of 0.89, 0.82, 0.94, and 0.95, respectively. When we further adjusted the confounding effect of environmental VOCs on the relationship between participants’ exhaled VOCs and lung cancer occurrence, our model was improved to reach 0.92 accuracy, 0.96 sensitivity, 0.88 specificity, and 0.98 AUC. (4) Conclusion: A quantitative VOCs databank integrated with the application of an XGBoost classifier provides a persuasive platform for lung cancer prediction.https://www.mdpi.com/2072-6694/13/6/1431volatile organic compoundsSIFT-MSXGBoostlung cancerbreath analysismachine learning
collection DOAJ
language English
format Article
sources DOAJ
author Ping-Hsien Tsou
Zong-Lin Lin
Yu-Chiang Pan
Hui-Chen Yang
Chien-Jen Chang
Sheng-Kai Liang
Yueh-Feng Wen
Chia-Hao Chang
Lih-Yu Chang
Kai-Lun Yu
Chia-Jung Liu
Li-Ta Keng
Meng-Rui Lee
Jen-Chung Ko
Guan-Hua Huang
Yaw-Kuen Li
spellingShingle Ping-Hsien Tsou
Zong-Lin Lin
Yu-Chiang Pan
Hui-Chen Yang
Chien-Jen Chang
Sheng-Kai Liang
Yueh-Feng Wen
Chia-Hao Chang
Lih-Yu Chang
Kai-Lun Yu
Chia-Jung Liu
Li-Ta Keng
Meng-Rui Lee
Jen-Chung Ko
Guan-Hua Huang
Yaw-Kuen Li
Exploring Volatile Organic Compounds in Breath for High-Accuracy Prediction of Lung Cancer
Cancers
volatile organic compounds
SIFT-MS
XGBoost
lung cancer
breath analysis
machine learning
author_facet Ping-Hsien Tsou
Zong-Lin Lin
Yu-Chiang Pan
Hui-Chen Yang
Chien-Jen Chang
Sheng-Kai Liang
Yueh-Feng Wen
Chia-Hao Chang
Lih-Yu Chang
Kai-Lun Yu
Chia-Jung Liu
Li-Ta Keng
Meng-Rui Lee
Jen-Chung Ko
Guan-Hua Huang
Yaw-Kuen Li
author_sort Ping-Hsien Tsou
title Exploring Volatile Organic Compounds in Breath for High-Accuracy Prediction of Lung Cancer
title_short Exploring Volatile Organic Compounds in Breath for High-Accuracy Prediction of Lung Cancer
title_full Exploring Volatile Organic Compounds in Breath for High-Accuracy Prediction of Lung Cancer
title_fullStr Exploring Volatile Organic Compounds in Breath for High-Accuracy Prediction of Lung Cancer
title_full_unstemmed Exploring Volatile Organic Compounds in Breath for High-Accuracy Prediction of Lung Cancer
title_sort exploring volatile organic compounds in breath for high-accuracy prediction of lung cancer
publisher MDPI AG
series Cancers
issn 2072-6694
publishDate 2021-03-01
description (1) Background: Lung cancer is silent in its early stages and fatal in its advanced stages. The current examinations for lung cancer are usually based on imaging. Conventional chest X-rays lack accuracy, and chest computed tomography (CT) is associated with radiation exposure and cost, limiting screening effectiveness. Breathomics, a noninvasive strategy, has recently been studied extensively. Volatile organic compounds (VOCs) derived from human breath can reflect metabolic changes caused by diseases and possibly serve as biomarkers of lung cancer. (2) Methods: The selected ion flow tube mass spectrometry (SIFT-MS) technique was used to quantitatively analyze 116 VOCs in breath samples from 148 patients with histologically confirmed lung cancers and 168 healthy volunteers. We used eXtreme Gradient Boosting (XGBoost), a machine learning method, to build a model for predicting lung cancer occurrence based on quantitative VOC measurements. (3) Results: The proposed prediction model achieved better performance than other previous approaches, with an accuracy, sensitivity, specificity, and area under the curve (AUC) of 0.89, 0.82, 0.94, and 0.95, respectively. When we further adjusted the confounding effect of environmental VOCs on the relationship between participants’ exhaled VOCs and lung cancer occurrence, our model was improved to reach 0.92 accuracy, 0.96 sensitivity, 0.88 specificity, and 0.98 AUC. (4) Conclusion: A quantitative VOCs databank integrated with the application of an XGBoost classifier provides a persuasive platform for lung cancer prediction.
topic volatile organic compounds
SIFT-MS
XGBoost
lung cancer
breath analysis
machine learning
url https://www.mdpi.com/2072-6694/13/6/1431
work_keys_str_mv AT pinghsientsou exploringvolatileorganiccompoundsinbreathforhighaccuracypredictionoflungcancer
AT zonglinlin exploringvolatileorganiccompoundsinbreathforhighaccuracypredictionoflungcancer
AT yuchiangpan exploringvolatileorganiccompoundsinbreathforhighaccuracypredictionoflungcancer
AT huichenyang exploringvolatileorganiccompoundsinbreathforhighaccuracypredictionoflungcancer
AT chienjenchang exploringvolatileorganiccompoundsinbreathforhighaccuracypredictionoflungcancer
AT shengkailiang exploringvolatileorganiccompoundsinbreathforhighaccuracypredictionoflungcancer
AT yuehfengwen exploringvolatileorganiccompoundsinbreathforhighaccuracypredictionoflungcancer
AT chiahaochang exploringvolatileorganiccompoundsinbreathforhighaccuracypredictionoflungcancer
AT lihyuchang exploringvolatileorganiccompoundsinbreathforhighaccuracypredictionoflungcancer
AT kailunyu exploringvolatileorganiccompoundsinbreathforhighaccuracypredictionoflungcancer
AT chiajungliu exploringvolatileorganiccompoundsinbreathforhighaccuracypredictionoflungcancer
AT litakeng exploringvolatileorganiccompoundsinbreathforhighaccuracypredictionoflungcancer
AT mengruilee exploringvolatileorganiccompoundsinbreathforhighaccuracypredictionoflungcancer
AT jenchungko exploringvolatileorganiccompoundsinbreathforhighaccuracypredictionoflungcancer
AT guanhuahuang exploringvolatileorganiccompoundsinbreathforhighaccuracypredictionoflungcancer
AT yawkuenli exploringvolatileorganiccompoundsinbreathforhighaccuracypredictionoflungcancer
_version_ 1724209720680513536