Computer-aided diagnosis of lung nodule using gradient tree boosting and Bayesian optimization.

We aimed to evaluate a computer-aided diagnosis (CADx) system for lung nodule classification focussing on (i) usefulness of the conventional CADx system (hand-crafted imaging feature + machine learning algorithm), (ii) comparison between support vector machine (SVM) and gradient tree boosting (XGBoo...

Full description

Bibliographic Details
Main Authors: Mizuho Nishio, Mitsuo Nishizawa, Osamu Sugiyama, Ryosuke Kojima, Masahiro Yakami, Tomohiro Kuroda, Kaori Togashi
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2018-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC5908232?pdf=render
id doaj-dcba7553c7994c0b928037b64b1e7ef6
record_format Article
spelling doaj-dcba7553c7994c0b928037b64b1e7ef62020-11-25T01:24:20ZengPublic Library of Science (PLoS)PLoS ONE1932-62032018-01-01134e019587510.1371/journal.pone.0195875Computer-aided diagnosis of lung nodule using gradient tree boosting and Bayesian optimization.Mizuho NishioMitsuo NishizawaOsamu SugiyamaRyosuke KojimaMasahiro YakamiTomohiro KurodaKaori TogashiWe aimed to evaluate a computer-aided diagnosis (CADx) system for lung nodule classification focussing on (i) usefulness of the conventional CADx system (hand-crafted imaging feature + machine learning algorithm), (ii) comparison between support vector machine (SVM) and gradient tree boosting (XGBoost) as machine learning algorithms, and (iii) effectiveness of parameter optimization using Bayesian optimization and random search. Data on 99 lung nodules (62 lung cancers and 37 benign lung nodules) were included from public databases of CT images. A variant of the local binary pattern was used for calculating a feature vector. SVM or XGBoost was trained using the feature vector and its corresponding label. Tree Parzen Estimator (TPE) was used as Bayesian optimization for parameters of SVM and XGBoost. Random search was done for comparison with TPE. Leave-one-out cross-validation was used for optimizing and evaluating the performance of our CADx system. Performance was evaluated using area under the curve (AUC) of receiver operating characteristic analysis. AUC was calculated 10 times, and its average was obtained. The best averaged AUC of SVM and XGBoost was 0.850 and 0.896, respectively; both were obtained using TPE. XGBoost was generally superior to SVM. Optimal parameters for achieving high AUC were obtained with fewer numbers of trials when using TPE, compared with random search. Bayesian optimization of SVM and XGBoost parameters was more efficient than random search. Based on observer study, AUC values of two board-certified radiologists were 0.898 and 0.822. The results show that diagnostic accuracy of our CADx system was comparable to that of radiologists with respect to classifying lung nodules.http://europepmc.org/articles/PMC5908232?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Mizuho Nishio
Mitsuo Nishizawa
Osamu Sugiyama
Ryosuke Kojima
Masahiro Yakami
Tomohiro Kuroda
Kaori Togashi
spellingShingle Mizuho Nishio
Mitsuo Nishizawa
Osamu Sugiyama
Ryosuke Kojima
Masahiro Yakami
Tomohiro Kuroda
Kaori Togashi
Computer-aided diagnosis of lung nodule using gradient tree boosting and Bayesian optimization.
PLoS ONE
author_facet Mizuho Nishio
Mitsuo Nishizawa
Osamu Sugiyama
Ryosuke Kojima
Masahiro Yakami
Tomohiro Kuroda
Kaori Togashi
author_sort Mizuho Nishio
title Computer-aided diagnosis of lung nodule using gradient tree boosting and Bayesian optimization.
title_short Computer-aided diagnosis of lung nodule using gradient tree boosting and Bayesian optimization.
title_full Computer-aided diagnosis of lung nodule using gradient tree boosting and Bayesian optimization.
title_fullStr Computer-aided diagnosis of lung nodule using gradient tree boosting and Bayesian optimization.
title_full_unstemmed Computer-aided diagnosis of lung nodule using gradient tree boosting and Bayesian optimization.
title_sort computer-aided diagnosis of lung nodule using gradient tree boosting and bayesian optimization.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2018-01-01
description We aimed to evaluate a computer-aided diagnosis (CADx) system for lung nodule classification focussing on (i) usefulness of the conventional CADx system (hand-crafted imaging feature + machine learning algorithm), (ii) comparison between support vector machine (SVM) and gradient tree boosting (XGBoost) as machine learning algorithms, and (iii) effectiveness of parameter optimization using Bayesian optimization and random search. Data on 99 lung nodules (62 lung cancers and 37 benign lung nodules) were included from public databases of CT images. A variant of the local binary pattern was used for calculating a feature vector. SVM or XGBoost was trained using the feature vector and its corresponding label. Tree Parzen Estimator (TPE) was used as Bayesian optimization for parameters of SVM and XGBoost. Random search was done for comparison with TPE. Leave-one-out cross-validation was used for optimizing and evaluating the performance of our CADx system. Performance was evaluated using area under the curve (AUC) of receiver operating characteristic analysis. AUC was calculated 10 times, and its average was obtained. The best averaged AUC of SVM and XGBoost was 0.850 and 0.896, respectively; both were obtained using TPE. XGBoost was generally superior to SVM. Optimal parameters for achieving high AUC were obtained with fewer numbers of trials when using TPE, compared with random search. Bayesian optimization of SVM and XGBoost parameters was more efficient than random search. Based on observer study, AUC values of two board-certified radiologists were 0.898 and 0.822. The results show that diagnostic accuracy of our CADx system was comparable to that of radiologists with respect to classifying lung nodules.
url http://europepmc.org/articles/PMC5908232?pdf=render
work_keys_str_mv AT mizuhonishio computeraideddiagnosisoflungnoduleusinggradienttreeboostingandbayesianoptimization
AT mitsuonishizawa computeraideddiagnosisoflungnoduleusinggradienttreeboostingandbayesianoptimization
AT osamusugiyama computeraideddiagnosisoflungnoduleusinggradienttreeboostingandbayesianoptimization
AT ryosukekojima computeraideddiagnosisoflungnoduleusinggradienttreeboostingandbayesianoptimization
AT masahiroyakami computeraideddiagnosisoflungnoduleusinggradienttreeboostingandbayesianoptimization
AT tomohirokuroda computeraideddiagnosisoflungnoduleusinggradienttreeboostingandbayesianoptimization
AT kaoritogashi computeraideddiagnosisoflungnoduleusinggradienttreeboostingandbayesianoptimization
_version_ 1725117754997997568