Risk-Aware Machine Learning Classifier for Skin Lesion Diagnosis

Knowing when a machine learning system is not confident about its prediction is crucial in medical domains where safety is critical. Ideally, a machine learning algorithm should make a prediction only when it is highly certain about its competency, and refer the case to physicians otherwise. In this...

Full description

Bibliographic Details
Main Authors: Aryan Mobiny, Aditi Singh, Hien Van Nguyen
Format: Article
Language:English
Published: MDPI AG 2019-08-01
Series:Journal of Clinical Medicine
Subjects:
Online Access:https://www.mdpi.com/2077-0383/8/8/1241
id doaj-ad6a133255994d8e8b04115012b805f1
record_format Article
spelling doaj-ad6a133255994d8e8b04115012b805f12020-11-25T01:56:33ZengMDPI AGJournal of Clinical Medicine2077-03832019-08-0188124110.3390/jcm8081241jcm8081241Risk-Aware Machine Learning Classifier for Skin Lesion DiagnosisAryan Mobiny0Aditi Singh1Hien Van Nguyen2Department of Electrical and Computer Engineering, University of Houston, Houston, TX 77004, USADepartment of Electrical and Computer Engineering, University of Houston, Houston, TX 77004, USADepartment of Electrical and Computer Engineering, University of Houston, Houston, TX 77004, USAKnowing when a machine learning system is not confident about its prediction is crucial in medical domains where safety is critical. Ideally, a machine learning algorithm should make a prediction only when it is highly certain about its competency, and refer the case to physicians otherwise. In this paper, we investigate how Bayesian deep learning can improve the performance of the machine&#8722;physician team in the skin lesion classification task. We used the publicly available HAM10000 dataset, which includes samples from seven common skin lesion categories: Melanoma (MEL), Melanocytic Nevi (NV), Basal Cell Carcinoma (BCC), Actinic Keratoses and Intraepithelial Carcinoma (AKIEC), Benign Keratosis (BKL), Dermatofibroma (DF), and Vascular (VASC) lesions. Our experimental results show that Bayesian deep networks can boost the diagnostic performance of the standard DenseNet-169 model from 81.35% to 83.59% without incurring additional parameters or heavy computation. More importantly, a hybrid physician&#8722;machine workflow reaches a classification accuracy of <inline-formula> <math display="inline"> <semantics> <mrow> <mn>90</mn> <mo>%</mo> </mrow> </semantics> </math> </inline-formula> while only referring <inline-formula> <math display="inline"> <semantics> <mrow> <mn>35</mn> <mo>%</mo> </mrow> </semantics> </math> </inline-formula> of the cases to physicians. The findings are expected to generalize to other medical diagnosis applications. We believe that the availability of risk-aware machine learning methods will enable a wider adoption of machine learning technology in clinical settings.https://www.mdpi.com/2077-0383/8/8/1241Bayesian deep networkmodel uncertaintyMonte Carlo dropoutphysician-friendly machine learningskin lesion
collection DOAJ
language English
format Article
sources DOAJ
author Aryan Mobiny
Aditi Singh
Hien Van Nguyen
spellingShingle Aryan Mobiny
Aditi Singh
Hien Van Nguyen
Risk-Aware Machine Learning Classifier for Skin Lesion Diagnosis
Journal of Clinical Medicine
Bayesian deep network
model uncertainty
Monte Carlo dropout
physician-friendly machine learning
skin lesion
author_facet Aryan Mobiny
Aditi Singh
Hien Van Nguyen
author_sort Aryan Mobiny
title Risk-Aware Machine Learning Classifier for Skin Lesion Diagnosis
title_short Risk-Aware Machine Learning Classifier for Skin Lesion Diagnosis
title_full Risk-Aware Machine Learning Classifier for Skin Lesion Diagnosis
title_fullStr Risk-Aware Machine Learning Classifier for Skin Lesion Diagnosis
title_full_unstemmed Risk-Aware Machine Learning Classifier for Skin Lesion Diagnosis
title_sort risk-aware machine learning classifier for skin lesion diagnosis
publisher MDPI AG
series Journal of Clinical Medicine
issn 2077-0383
publishDate 2019-08-01
description Knowing when a machine learning system is not confident about its prediction is crucial in medical domains where safety is critical. Ideally, a machine learning algorithm should make a prediction only when it is highly certain about its competency, and refer the case to physicians otherwise. In this paper, we investigate how Bayesian deep learning can improve the performance of the machine&#8722;physician team in the skin lesion classification task. We used the publicly available HAM10000 dataset, which includes samples from seven common skin lesion categories: Melanoma (MEL), Melanocytic Nevi (NV), Basal Cell Carcinoma (BCC), Actinic Keratoses and Intraepithelial Carcinoma (AKIEC), Benign Keratosis (BKL), Dermatofibroma (DF), and Vascular (VASC) lesions. Our experimental results show that Bayesian deep networks can boost the diagnostic performance of the standard DenseNet-169 model from 81.35% to 83.59% without incurring additional parameters or heavy computation. More importantly, a hybrid physician&#8722;machine workflow reaches a classification accuracy of <inline-formula> <math display="inline"> <semantics> <mrow> <mn>90</mn> <mo>%</mo> </mrow> </semantics> </math> </inline-formula> while only referring <inline-formula> <math display="inline"> <semantics> <mrow> <mn>35</mn> <mo>%</mo> </mrow> </semantics> </math> </inline-formula> of the cases to physicians. The findings are expected to generalize to other medical diagnosis applications. We believe that the availability of risk-aware machine learning methods will enable a wider adoption of machine learning technology in clinical settings.
topic Bayesian deep network
model uncertainty
Monte Carlo dropout
physician-friendly machine learning
skin lesion
url https://www.mdpi.com/2077-0383/8/8/1241
work_keys_str_mv AT aryanmobiny riskawaremachinelearningclassifierforskinlesiondiagnosis
AT aditisingh riskawaremachinelearningclassifierforskinlesiondiagnosis
AT hienvannguyen riskawaremachinelearningclassifierforskinlesiondiagnosis
_version_ 1724979421044015104