A Comprehensive Evaluation and Benchmarking of Convolutional Neural Networks for Melanoma Diagnosis

Melanoma is the most invasive skin cancer with the highest risk of death. While it is a serious skin cancer, it is highly curable if detected early. Melanoma diagnosis is difficult, even for experienced dermatologists, due to the wide range of morphologies in skin lesions. Given the rapid developmen...

Full description

Bibliographic Details
Main Authors: Saeed Alzahrani, Baidaa Al-Bander, Waleed Al-Nuaimy
Format: Article
Language:English
Published: MDPI AG 2021-09-01
Series:Cancers
Subjects:
Online Access:https://www.mdpi.com/2072-6694/13/17/4494
id doaj-ca9623a124814657b4af3c1b60216779
record_format Article
spelling doaj-ca9623a124814657b4af3c1b602167792021-09-09T13:41:14ZengMDPI AGCancers2072-66942021-09-01134494449410.3390/cancers13174494A Comprehensive Evaluation and Benchmarking of Convolutional Neural Networks for Melanoma DiagnosisSaeed Alzahrani0Baidaa Al-Bander1Waleed Al-Nuaimy2Department of Electrical Engineering and Electronics, University of Liverpool, Liverpool L69 3GJ, UKDepartment of Computer Engineering, University of Diyala, Baqubah 32010, IraqDepartment of Electrical Engineering and Electronics, University of Liverpool, Liverpool L69 3GJ, UKMelanoma is the most invasive skin cancer with the highest risk of death. While it is a serious skin cancer, it is highly curable if detected early. Melanoma diagnosis is difficult, even for experienced dermatologists, due to the wide range of morphologies in skin lesions. Given the rapid development of deep learning algorithms for melanoma diagnosis, it is crucial to validate and benchmark these models, which is the main challenge of this work. This research presents a new benchmarking and selection approach based on the multi-criteria analysis method (MCDM), which integrates entropy and the preference ranking organization method for enrichment of evaluations (PROMETHEE) methods. The experimental study is carried out in four phases. Firstly, 19 convolution neural networks (CNNs) are trained and evaluated on a public dataset of 991 dermoscopic images. Secondly, to obtain the decision matrix, 10 criteria, including accuracy, classification error, precision, sensitivity, specificity, F1-score, false-positive rate, false-negative rate, Matthews correlation coefficient (MCC), and the number of parameters are established. Third, entropy and PROMETHEE methods are integrated to determine the weights of criteria and rank the models. Fourth, the proposed benchmarking framework is validated using the VIKOR method. The obtained results reveal that the ResNet101 model is selected as the optimal diagnosis model for melanoma in our case study data. Thus, the presented benchmarking framework is proven to be useful at exposing the optimal melanoma diagnosis model targeting to ease the selection process of the proper convolutional neural network architecture.https://www.mdpi.com/2072-6694/13/17/4494melanomaconvolution neural networksbenchmarking
collection DOAJ
language English
format Article
sources DOAJ
author Saeed Alzahrani
Baidaa Al-Bander
Waleed Al-Nuaimy
spellingShingle Saeed Alzahrani
Baidaa Al-Bander
Waleed Al-Nuaimy
A Comprehensive Evaluation and Benchmarking of Convolutional Neural Networks for Melanoma Diagnosis
Cancers
melanoma
convolution neural networks
benchmarking
author_facet Saeed Alzahrani
Baidaa Al-Bander
Waleed Al-Nuaimy
author_sort Saeed Alzahrani
title A Comprehensive Evaluation and Benchmarking of Convolutional Neural Networks for Melanoma Diagnosis
title_short A Comprehensive Evaluation and Benchmarking of Convolutional Neural Networks for Melanoma Diagnosis
title_full A Comprehensive Evaluation and Benchmarking of Convolutional Neural Networks for Melanoma Diagnosis
title_fullStr A Comprehensive Evaluation and Benchmarking of Convolutional Neural Networks for Melanoma Diagnosis
title_full_unstemmed A Comprehensive Evaluation and Benchmarking of Convolutional Neural Networks for Melanoma Diagnosis
title_sort comprehensive evaluation and benchmarking of convolutional neural networks for melanoma diagnosis
publisher MDPI AG
series Cancers
issn 2072-6694
publishDate 2021-09-01
description Melanoma is the most invasive skin cancer with the highest risk of death. While it is a serious skin cancer, it is highly curable if detected early. Melanoma diagnosis is difficult, even for experienced dermatologists, due to the wide range of morphologies in skin lesions. Given the rapid development of deep learning algorithms for melanoma diagnosis, it is crucial to validate and benchmark these models, which is the main challenge of this work. This research presents a new benchmarking and selection approach based on the multi-criteria analysis method (MCDM), which integrates entropy and the preference ranking organization method for enrichment of evaluations (PROMETHEE) methods. The experimental study is carried out in four phases. Firstly, 19 convolution neural networks (CNNs) are trained and evaluated on a public dataset of 991 dermoscopic images. Secondly, to obtain the decision matrix, 10 criteria, including accuracy, classification error, precision, sensitivity, specificity, F1-score, false-positive rate, false-negative rate, Matthews correlation coefficient (MCC), and the number of parameters are established. Third, entropy and PROMETHEE methods are integrated to determine the weights of criteria and rank the models. Fourth, the proposed benchmarking framework is validated using the VIKOR method. The obtained results reveal that the ResNet101 model is selected as the optimal diagnosis model for melanoma in our case study data. Thus, the presented benchmarking framework is proven to be useful at exposing the optimal melanoma diagnosis model targeting to ease the selection process of the proper convolutional neural network architecture.
topic melanoma
convolution neural networks
benchmarking
url https://www.mdpi.com/2072-6694/13/17/4494
work_keys_str_mv AT saeedalzahrani acomprehensiveevaluationandbenchmarkingofconvolutionalneuralnetworksformelanomadiagnosis
AT baidaaalbander acomprehensiveevaluationandbenchmarkingofconvolutionalneuralnetworksformelanomadiagnosis
AT waleedalnuaimy acomprehensiveevaluationandbenchmarkingofconvolutionalneuralnetworksformelanomadiagnosis
AT saeedalzahrani comprehensiveevaluationandbenchmarkingofconvolutionalneuralnetworksformelanomadiagnosis
AT baidaaalbander comprehensiveevaluationandbenchmarkingofconvolutionalneuralnetworksformelanomadiagnosis
AT waleedalnuaimy comprehensiveevaluationandbenchmarkingofconvolutionalneuralnetworksformelanomadiagnosis
_version_ 1717760667617853440