Generalizability of Deep Learning Models for Caries Detection in Near-Infrared Light Transillumination Images

Objectives: The present study aimed to train deep convolutional neural networks (CNNs) to detect caries lesions on Near-Infrared Light Transillumination (NILT) imagery obtained either in vitro or in vivo and to assess the models’ generalizability. Methods: In vitro, 226 extracted posterior permanent...

Full description

Bibliographic Details
Main Authors: Agnes Holtkamp, Karim Elhennawy, José E. Cejudo Grano de Oro, Joachim Krois, Sebastian Paris, Falk Schwendicke
Format: Article
Language:English
Published: MDPI AG 2021-03-01
Series:Journal of Clinical Medicine
Subjects:
Online Access:https://www.mdpi.com/2077-0383/10/5/961
id doaj-c788f3705b1e4c9c91892c3320665852
record_format Article
spelling doaj-c788f3705b1e4c9c91892c33206658522021-03-02T00:04:38ZengMDPI AGJournal of Clinical Medicine2077-03832021-03-011096196110.3390/jcm10050961Generalizability of Deep Learning Models for Caries Detection in Near-Infrared Light Transillumination ImagesAgnes Holtkamp0Karim Elhennawy1José E. Cejudo Grano de Oro2Joachim Krois3Sebastian Paris4Falk Schwendicke5Department of Oral Diagnostics, Digital Health and Health Services Research, Charité—Universitätsmedizin Berlin, 14197 Berlin, GermanyDepartment of Orthodontics, Dentofacial Orthopedics and Pedodontics, Charité—Universitätsmedizin Berlin, 14197 Berlin, GermanyDepartment of Oral Diagnostics, Digital Health and Health Services Research, Charité—Universitätsmedizin Berlin, 14197 Berlin, GermanyDepartment of Oral Diagnostics, Digital Health and Health Services Research, Charité—Universitätsmedizin Berlin, 14197 Berlin, GermanyDepartment of Operative and Preventive Dentistry, Charité—Universitätsmedizin Berlin, 14197 Berlin, GermanyDepartment of Oral Diagnostics, Digital Health and Health Services Research, Charité—Universitätsmedizin Berlin, 14197 Berlin, GermanyObjectives: The present study aimed to train deep convolutional neural networks (CNNs) to detect caries lesions on Near-Infrared Light Transillumination (NILT) imagery obtained either in vitro or in vivo and to assess the models’ generalizability. Methods: In vitro, 226 extracted posterior permanent human teeth were mounted in a diagnostic model in a dummy head. Then, NILT images were generated (DIAGNOcam, KaVo, Biberach), and images were segmented tooth-wise. In vivo, 1319 teeth from 56 patients were obtained and segmented similarly. Proximal caries lesions were annotated pixel-wise by three experienced dentists, reviewed by a fourth dentist, and then transformed into binary labels. We trained ResNet classification models on both in vivo and in vitro datasets and used 10-fold cross-validation for estimating the performance and generalizability of the models. We used GradCAM to increase explainability. Results: The tooth-level prevalence of caries lesions was 41% in vitro and 49% in vivo, respectively. Models trained and tested on in vivo data performed significantly better (mean ± SD accuracy: 0.78 ± 0.04) than those trained and tested on in vitro data (accuracy: 0.64 ± 0.15; <i>p</i> < 0.05). When tested in vitro, the models trained in vivo showed significantly lower accuracy (0.70 ± 0.01; <i>p</i> < 0.01). Similarly, when tested in vivo, models trained in vitro showed significantly lower accuracy (0.61 ± 0.04; <i>p</i> < 0.05). In both cases, this was due to decreases in sensitivity (by −27% for models trained in vivo and −10% for models trained in vitro). Conclusions: Using in vitro setups for generating NILT imagery and training CNNs comes with low accuracy and generalizability. Clinical significance: Studies employing in vitro imagery for developing deep learning models should be critically appraised for their generalizability. Applicable deep learning models for assessing NILT imagery should be trained on in vivo data.https://www.mdpi.com/2077-0383/10/5/961artificial intelligencecariesdiagnosticsdigital imaging/radiologymathematical modeling
collection DOAJ
language English
format Article
sources DOAJ
author Agnes Holtkamp
Karim Elhennawy
José E. Cejudo Grano de Oro
Joachim Krois
Sebastian Paris
Falk Schwendicke
spellingShingle Agnes Holtkamp
Karim Elhennawy
José E. Cejudo Grano de Oro
Joachim Krois
Sebastian Paris
Falk Schwendicke
Generalizability of Deep Learning Models for Caries Detection in Near-Infrared Light Transillumination Images
Journal of Clinical Medicine
artificial intelligence
caries
diagnostics
digital imaging/radiology
mathematical modeling
author_facet Agnes Holtkamp
Karim Elhennawy
José E. Cejudo Grano de Oro
Joachim Krois
Sebastian Paris
Falk Schwendicke
author_sort Agnes Holtkamp
title Generalizability of Deep Learning Models for Caries Detection in Near-Infrared Light Transillumination Images
title_short Generalizability of Deep Learning Models for Caries Detection in Near-Infrared Light Transillumination Images
title_full Generalizability of Deep Learning Models for Caries Detection in Near-Infrared Light Transillumination Images
title_fullStr Generalizability of Deep Learning Models for Caries Detection in Near-Infrared Light Transillumination Images
title_full_unstemmed Generalizability of Deep Learning Models for Caries Detection in Near-Infrared Light Transillumination Images
title_sort generalizability of deep learning models for caries detection in near-infrared light transillumination images
publisher MDPI AG
series Journal of Clinical Medicine
issn 2077-0383
publishDate 2021-03-01
description Objectives: The present study aimed to train deep convolutional neural networks (CNNs) to detect caries lesions on Near-Infrared Light Transillumination (NILT) imagery obtained either in vitro or in vivo and to assess the models’ generalizability. Methods: In vitro, 226 extracted posterior permanent human teeth were mounted in a diagnostic model in a dummy head. Then, NILT images were generated (DIAGNOcam, KaVo, Biberach), and images were segmented tooth-wise. In vivo, 1319 teeth from 56 patients were obtained and segmented similarly. Proximal caries lesions were annotated pixel-wise by three experienced dentists, reviewed by a fourth dentist, and then transformed into binary labels. We trained ResNet classification models on both in vivo and in vitro datasets and used 10-fold cross-validation for estimating the performance and generalizability of the models. We used GradCAM to increase explainability. Results: The tooth-level prevalence of caries lesions was 41% in vitro and 49% in vivo, respectively. Models trained and tested on in vivo data performed significantly better (mean ± SD accuracy: 0.78 ± 0.04) than those trained and tested on in vitro data (accuracy: 0.64 ± 0.15; <i>p</i> < 0.05). When tested in vitro, the models trained in vivo showed significantly lower accuracy (0.70 ± 0.01; <i>p</i> < 0.01). Similarly, when tested in vivo, models trained in vitro showed significantly lower accuracy (0.61 ± 0.04; <i>p</i> < 0.05). In both cases, this was due to decreases in sensitivity (by −27% for models trained in vivo and −10% for models trained in vitro). Conclusions: Using in vitro setups for generating NILT imagery and training CNNs comes with low accuracy and generalizability. Clinical significance: Studies employing in vitro imagery for developing deep learning models should be critically appraised for their generalizability. Applicable deep learning models for assessing NILT imagery should be trained on in vivo data.
topic artificial intelligence
caries
diagnostics
digital imaging/radiology
mathematical modeling
url https://www.mdpi.com/2077-0383/10/5/961
work_keys_str_mv AT agnesholtkamp generalizabilityofdeeplearningmodelsforcariesdetectioninnearinfraredlighttransilluminationimages
AT karimelhennawy generalizabilityofdeeplearningmodelsforcariesdetectioninnearinfraredlighttransilluminationimages
AT joseecejudogranodeoro generalizabilityofdeeplearningmodelsforcariesdetectioninnearinfraredlighttransilluminationimages
AT joachimkrois generalizabilityofdeeplearningmodelsforcariesdetectioninnearinfraredlighttransilluminationimages
AT sebastianparis generalizabilityofdeeplearningmodelsforcariesdetectioninnearinfraredlighttransilluminationimages
AT falkschwendicke generalizabilityofdeeplearningmodelsforcariesdetectioninnearinfraredlighttransilluminationimages
_version_ 1724245495656742912