A real-world demonstration of machine learning generalizability in the detection of intracranial hemorrhage on head computerized tomography

Abstract Machine learning (ML) holds great promise in transforming healthcare. While published studies have shown the utility of ML models in interpreting medical imaging examinations, these are often evaluated under laboratory settings. The importance of real world evaluation is best illustrated by...

Full description

Bibliographic Details
Main Authors: Hojjat Salehinejad, Jumpei Kitamura, Noah Ditkofsky, Amy Lin, Aditya Bharatha, Suradech Suthiphosuwan, Hui-Ming Lin, Jefferson R. Wilson, Muhammad Mamdani, Errol Colak
Format: Article
Language:English
Published: Nature Publishing Group 2021-08-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-021-95533-2
id doaj-0a90015727c949899ea11517db07fa81
record_format Article
spelling doaj-0a90015727c949899ea11517db07fa812021-08-29T11:25:28ZengNature Publishing GroupScientific Reports2045-23222021-08-0111111110.1038/s41598-021-95533-2A real-world demonstration of machine learning generalizability in the detection of intracranial hemorrhage on head computerized tomographyHojjat Salehinejad0Jumpei Kitamura1Noah Ditkofsky2Amy Lin3Aditya Bharatha4Suradech Suthiphosuwan5Hui-Ming Lin6Jefferson R. Wilson7Muhammad Mamdani8Errol Colak9Li Ka Shing Centre for Healthcare Analytics Research and Training, St. Michael’s HospitalFujisawaDepartment of Medical Imaging, St. Michael’s Hospital, Unity Health TorontoDepartment of Medical Imaging, St. Michael’s Hospital, Unity Health TorontoLi Ka Shing Centre for Healthcare Analytics Research and Training, St. Michael’s HospitalDepartment of Medical Imaging, St. Michael’s Hospital, Unity Health TorontoLi Ka Shing Centre for Healthcare Analytics Research and Training, St. Michael’s HospitalLi Ka Shing Centre for Healthcare Analytics Research and Training, St. Michael’s HospitalLi Ka Shing Centre for Healthcare Analytics Research and Training, St. Michael’s HospitalLi Ka Shing Centre for Healthcare Analytics Research and Training, St. Michael’s HospitalAbstract Machine learning (ML) holds great promise in transforming healthcare. While published studies have shown the utility of ML models in interpreting medical imaging examinations, these are often evaluated under laboratory settings. The importance of real world evaluation is best illustrated by case studies that have documented successes and failures in the translation of these models into clinical environments. A key prerequisite for the clinical adoption of these technologies is demonstrating generalizable ML model performance under real world circumstances. The purpose of this study was to demonstrate that ML model generalizability is achievable in medical imaging with the detection of intracranial hemorrhage (ICH) on non-contrast computed tomography (CT) scans serving as the use case. An ML model was trained using 21,784 scans from the RSNA Intracranial Hemorrhage CT dataset while generalizability was evaluated using an external validation dataset obtained from our busy trauma and neurosurgical center. This real world external validation dataset consisted of every unenhanced head CT scan (n = 5965) performed in our emergency department in 2019 without exclusion. The model demonstrated an AUC of 98.4%, sensitivity of 98.8%, and specificity of 98.0%, on the test dataset. On external validation, the model demonstrated an AUC of 95.4%, sensitivity of 91.3%, and specificity of 94.1%. Evaluating the ML model using a real world external validation dataset that is temporally and geographically distinct from the training dataset indicates that ML generalizability is achievable in medical imaging applications.https://doi.org/10.1038/s41598-021-95533-2
collection DOAJ
language English
format Article
sources DOAJ
author Hojjat Salehinejad
Jumpei Kitamura
Noah Ditkofsky
Amy Lin
Aditya Bharatha
Suradech Suthiphosuwan
Hui-Ming Lin
Jefferson R. Wilson
Muhammad Mamdani
Errol Colak
spellingShingle Hojjat Salehinejad
Jumpei Kitamura
Noah Ditkofsky
Amy Lin
Aditya Bharatha
Suradech Suthiphosuwan
Hui-Ming Lin
Jefferson R. Wilson
Muhammad Mamdani
Errol Colak
A real-world demonstration of machine learning generalizability in the detection of intracranial hemorrhage on head computerized tomography
Scientific Reports
author_facet Hojjat Salehinejad
Jumpei Kitamura
Noah Ditkofsky
Amy Lin
Aditya Bharatha
Suradech Suthiphosuwan
Hui-Ming Lin
Jefferson R. Wilson
Muhammad Mamdani
Errol Colak
author_sort Hojjat Salehinejad
title A real-world demonstration of machine learning generalizability in the detection of intracranial hemorrhage on head computerized tomography
title_short A real-world demonstration of machine learning generalizability in the detection of intracranial hemorrhage on head computerized tomography
title_full A real-world demonstration of machine learning generalizability in the detection of intracranial hemorrhage on head computerized tomography
title_fullStr A real-world demonstration of machine learning generalizability in the detection of intracranial hemorrhage on head computerized tomography
title_full_unstemmed A real-world demonstration of machine learning generalizability in the detection of intracranial hemorrhage on head computerized tomography
title_sort real-world demonstration of machine learning generalizability in the detection of intracranial hemorrhage on head computerized tomography
publisher Nature Publishing Group
series Scientific Reports
issn 2045-2322
publishDate 2021-08-01
description Abstract Machine learning (ML) holds great promise in transforming healthcare. While published studies have shown the utility of ML models in interpreting medical imaging examinations, these are often evaluated under laboratory settings. The importance of real world evaluation is best illustrated by case studies that have documented successes and failures in the translation of these models into clinical environments. A key prerequisite for the clinical adoption of these technologies is demonstrating generalizable ML model performance under real world circumstances. The purpose of this study was to demonstrate that ML model generalizability is achievable in medical imaging with the detection of intracranial hemorrhage (ICH) on non-contrast computed tomography (CT) scans serving as the use case. An ML model was trained using 21,784 scans from the RSNA Intracranial Hemorrhage CT dataset while generalizability was evaluated using an external validation dataset obtained from our busy trauma and neurosurgical center. This real world external validation dataset consisted of every unenhanced head CT scan (n = 5965) performed in our emergency department in 2019 without exclusion. The model demonstrated an AUC of 98.4%, sensitivity of 98.8%, and specificity of 98.0%, on the test dataset. On external validation, the model demonstrated an AUC of 95.4%, sensitivity of 91.3%, and specificity of 94.1%. Evaluating the ML model using a real world external validation dataset that is temporally and geographically distinct from the training dataset indicates that ML generalizability is achievable in medical imaging applications.
url https://doi.org/10.1038/s41598-021-95533-2
work_keys_str_mv AT hojjatsalehinejad arealworlddemonstrationofmachinelearninggeneralizabilityinthedetectionofintracranialhemorrhageonheadcomputerizedtomography
AT jumpeikitamura arealworlddemonstrationofmachinelearninggeneralizabilityinthedetectionofintracranialhemorrhageonheadcomputerizedtomography
AT noahditkofsky arealworlddemonstrationofmachinelearninggeneralizabilityinthedetectionofintracranialhemorrhageonheadcomputerizedtomography
AT amylin arealworlddemonstrationofmachinelearninggeneralizabilityinthedetectionofintracranialhemorrhageonheadcomputerizedtomography
AT adityabharatha arealworlddemonstrationofmachinelearninggeneralizabilityinthedetectionofintracranialhemorrhageonheadcomputerizedtomography
AT suradechsuthiphosuwan arealworlddemonstrationofmachinelearninggeneralizabilityinthedetectionofintracranialhemorrhageonheadcomputerizedtomography
AT huiminglin arealworlddemonstrationofmachinelearninggeneralizabilityinthedetectionofintracranialhemorrhageonheadcomputerizedtomography
AT jeffersonrwilson arealworlddemonstrationofmachinelearninggeneralizabilityinthedetectionofintracranialhemorrhageonheadcomputerizedtomography
AT muhammadmamdani arealworlddemonstrationofmachinelearninggeneralizabilityinthedetectionofintracranialhemorrhageonheadcomputerizedtomography
AT errolcolak arealworlddemonstrationofmachinelearninggeneralizabilityinthedetectionofintracranialhemorrhageonheadcomputerizedtomography
AT hojjatsalehinejad realworlddemonstrationofmachinelearninggeneralizabilityinthedetectionofintracranialhemorrhageonheadcomputerizedtomography
AT jumpeikitamura realworlddemonstrationofmachinelearninggeneralizabilityinthedetectionofintracranialhemorrhageonheadcomputerizedtomography
AT noahditkofsky realworlddemonstrationofmachinelearninggeneralizabilityinthedetectionofintracranialhemorrhageonheadcomputerizedtomography
AT amylin realworlddemonstrationofmachinelearninggeneralizabilityinthedetectionofintracranialhemorrhageonheadcomputerizedtomography
AT adityabharatha realworlddemonstrationofmachinelearninggeneralizabilityinthedetectionofintracranialhemorrhageonheadcomputerizedtomography
AT suradechsuthiphosuwan realworlddemonstrationofmachinelearninggeneralizabilityinthedetectionofintracranialhemorrhageonheadcomputerizedtomography
AT huiminglin realworlddemonstrationofmachinelearninggeneralizabilityinthedetectionofintracranialhemorrhageonheadcomputerizedtomography
AT jeffersonrwilson realworlddemonstrationofmachinelearninggeneralizabilityinthedetectionofintracranialhemorrhageonheadcomputerizedtomography
AT muhammadmamdani realworlddemonstrationofmachinelearninggeneralizabilityinthedetectionofintracranialhemorrhageonheadcomputerizedtomography
AT errolcolak realworlddemonstrationofmachinelearninggeneralizabilityinthedetectionofintracranialhemorrhageonheadcomputerizedtomography
_version_ 1721186776352030720