A practical model for the identification of congenital cataracts using machine learning

Background: Approximately 1 in 33 newborns is affected by congenital anomalies worldwide. We aimed to develop a practical model for identifying infants with a high risk of congenital cataracts (CCs), which is the leading cause of avoidable childhood blindness. Methods: This case-control study was pe...

Full description

Bibliographic Details
Main Authors: Duoru Lin, Jingjing Chen, Zhuoling Lin, Xiaoyan Li, Kai Zhang, Xiaohang Wu, Zhenzhen Liu, Jialing Huang, Jing Li, Yi Zhu, Chuan Chen, Lanqin Zhao, Yifan Xiang, Chong Guo, Liming Wang, Yizhi Liu, Weirong Chen, Haotian Lin
Format: Article
Language:English
Published: Elsevier 2020-01-01
Series:EBioMedicine
Online Access:http://www.sciencedirect.com/science/article/pii/S2352396419308369
id doaj-d65c61c312b342ca8148a872f211e836
record_format Article
collection DOAJ
language English
format Article
sources DOAJ
author Duoru Lin
Jingjing Chen
Zhuoling Lin
Xiaoyan Li
Kai Zhang
Xiaohang Wu
Zhenzhen Liu
Jialing Huang
Jing Li
Yi Zhu
Chuan Chen
Lanqin Zhao
Yifan Xiang
Chong Guo
Liming Wang
Yizhi Liu
Weirong Chen
Haotian Lin
spellingShingle Duoru Lin
Jingjing Chen
Zhuoling Lin
Xiaoyan Li
Kai Zhang
Xiaohang Wu
Zhenzhen Liu
Jialing Huang
Jing Li
Yi Zhu
Chuan Chen
Lanqin Zhao
Yifan Xiang
Chong Guo
Liming Wang
Yizhi Liu
Weirong Chen
Haotian Lin
A practical model for the identification of congenital cataracts using machine learning
EBioMedicine
author_facet Duoru Lin
Jingjing Chen
Zhuoling Lin
Xiaoyan Li
Kai Zhang
Xiaohang Wu
Zhenzhen Liu
Jialing Huang
Jing Li
Yi Zhu
Chuan Chen
Lanqin Zhao
Yifan Xiang
Chong Guo
Liming Wang
Yizhi Liu
Weirong Chen
Haotian Lin
author_sort Duoru Lin
title A practical model for the identification of congenital cataracts using machine learning
title_short A practical model for the identification of congenital cataracts using machine learning
title_full A practical model for the identification of congenital cataracts using machine learning
title_fullStr A practical model for the identification of congenital cataracts using machine learning
title_full_unstemmed A practical model for the identification of congenital cataracts using machine learning
title_sort practical model for the identification of congenital cataracts using machine learning
publisher Elsevier
series EBioMedicine
issn 2352-3964
publishDate 2020-01-01
description Background: Approximately 1 in 33 newborns is affected by congenital anomalies worldwide. We aimed to develop a practical model for identifying infants with a high risk of congenital cataracts (CCs), which is the leading cause of avoidable childhood blindness. Methods: This case-control study was performed in the Zhongshan Ophthalmic Center and involved 2005 subjects, including 1274 children with CCs and 731 healthy controls. The CC identification models were established based on birth conditions, family medical history, and family environmental factors using the random forest (RF) and adaptive boosting methods (trained by 1129 CC cases and 609 healthy controls), which were tested by internal 4-fold cross-validation and external validation (145 CC cases and 122 healthy controls). The models were also tested using 4 datasets with gradually reduced proportions of CC patients (bilateral cases) to validate their performance in an approximate simulation of a clinical environment with a relatively low disease prevalence. Findings: The CC identification models showed high discrimination in both the 4-fold cross validation (area under the curve (AUC)=0.91 [95% confidence interval: 0.88–0.94] in bilateral cases; 0.82 [0.77–0.89] in unilateral cases) and external validation (AUC=0.93±0.05 in bilateral cases; 0.86±0.01 in unilateral cases), and achieved stable performance in the clinical tests (AUC=0.94–0.96 in the four subgroups by RF). Furthermore, family history of CC, low parental education level, and comorbidity were identified as the top three most relevant factors to both bilateral and unilateral CC diagnosis. Interpretation: Our CC identification models can accurately discriminate CC patients from healthy children and have the potential to serve as a complementary screening procedure, especially in undeveloped and remote areas. Keywords: Congenital anomaly, Congenital cataract, Identification model, Machine learning
url http://www.sciencedirect.com/science/article/pii/S2352396419308369
work_keys_str_mv AT duorulin apracticalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT jingjingchen apracticalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT zhuolinglin apracticalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT xiaoyanli apracticalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT kaizhang apracticalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT xiaohangwu apracticalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT zhenzhenliu apracticalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT jialinghuang apracticalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT jingli apracticalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT yizhu apracticalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT chuanchen apracticalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT lanqinzhao apracticalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT yifanxiang apracticalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT chongguo apracticalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT limingwang apracticalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT yizhiliu apracticalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT weirongchen apracticalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT haotianlin apracticalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT duorulin practicalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT jingjingchen practicalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT zhuolinglin practicalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT xiaoyanli practicalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT kaizhang practicalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT xiaohangwu practicalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT zhenzhenliu practicalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT jialinghuang practicalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT jingli practicalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT yizhu practicalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT chuanchen practicalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT lanqinzhao practicalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT yifanxiang practicalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT chongguo practicalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT limingwang practicalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT yizhiliu practicalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT weirongchen practicalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
AT haotianlin practicalmodelfortheidentificationofcongenitalcataractsusingmachinelearning
_version_ 1725993347870359552
spelling doaj-d65c61c312b342ca8148a872f211e8362020-11-24T21:23:09ZengElsevierEBioMedicine2352-39642020-01-0151A practical model for the identification of congenital cataracts using machine learningDuoru Lin0Jingjing Chen1Zhuoling Lin2Xiaoyan Li3Kai Zhang4Xiaohang Wu5Zhenzhen Liu6Jialing Huang7Jing Li8Yi Zhu9Chuan Chen10Lanqin Zhao11Yifan Xiang12Chong Guo13Liming Wang14Yizhi Liu15Weirong Chen16Haotian Lin17State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Jinsui Road #7, Guangzhou, Guangdong 510060, People's Republic of ChinaState Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Jinsui Road #7, Guangzhou, Guangdong 510060, People's Republic of ChinaState Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Jinsui Road #7, Guangzhou, Guangdong 510060, People's Republic of ChinaState Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Jinsui Road #7, Guangzhou, Guangdong 510060, People's Republic of ChinaState Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Jinsui Road #7, Guangzhou, Guangdong 510060, People's Republic of China; School of Computer Science and Technology, Xidian University, Xi'an, Shanxi 710071, People's Republic of ChinaState Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Jinsui Road #7, Guangzhou, Guangdong 510060, People's Republic of ChinaState Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Jinsui Road #7, Guangzhou, Guangdong 510060, People's Republic of ChinaSchool of Public Health, Sun Yat-sen University, Guangzhou, Guangdong 510060, People's Republic of ChinaState Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Jinsui Road #7, Guangzhou, Guangdong 510060, People's Republic of ChinaState Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Jinsui Road #7, Guangzhou, Guangdong 510060, People's Republic of China; Department of Molecular and Cellular Pharmacology, University of Miami Miller School of Medicine, Miami, FL 33136, USAState Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Jinsui Road #7, Guangzhou, Guangdong 510060, People's Republic of China; Department of Molecular and Cellular Pharmacology, University of Miami Miller School of Medicine, Miami, FL 33136, USAState Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Jinsui Road #7, Guangzhou, Guangdong 510060, People's Republic of ChinaState Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Jinsui Road #7, Guangzhou, Guangdong 510060, People's Republic of ChinaState Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Jinsui Road #7, Guangzhou, Guangdong 510060, People's Republic of ChinaSchool of Computer Science and Technology, Xidian University, Xi'an, Shanxi 710071, People's Republic of ChinaState Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Jinsui Road #7, Guangzhou, Guangdong 510060, People's Republic of ChinaState Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Jinsui Road #7, Guangzhou, Guangdong 510060, People's Republic of China; Corresponding author.State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Jinsui Road #7, Guangzhou, Guangdong 510060, People's Republic of China; Corresponding author.Background: Approximately 1 in 33 newborns is affected by congenital anomalies worldwide. We aimed to develop a practical model for identifying infants with a high risk of congenital cataracts (CCs), which is the leading cause of avoidable childhood blindness. Methods: This case-control study was performed in the Zhongshan Ophthalmic Center and involved 2005 subjects, including 1274 children with CCs and 731 healthy controls. The CC identification models were established based on birth conditions, family medical history, and family environmental factors using the random forest (RF) and adaptive boosting methods (trained by 1129 CC cases and 609 healthy controls), which were tested by internal 4-fold cross-validation and external validation (145 CC cases and 122 healthy controls). The models were also tested using 4 datasets with gradually reduced proportions of CC patients (bilateral cases) to validate their performance in an approximate simulation of a clinical environment with a relatively low disease prevalence. Findings: The CC identification models showed high discrimination in both the 4-fold cross validation (area under the curve (AUC)=0.91 [95% confidence interval: 0.88–0.94] in bilateral cases; 0.82 [0.77–0.89] in unilateral cases) and external validation (AUC=0.93±0.05 in bilateral cases; 0.86±0.01 in unilateral cases), and achieved stable performance in the clinical tests (AUC=0.94–0.96 in the four subgroups by RF). Furthermore, family history of CC, low parental education level, and comorbidity were identified as the top three most relevant factors to both bilateral and unilateral CC diagnosis. Interpretation: Our CC identification models can accurately discriminate CC patients from healthy children and have the potential to serve as a complementary screening procedure, especially in undeveloped and remote areas. Keywords: Congenital anomaly, Congenital cataract, Identification model, Machine learninghttp://www.sciencedirect.com/science/article/pii/S2352396419308369