Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians

BackgroundA hospital information system (HIS) that integrates screening data and interpretation of the data is routinely requested by hospitals and parents. However, the accuracy of disease classification may be low because of the disease characteristics and the analytes used...

Full description

Bibliographic Details
Main Authors: Chen, Wei-Hsin, Hsieh, Sheau-Ling, Hsu, Kai-Ping, Chen, Han-Ping, Su, Xing-Yu, Tseng, Yi-Ju, Chien, Yin-Hsiu, Hwu, Wuh-Liang, Lai, Feipei
Format: Article
Language:English
Published: JMIR Publications 2013-05-01
Series:Journal of Medical Internet Research
Online Access:http://www.jmir.org/2013/5/e98/
id doaj-6ab3c8af36e44af38e64b4ea91ea95e3
record_format Article
spelling doaj-6ab3c8af36e44af38e64b4ea91ea95e32021-04-02T21:36:01ZengJMIR PublicationsJournal of Medical Internet Research1438-88712013-05-01155e9810.2196/jmir.2495Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus CliniciansChen, Wei-HsinHsieh, Sheau-LingHsu, Kai-PingChen, Han-PingSu, Xing-YuTseng, Yi-JuChien, Yin-HsiuHwu, Wuh-LiangLai, Feipei BackgroundA hospital information system (HIS) that integrates screening data and interpretation of the data is routinely requested by hospitals and parents. However, the accuracy of disease classification may be low because of the disease characteristics and the analytes used for classification. ObjectiveThe objective of this study is to describe a system that enhanced the neonatal screening system of the Newborn Screening Center at the National Taiwan University Hospital. The system was designed and deployed according to a service-oriented architecture (SOA) framework under the Web services .NET environment. The system consists of sample collection, testing, diagnosis, evaluation, treatment, and follow-up services among collaborating hospitals. To improve the accuracy of newborn screening, machine learning and optimal feature selection mechanisms were investigated for screening newborns for inborn errors of metabolism. MethodsThe framework of the Newborn Screening Hospital Information System (NSHIS) used the embedded Health Level Seven (HL7) standards for data exchanges among heterogeneous platforms integrated by Web services in the C# language. In this study, machine learning classification was used to predict phenylketonuria (PKU), hypermethioninemia, and 3-methylcrotonyl-CoA-carboxylase (3-MCC) deficiency. The classification methods used 347,312 newborn dried blood samples collected at the Center between 2006 and 2011. Of these, 220 newborns had values over the diagnostic cutoffs (positive cases) and 1557 had values that were over the screening cutoffs but did not meet the diagnostic cutoffs (suspected cases). The original 35 analytes and the manifested features were ranked based on F score, then combinations of the top 20 ranked features were selected as input features to support vector machine (SVM) classifiers to obtain optimal feature sets. These feature sets were tested using 5-fold cross-validation and optimal models were generated. The datasets collected in year 2011 were used as predicting cases. ResultsThe feature selection strategies were implemented and the optimal markers for PKU, hypermethioninemia, and 3-MCC deficiency were obtained. The results of the machine learning approach were compared with the cutoff scheme. The number of the false positive cases were reduced from 21 to 2 for PKU, from 30 to 10 for hypermethioninemia, and 209 to 46 for 3-MCC deficiency. ConclusionsThis SOA Web service–based newborn screening system can accelerate screening procedures effectively and efficiently. An SVM learning methodology for PKU, hypermethioninemia, and 3-MCC deficiency metabolic diseases classification, including optimal feature selection strategies, is presented. By adopting the results of this study, the number of suspected cases could be reduced dramatically.http://www.jmir.org/2013/5/e98/
collection DOAJ
language English
format Article
sources DOAJ
author Chen, Wei-Hsin
Hsieh, Sheau-Ling
Hsu, Kai-Ping
Chen, Han-Ping
Su, Xing-Yu
Tseng, Yi-Ju
Chien, Yin-Hsiu
Hwu, Wuh-Liang
Lai, Feipei
spellingShingle Chen, Wei-Hsin
Hsieh, Sheau-Ling
Hsu, Kai-Ping
Chen, Han-Ping
Su, Xing-Yu
Tseng, Yi-Ju
Chien, Yin-Hsiu
Hwu, Wuh-Liang
Lai, Feipei
Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians
Journal of Medical Internet Research
author_facet Chen, Wei-Hsin
Hsieh, Sheau-Ling
Hsu, Kai-Ping
Chen, Han-Ping
Su, Xing-Yu
Tseng, Yi-Ju
Chien, Yin-Hsiu
Hwu, Wuh-Liang
Lai, Feipei
author_sort Chen, Wei-Hsin
title Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians
title_short Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians
title_full Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians
title_fullStr Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians
title_full_unstemmed Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians
title_sort web-based newborn screening system for metabolic diseases: machine learning versus clinicians
publisher JMIR Publications
series Journal of Medical Internet Research
issn 1438-8871
publishDate 2013-05-01
description BackgroundA hospital information system (HIS) that integrates screening data and interpretation of the data is routinely requested by hospitals and parents. However, the accuracy of disease classification may be low because of the disease characteristics and the analytes used for classification. ObjectiveThe objective of this study is to describe a system that enhanced the neonatal screening system of the Newborn Screening Center at the National Taiwan University Hospital. The system was designed and deployed according to a service-oriented architecture (SOA) framework under the Web services .NET environment. The system consists of sample collection, testing, diagnosis, evaluation, treatment, and follow-up services among collaborating hospitals. To improve the accuracy of newborn screening, machine learning and optimal feature selection mechanisms were investigated for screening newborns for inborn errors of metabolism. MethodsThe framework of the Newborn Screening Hospital Information System (NSHIS) used the embedded Health Level Seven (HL7) standards for data exchanges among heterogeneous platforms integrated by Web services in the C# language. In this study, machine learning classification was used to predict phenylketonuria (PKU), hypermethioninemia, and 3-methylcrotonyl-CoA-carboxylase (3-MCC) deficiency. The classification methods used 347,312 newborn dried blood samples collected at the Center between 2006 and 2011. Of these, 220 newborns had values over the diagnostic cutoffs (positive cases) and 1557 had values that were over the screening cutoffs but did not meet the diagnostic cutoffs (suspected cases). The original 35 analytes and the manifested features were ranked based on F score, then combinations of the top 20 ranked features were selected as input features to support vector machine (SVM) classifiers to obtain optimal feature sets. These feature sets were tested using 5-fold cross-validation and optimal models were generated. The datasets collected in year 2011 were used as predicting cases. ResultsThe feature selection strategies were implemented and the optimal markers for PKU, hypermethioninemia, and 3-MCC deficiency were obtained. The results of the machine learning approach were compared with the cutoff scheme. The number of the false positive cases were reduced from 21 to 2 for PKU, from 30 to 10 for hypermethioninemia, and 209 to 46 for 3-MCC deficiency. ConclusionsThis SOA Web service–based newborn screening system can accelerate screening procedures effectively and efficiently. An SVM learning methodology for PKU, hypermethioninemia, and 3-MCC deficiency metabolic diseases classification, including optimal feature selection strategies, is presented. By adopting the results of this study, the number of suspected cases could be reduced dramatically.
url http://www.jmir.org/2013/5/e98/
work_keys_str_mv AT chenweihsin webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians
AT hsiehsheauling webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians
AT hsukaiping webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians
AT chenhanping webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians
AT suxingyu webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians
AT tsengyiju webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians
AT chienyinhsiu webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians
AT hwuwuhliang webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians
AT laifeipei webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians
_version_ 1721545067160666112