Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians
BackgroundA hospital information system (HIS) that integrates screening data and interpretation of the data is routinely requested by hospitals and parents. However, the accuracy of disease classification may be low because of the disease characteristics and the analytes used...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
JMIR Publications
2013-05-01
|
Series: | Journal of Medical Internet Research |
Online Access: | http://www.jmir.org/2013/5/e98/ |
id |
doaj-6ab3c8af36e44af38e64b4ea91ea95e3 |
---|---|
record_format |
Article |
spelling |
doaj-6ab3c8af36e44af38e64b4ea91ea95e32021-04-02T21:36:01ZengJMIR PublicationsJournal of Medical Internet Research1438-88712013-05-01155e9810.2196/jmir.2495Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus CliniciansChen, Wei-HsinHsieh, Sheau-LingHsu, Kai-PingChen, Han-PingSu, Xing-YuTseng, Yi-JuChien, Yin-HsiuHwu, Wuh-LiangLai, Feipei BackgroundA hospital information system (HIS) that integrates screening data and interpretation of the data is routinely requested by hospitals and parents. However, the accuracy of disease classification may be low because of the disease characteristics and the analytes used for classification. ObjectiveThe objective of this study is to describe a system that enhanced the neonatal screening system of the Newborn Screening Center at the National Taiwan University Hospital. The system was designed and deployed according to a service-oriented architecture (SOA) framework under the Web services .NET environment. The system consists of sample collection, testing, diagnosis, evaluation, treatment, and follow-up services among collaborating hospitals. To improve the accuracy of newborn screening, machine learning and optimal feature selection mechanisms were investigated for screening newborns for inborn errors of metabolism. MethodsThe framework of the Newborn Screening Hospital Information System (NSHIS) used the embedded Health Level Seven (HL7) standards for data exchanges among heterogeneous platforms integrated by Web services in the C# language. In this study, machine learning classification was used to predict phenylketonuria (PKU), hypermethioninemia, and 3-methylcrotonyl-CoA-carboxylase (3-MCC) deficiency. The classification methods used 347,312 newborn dried blood samples collected at the Center between 2006 and 2011. Of these, 220 newborns had values over the diagnostic cutoffs (positive cases) and 1557 had values that were over the screening cutoffs but did not meet the diagnostic cutoffs (suspected cases). The original 35 analytes and the manifested features were ranked based on F score, then combinations of the top 20 ranked features were selected as input features to support vector machine (SVM) classifiers to obtain optimal feature sets. These feature sets were tested using 5-fold cross-validation and optimal models were generated. The datasets collected in year 2011 were used as predicting cases. ResultsThe feature selection strategies were implemented and the optimal markers for PKU, hypermethioninemia, and 3-MCC deficiency were obtained. The results of the machine learning approach were compared with the cutoff scheme. The number of the false positive cases were reduced from 21 to 2 for PKU, from 30 to 10 for hypermethioninemia, and 209 to 46 for 3-MCC deficiency. ConclusionsThis SOA Web service–based newborn screening system can accelerate screening procedures effectively and efficiently. An SVM learning methodology for PKU, hypermethioninemia, and 3-MCC deficiency metabolic diseases classification, including optimal feature selection strategies, is presented. By adopting the results of this study, the number of suspected cases could be reduced dramatically.http://www.jmir.org/2013/5/e98/ |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Chen, Wei-Hsin Hsieh, Sheau-Ling Hsu, Kai-Ping Chen, Han-Ping Su, Xing-Yu Tseng, Yi-Ju Chien, Yin-Hsiu Hwu, Wuh-Liang Lai, Feipei |
spellingShingle |
Chen, Wei-Hsin Hsieh, Sheau-Ling Hsu, Kai-Ping Chen, Han-Ping Su, Xing-Yu Tseng, Yi-Ju Chien, Yin-Hsiu Hwu, Wuh-Liang Lai, Feipei Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians Journal of Medical Internet Research |
author_facet |
Chen, Wei-Hsin Hsieh, Sheau-Ling Hsu, Kai-Ping Chen, Han-Ping Su, Xing-Yu Tseng, Yi-Ju Chien, Yin-Hsiu Hwu, Wuh-Liang Lai, Feipei |
author_sort |
Chen, Wei-Hsin |
title |
Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians |
title_short |
Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians |
title_full |
Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians |
title_fullStr |
Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians |
title_full_unstemmed |
Web-Based Newborn Screening System for Metabolic Diseases: Machine Learning Versus Clinicians |
title_sort |
web-based newborn screening system for metabolic diseases: machine learning versus clinicians |
publisher |
JMIR Publications |
series |
Journal of Medical Internet Research |
issn |
1438-8871 |
publishDate |
2013-05-01 |
description |
BackgroundA hospital information system (HIS) that integrates screening data and interpretation of the data is routinely requested by hospitals and parents. However, the accuracy of disease classification may be low because of the disease characteristics and the analytes used for classification.
ObjectiveThe objective of this study is to describe a system that enhanced the neonatal screening system of the Newborn Screening Center at the National Taiwan University Hospital. The system was designed and deployed according to a service-oriented architecture (SOA) framework under the Web services .NET environment. The system consists of sample collection, testing, diagnosis, evaluation, treatment, and follow-up services among collaborating hospitals. To improve the accuracy of newborn screening, machine learning and optimal feature selection mechanisms were investigated for screening newborns for inborn errors of metabolism.
MethodsThe framework of the Newborn Screening Hospital Information System (NSHIS) used the embedded Health Level Seven (HL7) standards for data exchanges among heterogeneous platforms integrated by Web services in the C# language. In this study, machine learning classification was used to predict phenylketonuria (PKU), hypermethioninemia, and 3-methylcrotonyl-CoA-carboxylase (3-MCC) deficiency. The classification methods used 347,312 newborn dried blood samples collected at the Center between 2006 and 2011. Of these, 220 newborns had values over the diagnostic cutoffs (positive cases) and 1557 had values that were over the screening cutoffs but did not meet the diagnostic cutoffs (suspected cases). The original 35 analytes and the manifested features were ranked based on F score, then combinations of the top 20 ranked features were selected as input features to support vector machine (SVM) classifiers to obtain optimal feature sets. These feature sets were tested using 5-fold cross-validation and optimal models were generated. The datasets collected in year 2011 were used as predicting cases.
ResultsThe feature selection strategies were implemented and the optimal markers for PKU, hypermethioninemia, and 3-MCC deficiency were obtained. The results of the machine learning approach were compared with the cutoff scheme. The number of the false positive cases were reduced from 21 to 2 for PKU, from 30 to 10 for hypermethioninemia, and 209 to 46 for 3-MCC deficiency.
ConclusionsThis SOA Web service–based newborn screening system can accelerate screening procedures effectively and efficiently. An SVM learning methodology for PKU, hypermethioninemia, and 3-MCC deficiency metabolic diseases classification, including optimal feature selection strategies, is presented. By adopting the results of this study, the number of suspected cases could be reduced dramatically. |
url |
http://www.jmir.org/2013/5/e98/ |
work_keys_str_mv |
AT chenweihsin webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians AT hsiehsheauling webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians AT hsukaiping webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians AT chenhanping webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians AT suxingyu webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians AT tsengyiju webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians AT chienyinhsiu webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians AT hwuwuhliang webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians AT laifeipei webbasednewbornscreeningsystemformetabolicdiseasesmachinelearningversusclinicians |
_version_ |
1721545067160666112 |