Predicting coronary artery disease: a comparison between two data mining algorithms

Abstract Background Cardiovascular diseases (CVDs) are the leading cause of death worldwide. The World Health Organization has estimated that the mortality caused by heart diseases will rise to 23 million cases by 2030. Hence, the use of data mining algorithms could be useful in predicting...

Bibliographic Details
Main Authors:	Haleh Ayatollahi, Leila Gholamhosseini, Masoud Salehi
Format:	Article
Language:	English
Published:	BMC 2019-04-01
Series:	BMC Public Health
Subjects:	Coronary artery disease (CAD), Data mining algorithms, Artificial neural network (ANN), Support vector machine (SVM)
Online Access:	http://link.springer.com/article/10.1186/s12889-019-6721-5

id	doaj-cbf5999b7ee549c89de4ece704b957b4
record_format	Article
spelling	doaj-cbf5999b7ee549c89de4ece704b957b4 | 2020-11-25T02:12:52Z | eng | BMC | BMC Public Health | 1471-2458 | 2019-04-01 | 19 | 1 | 1 | 9 | 10.1186/s12889-019-6721-5 | Predicting coronary artery disease: a comparison between two data mining algorithms | Haleh Ayatollahi 0 | Leila Gholamhosseini 1 | Masoud Salehi 2 | Health Management and Economics Research Center, Iran University of Medical Sciences | Department of Health Information Management, School of Health Management and Information Sciences, Iran University of Medical Sciences | Department of Biostatistics, School of Public Health, Iran University of Medical Sciences
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Haleh Ayatollahi, Leila Gholamhosseini, Masoud Salehi
title	Predicting coronary artery disease: a comparison between two data mining algorithms
publisher	BMC
series	BMC Public Health
issn	1471-2458
publishDate	2019-04-01
description	Abstract. Background: Cardiovascular diseases (CVDs) are the leading cause of death worldwide. The World Health Organization has estimated that the mortality caused by heart diseases will rise to 23 million cases by 2030. Hence, data mining algorithms could be useful in predicting coronary artery disease (CAD). The present study therefore aimed to compare the positive predictive value (PPV) for CAD of artificial neural network (ANN) and support vector machine (SVM) algorithms, and how the two differ in predicting CAD in the selected hospitals. Methods: The study was conducted using data mining techniques. The research sample consisted of the medical records of patients with coronary artery disease who were hospitalized in three hospitals affiliated to AJA University of Medical Sciences between March 2016 and March 2017 (n = 1324). The dataset and the predictor variables were the same for both data mining techniques. In total, 25 variables affecting CAD were selected and the related data were extracted. After normalization and cleaning, the data were entered into SPSS (V23.0) and Excel 2013. R 3.3.2 was then used for statistical computing. Results: The SVM model had a lower MAPE (112.03), a higher Hosmer-Lemeshow test result (16.71), and higher sensitivity (92.23). Moreover, the variables affecting CAD (74.42) yielded better goodness of fit in the SVM model and provided more accurate results than in the ANN model. In addition, since the area under the receiver operating characteristic (ROC) curve was larger for the SVM algorithm than for the ANN model, it was concluded that the SVM model had higher accuracy than the ANN model. Conclusion: According to the results, the SVM algorithm showed higher accuracy and better performance than the ANN model and was characterized by higher power and sensitivity. Overall, it provided a better classification for the prediction of CAD. The use of other data mining algorithms is suggested to improve the positive predictive value of the disease prediction.
topic	Coronary artery disease (CAD), Data mining algorithms, Artificial neural network (ANN), Support vector machine (SVM)
url	http://link.springer.com/article/10.1186/s12889-019-6721-5
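
Note on the metrics named in the abstract: sensitivity and positive predictive value (PPV) are both derived from a classifier's confusion matrix. The following is a minimal Python sketch of how the two could be computed for two classifiers evaluated on the same labels. It is not the authors' R code; the function names and the toy labels and predictions are hypothetical and are not taken from the study's data.

```python
# Illustrative only: toy labels and predictions, not data from the study.

def confusion_counts(y_true, y_pred):
    """Return (tp, fp, fn, tn) for binary labels where 1 = disease present."""
    tp = fp = fn = tn = 0
    for actual, predicted in zip(y_true, y_pred):
        if predicted == 1 and actual == 1:
            tp += 1
        elif predicted == 1 and actual == 0:
            fp += 1
        elif predicted == 0 and actual == 1:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def sensitivity(tp, fn):
    """Share of actual positives that the model flags: TP / (TP + FN)."""
    return tp / (tp + fn) if (tp + fn) else float("nan")


def positive_predictive_value(tp, fp):
    """Share of flagged cases that are truly positive: TP / (TP + FP)."""
    return tp / (tp + fp) if (tp + fp) else float("nan")


# Hypothetical ground truth and predictions from two hypothetical models.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
model_a_pred = [1, 1, 1, 0, 1, 0, 0, 0]
model_b_pred = [1, 1, 0, 0, 1, 1, 0, 0]

for name, pred in (("model A", model_a_pred), ("model B", model_b_pred)):
    tp, fp, fn, tn = confusion_counts(y_true, pred)
    print(name,
          "sensitivity =", sensitivity(tp, fn),
          "PPV =", positive_predictive_value(tp, fp))
```

Comparing both models on the same labels mirrors the abstract's statement that the dataset and predictor variables were identical for both techniques, so differences in the metrics reflect the models rather than the data.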
