A hybrid cost-sensitive ensemble for heart disease prediction

Abstract Background Heart disease is the primary cause of morbidity and mortality in the world. It includes numerous problems and symptoms. The diagnosis of heart disease is difficult because there are too many factors to analyze. What’s more, the misclassification cost could be very high. Methods A...

Full description

Bibliographic Details
Main Authors: Qi Zhenya, Zuoru Zhang
Format: Article
Language:English
Published: BMC 2021-02-01
Series:BMC Medical Informatics and Decision Making
Subjects:
Online Access:https://doi.org/10.1186/s12911-021-01436-7
id doaj-e5967e2550c743edb353e2fd49d160eb
record_format Article
spelling doaj-e5967e2550c743edb353e2fd49d160eb2021-03-11T12:41:45ZengBMCBMC Medical Informatics and Decision Making1472-69472021-02-0121111810.1186/s12911-021-01436-7A hybrid cost-sensitive ensemble for heart disease predictionQi Zhenya0Zuoru Zhang1College of Management and Economics, Tianjin UniversitySchool of Mathematical Science, Hebei Normal UniversityAbstract Background Heart disease is the primary cause of morbidity and mortality in the world. It includes numerous problems and symptoms. The diagnosis of heart disease is difficult because there are too many factors to analyze. What’s more, the misclassification cost could be very high. Methods A cost-sensitive ensemble method was proposed to improve the efficiency of diagnosis and reduce the misclassification cost. The proposed method contains five heterogeneous classifiers: random forest, logistic regression, support vector machine, extreme learning machine and k-nearest neighbor. T-test was used to investigate if the performance of the ensemble was better than individual classifiers and the contribution of Relief algorithm. Results The best performance was achieved by the proposed method according to ten-fold cross validation. The statistical tests demonstrated that the performance of the proposed ensemble was significantly superior to individual classifiers, and the efficiency of classification was distinctively improved by Relief algorithm. Conclusions The proposed ensemble gained significantly better results compared with individual classifiers and previous studies, which implies that it can be used as a promising alternative tool in medical decision making for heart disease diagnosis.https://doi.org/10.1186/s12911-021-01436-7Cost-sensitiveEnsembleHeart disease
collection DOAJ
language English
format Article
sources DOAJ
author Qi Zhenya
Zuoru Zhang
spellingShingle Qi Zhenya
Zuoru Zhang
A hybrid cost-sensitive ensemble for heart disease prediction
BMC Medical Informatics and Decision Making
Cost-sensitive
Ensemble
Heart disease
author_facet Qi Zhenya
Zuoru Zhang
author_sort Qi Zhenya
title A hybrid cost-sensitive ensemble for heart disease prediction
title_short A hybrid cost-sensitive ensemble for heart disease prediction
title_full A hybrid cost-sensitive ensemble for heart disease prediction
title_fullStr A hybrid cost-sensitive ensemble for heart disease prediction
title_full_unstemmed A hybrid cost-sensitive ensemble for heart disease prediction
title_sort hybrid cost-sensitive ensemble for heart disease prediction
publisher BMC
series BMC Medical Informatics and Decision Making
issn 1472-6947
publishDate 2021-02-01
description Abstract Background Heart disease is the primary cause of morbidity and mortality in the world. It includes numerous problems and symptoms. The diagnosis of heart disease is difficult because there are too many factors to analyze. What’s more, the misclassification cost could be very high. Methods A cost-sensitive ensemble method was proposed to improve the efficiency of diagnosis and reduce the misclassification cost. The proposed method contains five heterogeneous classifiers: random forest, logistic regression, support vector machine, extreme learning machine and k-nearest neighbor. T-test was used to investigate if the performance of the ensemble was better than individual classifiers and the contribution of Relief algorithm. Results The best performance was achieved by the proposed method according to ten-fold cross validation. The statistical tests demonstrated that the performance of the proposed ensemble was significantly superior to individual classifiers, and the efficiency of classification was distinctively improved by Relief algorithm. Conclusions The proposed ensemble gained significantly better results compared with individual classifiers and previous studies, which implies that it can be used as a promising alternative tool in medical decision making for heart disease diagnosis.
topic Cost-sensitive
Ensemble
Heart disease
url https://doi.org/10.1186/s12911-021-01436-7
work_keys_str_mv AT qizhenya ahybridcostsensitiveensembleforheartdiseaseprediction
AT zuoruzhang ahybridcostsensitiveensembleforheartdiseaseprediction
AT qizhenya hybridcostsensitiveensembleforheartdiseaseprediction
AT zuoruzhang hybridcostsensitiveensembleforheartdiseaseprediction
_version_ 1724224202963156992