An improved ensemble learning approach for the prediction of heart disease risk

Heart disease is the leading cause of death globally, and early detection is crucial in preventing the progression of the disease. In this paper, an improved machine learning method is proposed for the prediction of heart disease risk. The technique involves randomly partitioning the dataset into sm...

Full description

Bibliographic Details
Main Authors: Ibomoiye Domor Mienye, Yanxia Sun, Zenghui Wang
Format: Article
Language:English
Published: Elsevier 2020-01-01
Series:Informatics in Medicine Unlocked
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2352914820304184
id doaj-e67d3217bce3439d82e7e13621dc72bf
record_format Article
spelling doaj-e67d3217bce3439d82e7e13621dc72bf2020-11-25T01:59:37ZengElsevierInformatics in Medicine Unlocked2352-91482020-01-0120100402An improved ensemble learning approach for the prediction of heart disease riskIbomoiye Domor Mienye0Yanxia Sun1Zenghui Wang2Department of Electrical and Electronic Engineering Science, University of Johannesburg, Johannesburg, 2006, South Africa; Corresponding author.Department of Electrical and Electronic Engineering Science, University of Johannesburg, Johannesburg, 2006, South AfricaDepartment of Electrical and Mining Engineering, University of South Africa, Florida, 1709, South AfricaHeart disease is the leading cause of death globally, and early detection is crucial in preventing the progression of the disease. In this paper, an improved machine learning method is proposed for the prediction of heart disease risk. The technique involves randomly partitioning the dataset into smaller subsets using a mean based splitting approach. The various partitions are then modeled using classification and regression tree (CART). A homogeneous ensemble is then created from the different CART models using an accuracy based weighted aging classifier ensemble, which is a modification of the weighted aging classifier ensemble (WAE). The approach ensures optimal performance is achieved. The experimental results on the Cleveland and Framingham datasets achieved classification accuracies of 93% and 91%, respectively, which outperformed other machine learning algorithms and similar scholarly works. The receiver operating characteristic curves further validates the improved performance of the proposed ensemble learning approach. The results show that heart disease risk can be predicted effectively by the proposed ensemble.http://www.sciencedirect.com/science/article/pii/S2352914820304184CARTData partitioningEnsemble learningHeart diseaseMachine learning
collection DOAJ
language English
format Article
sources DOAJ
author Ibomoiye Domor Mienye
Yanxia Sun
Zenghui Wang
spellingShingle Ibomoiye Domor Mienye
Yanxia Sun
Zenghui Wang
An improved ensemble learning approach for the prediction of heart disease risk
Informatics in Medicine Unlocked
CART
Data partitioning
Ensemble learning
Heart disease
Machine learning
author_facet Ibomoiye Domor Mienye
Yanxia Sun
Zenghui Wang
author_sort Ibomoiye Domor Mienye
title An improved ensemble learning approach for the prediction of heart disease risk
title_short An improved ensemble learning approach for the prediction of heart disease risk
title_full An improved ensemble learning approach for the prediction of heart disease risk
title_fullStr An improved ensemble learning approach for the prediction of heart disease risk
title_full_unstemmed An improved ensemble learning approach for the prediction of heart disease risk
title_sort improved ensemble learning approach for the prediction of heart disease risk
publisher Elsevier
series Informatics in Medicine Unlocked
issn 2352-9148
publishDate 2020-01-01
description Heart disease is the leading cause of death globally, and early detection is crucial in preventing the progression of the disease. In this paper, an improved machine learning method is proposed for the prediction of heart disease risk. The technique involves randomly partitioning the dataset into smaller subsets using a mean based splitting approach. The various partitions are then modeled using classification and regression tree (CART). A homogeneous ensemble is then created from the different CART models using an accuracy based weighted aging classifier ensemble, which is a modification of the weighted aging classifier ensemble (WAE). The approach ensures optimal performance is achieved. The experimental results on the Cleveland and Framingham datasets achieved classification accuracies of 93% and 91%, respectively, which outperformed other machine learning algorithms and similar scholarly works. The receiver operating characteristic curves further validates the improved performance of the proposed ensemble learning approach. The results show that heart disease risk can be predicted effectively by the proposed ensemble.
topic CART
Data partitioning
Ensemble learning
Heart disease
Machine learning
url http://www.sciencedirect.com/science/article/pii/S2352914820304184
work_keys_str_mv AT ibomoiyedomormienye animprovedensemblelearningapproachforthepredictionofheartdiseaserisk
AT yanxiasun animprovedensemblelearningapproachforthepredictionofheartdiseaserisk
AT zenghuiwang animprovedensemblelearningapproachforthepredictionofheartdiseaserisk
AT ibomoiyedomormienye improvedensemblelearningapproachforthepredictionofheartdiseaserisk
AT yanxiasun improvedensemblelearningapproachforthepredictionofheartdiseaserisk
AT zenghuiwang improvedensemblelearningapproachforthepredictionofheartdiseaserisk
_version_ 1724963554016100352