Predicting Cardiovascular Risk in Athletes: Resampling Improves Classification Performance

<b> </b>Cardiovascular diseases are the main cause of death worldwide. The aim of the present study is to verify the performances of a data mining methodology in the evaluation of cardiovascular risk in athletes, and whether the results may be used to support clinical decision making. An...

Full description

Bibliographic Details
Main Authors: Davide Barbieri, Nitesh Chawla, Luciana Zaccagni, Tonći Grgurinović, Jelena Šarac, Miran Čoklo, Saša Missoni
Format: Article
Language:English
Published: MDPI AG 2020-10-01
Series:International Journal of Environmental Research and Public Health
Subjects:
Online Access:https://www.mdpi.com/1660-4601/17/21/7923
id doaj-d0ffa4555da246278a01155f7d43bd27
record_format Article
spelling doaj-d0ffa4555da246278a01155f7d43bd272020-11-25T03:58:32ZengMDPI AGInternational Journal of Environmental Research and Public Health1661-78271660-46012020-10-01177923792310.3390/ijerph17217923Predicting Cardiovascular Risk in Athletes: Resampling Improves Classification PerformanceDavide Barbieri0Nitesh Chawla1Luciana Zaccagni2Tonći Grgurinović3Jelena Šarac4Miran Čoklo5Saša Missoni6Department of Biomedical and Specialty Surgical Sciences, Faculty of Medicine, Pharmacy and Prevention, University of Ferrara, 44121 Ferrara, ItalyInterdisciplinary Center for Network Science and Applications, University of Notre Dame, Notre Dame, IN 46556, USADepartment of Biomedical and Specialty Surgical Sciences, Faculty of Medicine, Pharmacy and Prevention, University of Ferrara, 44121 Ferrara, ItalyPolyclinic for Occupational Health and Sports of Zagreb Sports Association with Laboratory of Medical Biochemistry, 10000 Zagreb, CroatiaCentre for Applied Bioanthropology, Institute for Anthropological Research, 10000 Zagreb, CroatiaCentre for Applied Bioanthropology, Institute for Anthropological Research, 10000 Zagreb, CroatiaInstitute for Anthropological Research, 10000 Zagreb, Croatia<b> </b>Cardiovascular diseases are the main cause of death worldwide. The aim of the present study is to verify the performances of a data mining methodology in the evaluation of cardiovascular risk in athletes, and whether the results may be used to support clinical decision making. Anthropometric (height and weight), demographic (age and sex) and biomedical (blood pressure and pulse rate) data of 26,002 athletes were collected in 2012 during routine sport medical examinations, which included electrocardiography at rest. Subjects were involved in competitive sport practice, for which medical clearance was needed. Outcomes were negative for the largest majority, as expected in an active population. Resampling was applied to balance positive/negative class ratio. A decision tree and logistic regression were used to classify individuals as either at risk or not. The receiver operating characteristic curve was used to assess classification performances. Data mining and resampling improved cardiovascular risk assessment in terms of increased area under the curve. The proposed methodology can be effectively applied to biomedical data in order to optimize clinical decision making, and—at the same time—minimize the amount of unnecessary examinations.https://www.mdpi.com/1660-4601/17/21/7923medical diagnosticdecision treelogistic regressionmachine learning
collection DOAJ
language English
format Article
sources DOAJ
author Davide Barbieri
Nitesh Chawla
Luciana Zaccagni
Tonći Grgurinović
Jelena Šarac
Miran Čoklo
Saša Missoni
spellingShingle Davide Barbieri
Nitesh Chawla
Luciana Zaccagni
Tonći Grgurinović
Jelena Šarac
Miran Čoklo
Saša Missoni
Predicting Cardiovascular Risk in Athletes: Resampling Improves Classification Performance
International Journal of Environmental Research and Public Health
medical diagnostic
decision tree
logistic regression
machine learning
author_facet Davide Barbieri
Nitesh Chawla
Luciana Zaccagni
Tonći Grgurinović
Jelena Šarac
Miran Čoklo
Saša Missoni
author_sort Davide Barbieri
title Predicting Cardiovascular Risk in Athletes: Resampling Improves Classification Performance
title_short Predicting Cardiovascular Risk in Athletes: Resampling Improves Classification Performance
title_full Predicting Cardiovascular Risk in Athletes: Resampling Improves Classification Performance
title_fullStr Predicting Cardiovascular Risk in Athletes: Resampling Improves Classification Performance
title_full_unstemmed Predicting Cardiovascular Risk in Athletes: Resampling Improves Classification Performance
title_sort predicting cardiovascular risk in athletes: resampling improves classification performance
publisher MDPI AG
series International Journal of Environmental Research and Public Health
issn 1661-7827
1660-4601
publishDate 2020-10-01
description <b> </b>Cardiovascular diseases are the main cause of death worldwide. The aim of the present study is to verify the performances of a data mining methodology in the evaluation of cardiovascular risk in athletes, and whether the results may be used to support clinical decision making. Anthropometric (height and weight), demographic (age and sex) and biomedical (blood pressure and pulse rate) data of 26,002 athletes were collected in 2012 during routine sport medical examinations, which included electrocardiography at rest. Subjects were involved in competitive sport practice, for which medical clearance was needed. Outcomes were negative for the largest majority, as expected in an active population. Resampling was applied to balance positive/negative class ratio. A decision tree and logistic regression were used to classify individuals as either at risk or not. The receiver operating characteristic curve was used to assess classification performances. Data mining and resampling improved cardiovascular risk assessment in terms of increased area under the curve. The proposed methodology can be effectively applied to biomedical data in order to optimize clinical decision making, and—at the same time—minimize the amount of unnecessary examinations.
topic medical diagnostic
decision tree
logistic regression
machine learning
url https://www.mdpi.com/1660-4601/17/21/7923
work_keys_str_mv AT davidebarbieri predictingcardiovascularriskinathletesresamplingimprovesclassificationperformance
AT niteshchawla predictingcardiovascularriskinathletesresamplingimprovesclassificationperformance
AT lucianazaccagni predictingcardiovascularriskinathletesresamplingimprovesclassificationperformance
AT toncigrgurinovic predictingcardiovascularriskinathletesresamplingimprovesclassificationperformance
AT jelenasarac predictingcardiovascularriskinathletesresamplingimprovesclassificationperformance
AT mirancoklo predictingcardiovascularriskinathletesresamplingimprovesclassificationperformance
AT sasamissoni predictingcardiovascularriskinathletesresamplingimprovesclassificationperformance
_version_ 1724456681418522624