Risk Factors and Comorbidities Associated to Cardiovascular Disease in Qatar: A Machine Learning Based Case-Control Study

Cardiovascular disease (CVD) is reported to be the leading cause of mortality in the middle eastern countries, including Qatar. But no comprehensive study has been conducted on the Qatar specific CVD risk factors identification. The objective of this case-control study was to develop machine learnin...

Full description

Bibliographic Details
Main Authors: Hamada R. H. Al-Absi, Mahmoud Ahmed Refaee, Atiq Ur Rehman, Mohammad Tariqul Islam, Samir Brahim Belhaouari, Tanvir Alam
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9354689/
id doaj-6008b00120da45be91d1c2ef33c53285
record_format Article
spelling doaj-6008b00120da45be91d1c2ef33c532852021-03-30T15:07:06ZengIEEEIEEE Access2169-35362021-01-019299292994110.1109/ACCESS.2021.30594699354689Risk Factors and Comorbidities Associated to Cardiovascular Disease in Qatar: A Machine Learning Based Case-Control StudyHamada R. H. Al-Absi0https://orcid.org/0000-0002-5636-7632Mahmoud Ahmed Refaee1Atiq Ur Rehman2https://orcid.org/0000-0003-0248-7919Mohammad Tariqul Islam3Samir Brahim Belhaouari4https://orcid.org/0000-0003-2336-0490Tanvir Alam5https://orcid.org/0000-0001-7033-3693College of Science and Engineering, Hamad Bin Khalifa University, Doha, QatarCollege of Science and Engineering, Hamad Bin Khalifa University, Doha, QatarCollege of Science and Engineering, Hamad Bin Khalifa University, Doha, QatarComputer Science Department, Southern Connecticut State University, New Haven, CT, USACollege of Science and Engineering, Hamad Bin Khalifa University, Doha, QatarCollege of Science and Engineering, Hamad Bin Khalifa University, Doha, QatarCardiovascular disease (CVD) is reported to be the leading cause of mortality in the middle eastern countries, including Qatar. But no comprehensive study has been conducted on the Qatar specific CVD risk factors identification. The objective of this case-control study was to develop machine learning (ML) model distinguishing healthy individuals from people having CVD, which could ultimately reveal the list of potential risk factors associated to CVD in Qatar. To the best of our knowledge, this study considered the largest collection of biomedical measurements representing the anthropometric measurements, clinical biomarkers, bioimpedance, spirometry, VICORDER readings, and behavioral factors of the CVD group from Qatar Biobank (QBB). CatBoost model achieved 93% accuracy, thereby outperforming the existing model for the same purpose. Interestingly, combining multimodal datasets into the proposed ML model outperformed the ML model built upon currently known risk factors for CVD, emphasizing the importance of incorporating other clinical biomarkers into consideration for CVD diagnosis plan. The ablation study on the multimodal dataset from QBB revealed that physio-clinical and bioimpedance measurements have the most distinguishing power to classify these two groups irrespective of gender and age of the participants. Multiple feature subset selection techniques confirmed known CVD risk factors (blood pressure, lipid profile, smoking, sedentary life, and diabetes), and identified potential novel risk factors linked to CVD-related comorbidities such as renal disorder (e.g., creatinine, uric acid, homocysteine, albumin), atherosclerosis (intima media thickness), hypercoagulable state (fibrinogen), and liver function (e.g., alkaline phosphate, gamma-glutamyl transferase). Moreover, the inclusion of the proposed novel factors into the ML model provides better performance than the model with traditional known risk factors for CVD. The association of the proposed risk factors and comorbidities are required to be investigated in clinical setup to understand their role in CVD better.https://ieeexplore.ieee.org/document/9354689/Cardiovascular diseasecoronary heart diseasecerebrovascular diseaserisk factormachine learningQatar Biobank (QBB)
collection DOAJ
language English
format Article
sources DOAJ
author Hamada R. H. Al-Absi
Mahmoud Ahmed Refaee
Atiq Ur Rehman
Mohammad Tariqul Islam
Samir Brahim Belhaouari
Tanvir Alam
spellingShingle Hamada R. H. Al-Absi
Mahmoud Ahmed Refaee
Atiq Ur Rehman
Mohammad Tariqul Islam
Samir Brahim Belhaouari
Tanvir Alam
Risk Factors and Comorbidities Associated to Cardiovascular Disease in Qatar: A Machine Learning Based Case-Control Study
IEEE Access
Cardiovascular disease
coronary heart disease
cerebrovascular disease
risk factor
machine learning
Qatar Biobank (QBB)
author_facet Hamada R. H. Al-Absi
Mahmoud Ahmed Refaee
Atiq Ur Rehman
Mohammad Tariqul Islam
Samir Brahim Belhaouari
Tanvir Alam
author_sort Hamada R. H. Al-Absi
title Risk Factors and Comorbidities Associated to Cardiovascular Disease in Qatar: A Machine Learning Based Case-Control Study
title_short Risk Factors and Comorbidities Associated to Cardiovascular Disease in Qatar: A Machine Learning Based Case-Control Study
title_full Risk Factors and Comorbidities Associated to Cardiovascular Disease in Qatar: A Machine Learning Based Case-Control Study
title_fullStr Risk Factors and Comorbidities Associated to Cardiovascular Disease in Qatar: A Machine Learning Based Case-Control Study
title_full_unstemmed Risk Factors and Comorbidities Associated to Cardiovascular Disease in Qatar: A Machine Learning Based Case-Control Study
title_sort risk factors and comorbidities associated to cardiovascular disease in qatar: a machine learning based case-control study
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description Cardiovascular disease (CVD) is reported to be the leading cause of mortality in the middle eastern countries, including Qatar. But no comprehensive study has been conducted on the Qatar specific CVD risk factors identification. The objective of this case-control study was to develop machine learning (ML) model distinguishing healthy individuals from people having CVD, which could ultimately reveal the list of potential risk factors associated to CVD in Qatar. To the best of our knowledge, this study considered the largest collection of biomedical measurements representing the anthropometric measurements, clinical biomarkers, bioimpedance, spirometry, VICORDER readings, and behavioral factors of the CVD group from Qatar Biobank (QBB). CatBoost model achieved 93% accuracy, thereby outperforming the existing model for the same purpose. Interestingly, combining multimodal datasets into the proposed ML model outperformed the ML model built upon currently known risk factors for CVD, emphasizing the importance of incorporating other clinical biomarkers into consideration for CVD diagnosis plan. The ablation study on the multimodal dataset from QBB revealed that physio-clinical and bioimpedance measurements have the most distinguishing power to classify these two groups irrespective of gender and age of the participants. Multiple feature subset selection techniques confirmed known CVD risk factors (blood pressure, lipid profile, smoking, sedentary life, and diabetes), and identified potential novel risk factors linked to CVD-related comorbidities such as renal disorder (e.g., creatinine, uric acid, homocysteine, albumin), atherosclerosis (intima media thickness), hypercoagulable state (fibrinogen), and liver function (e.g., alkaline phosphate, gamma-glutamyl transferase). Moreover, the inclusion of the proposed novel factors into the ML model provides better performance than the model with traditional known risk factors for CVD. The association of the proposed risk factors and comorbidities are required to be investigated in clinical setup to understand their role in CVD better.
topic Cardiovascular disease
coronary heart disease
cerebrovascular disease
risk factor
machine learning
Qatar Biobank (QBB)
url https://ieeexplore.ieee.org/document/9354689/
work_keys_str_mv AT hamadarhalabsi riskfactorsandcomorbiditiesassociatedtocardiovasculardiseaseinqataramachinelearningbasedcasecontrolstudy
AT mahmoudahmedrefaee riskfactorsandcomorbiditiesassociatedtocardiovasculardiseaseinqataramachinelearningbasedcasecontrolstudy
AT atiqurrehman riskfactorsandcomorbiditiesassociatedtocardiovasculardiseaseinqataramachinelearningbasedcasecontrolstudy
AT mohammadtariqulislam riskfactorsandcomorbiditiesassociatedtocardiovasculardiseaseinqataramachinelearningbasedcasecontrolstudy
AT samirbrahimbelhaouari riskfactorsandcomorbiditiesassociatedtocardiovasculardiseaseinqataramachinelearningbasedcasecontrolstudy
AT tanviralam riskfactorsandcomorbiditiesassociatedtocardiovasculardiseaseinqataramachinelearningbasedcasecontrolstudy
_version_ 1724179984691494912