Survival prognostic factors in patients with acute myeloid leukemia using machine learning techniques.

This paper identifies prognosis factors for survival in patients with acute myeloid leukemia (AML) using machine learning techniques. We have integrated machine learning with feature selection methods and have compared their performances to identify the most suitable factors in assessing the surviva...

Full description

Bibliographic Details
Main Authors: Keyvan Karami, Mahboubeh Akbari, Mohammad-Taher Moradi, Bijan Soleymani, Hossein Fallahi
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2021-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0254976
id doaj-fd77825e3e4741b6aa5d9b0b2418aa8a
record_format Article
spelling doaj-fd77825e3e4741b6aa5d9b0b2418aa8a2021-08-03T04:33:01ZengPublic Library of Science (PLoS)PLoS ONE1932-62032021-01-01167e025497610.1371/journal.pone.0254976Survival prognostic factors in patients with acute myeloid leukemia using machine learning techniques.Keyvan KaramiMahboubeh AkbariMohammad-Taher MoradiBijan SoleymaniHossein FallahiThis paper identifies prognosis factors for survival in patients with acute myeloid leukemia (AML) using machine learning techniques. We have integrated machine learning with feature selection methods and have compared their performances to identify the most suitable factors in assessing the survival of AML patients. Here, six data mining algorithms including Decision Tree, Random Forrest, Logistic Regression, Naive Bayes, W-Bayes Net, and Gradient Boosted Tree (GBT) are employed for the detection model and implemented using the common data mining tool RapidMiner and open-source R package. To improve the predictive ability of our model, a set of features were selected by employing multiple feature selection methods. The accuracy of classification was obtained using 10-fold cross-validation for the various combinations of the feature selection methods and machine learning algorithms. The performance of the models was assessed by various measurement indexes including accuracy, kappa, sensitivity, specificity, positive predictive value, negative predictive value, and area under the ROC curve (AUC). Our results showed that GBT with an accuracy of 85.17%, AUC of 0.930, and the feature selection via the Relief algorithm has the best performance in predicting the survival rate of AML patients.https://doi.org/10.1371/journal.pone.0254976
collection DOAJ
language English
format Article
sources DOAJ
author Keyvan Karami
Mahboubeh Akbari
Mohammad-Taher Moradi
Bijan Soleymani
Hossein Fallahi
spellingShingle Keyvan Karami
Mahboubeh Akbari
Mohammad-Taher Moradi
Bijan Soleymani
Hossein Fallahi
Survival prognostic factors in patients with acute myeloid leukemia using machine learning techniques.
PLoS ONE
author_facet Keyvan Karami
Mahboubeh Akbari
Mohammad-Taher Moradi
Bijan Soleymani
Hossein Fallahi
author_sort Keyvan Karami
title Survival prognostic factors in patients with acute myeloid leukemia using machine learning techniques.
title_short Survival prognostic factors in patients with acute myeloid leukemia using machine learning techniques.
title_full Survival prognostic factors in patients with acute myeloid leukemia using machine learning techniques.
title_fullStr Survival prognostic factors in patients with acute myeloid leukemia using machine learning techniques.
title_full_unstemmed Survival prognostic factors in patients with acute myeloid leukemia using machine learning techniques.
title_sort survival prognostic factors in patients with acute myeloid leukemia using machine learning techniques.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2021-01-01
description This paper identifies prognosis factors for survival in patients with acute myeloid leukemia (AML) using machine learning techniques. We have integrated machine learning with feature selection methods and have compared their performances to identify the most suitable factors in assessing the survival of AML patients. Here, six data mining algorithms including Decision Tree, Random Forrest, Logistic Regression, Naive Bayes, W-Bayes Net, and Gradient Boosted Tree (GBT) are employed for the detection model and implemented using the common data mining tool RapidMiner and open-source R package. To improve the predictive ability of our model, a set of features were selected by employing multiple feature selection methods. The accuracy of classification was obtained using 10-fold cross-validation for the various combinations of the feature selection methods and machine learning algorithms. The performance of the models was assessed by various measurement indexes including accuracy, kappa, sensitivity, specificity, positive predictive value, negative predictive value, and area under the ROC curve (AUC). Our results showed that GBT with an accuracy of 85.17%, AUC of 0.930, and the feature selection via the Relief algorithm has the best performance in predicting the survival rate of AML patients.
url https://doi.org/10.1371/journal.pone.0254976
work_keys_str_mv AT keyvankarami survivalprognosticfactorsinpatientswithacutemyeloidleukemiausingmachinelearningtechniques
AT mahboubehakbari survivalprognosticfactorsinpatientswithacutemyeloidleukemiausingmachinelearningtechniques
AT mohammadtahermoradi survivalprognosticfactorsinpatientswithacutemyeloidleukemiausingmachinelearningtechniques
AT bijansoleymani survivalprognosticfactorsinpatientswithacutemyeloidleukemiausingmachinelearningtechniques
AT hosseinfallahi survivalprognosticfactorsinpatientswithacutemyeloidleukemiausingmachinelearningtechniques
_version_ 1721224027845951488