Survival prognostic factors in patients with acute myeloid leukemia using machine learning techniques.
This paper identifies prognosis factors for survival in patients with acute myeloid leukemia (AML) using machine learning techniques. We have integrated machine learning with feature selection methods and have compared their performances to identify the most suitable factors in assessing the surviva...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2021-01-01
|
Series: | PLoS ONE |
Online Access: | https://doi.org/10.1371/journal.pone.0254976 |
id |
doaj-fd77825e3e4741b6aa5d9b0b2418aa8a |
---|---|
record_format |
Article |
spelling |
doaj-fd77825e3e4741b6aa5d9b0b2418aa8a2021-08-03T04:33:01ZengPublic Library of Science (PLoS)PLoS ONE1932-62032021-01-01167e025497610.1371/journal.pone.0254976Survival prognostic factors in patients with acute myeloid leukemia using machine learning techniques.Keyvan KaramiMahboubeh AkbariMohammad-Taher MoradiBijan SoleymaniHossein FallahiThis paper identifies prognosis factors for survival in patients with acute myeloid leukemia (AML) using machine learning techniques. We have integrated machine learning with feature selection methods and have compared their performances to identify the most suitable factors in assessing the survival of AML patients. Here, six data mining algorithms including Decision Tree, Random Forrest, Logistic Regression, Naive Bayes, W-Bayes Net, and Gradient Boosted Tree (GBT) are employed for the detection model and implemented using the common data mining tool RapidMiner and open-source R package. To improve the predictive ability of our model, a set of features were selected by employing multiple feature selection methods. The accuracy of classification was obtained using 10-fold cross-validation for the various combinations of the feature selection methods and machine learning algorithms. The performance of the models was assessed by various measurement indexes including accuracy, kappa, sensitivity, specificity, positive predictive value, negative predictive value, and area under the ROC curve (AUC). Our results showed that GBT with an accuracy of 85.17%, AUC of 0.930, and the feature selection via the Relief algorithm has the best performance in predicting the survival rate of AML patients.https://doi.org/10.1371/journal.pone.0254976 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Keyvan Karami Mahboubeh Akbari Mohammad-Taher Moradi Bijan Soleymani Hossein Fallahi |
spellingShingle |
Keyvan Karami Mahboubeh Akbari Mohammad-Taher Moradi Bijan Soleymani Hossein Fallahi Survival prognostic factors in patients with acute myeloid leukemia using machine learning techniques. PLoS ONE |
author_facet |
Keyvan Karami Mahboubeh Akbari Mohammad-Taher Moradi Bijan Soleymani Hossein Fallahi |
author_sort |
Keyvan Karami |
title |
Survival prognostic factors in patients with acute myeloid leukemia using machine learning techniques. |
title_short |
Survival prognostic factors in patients with acute myeloid leukemia using machine learning techniques. |
title_full |
Survival prognostic factors in patients with acute myeloid leukemia using machine learning techniques. |
title_fullStr |
Survival prognostic factors in patients with acute myeloid leukemia using machine learning techniques. |
title_full_unstemmed |
Survival prognostic factors in patients with acute myeloid leukemia using machine learning techniques. |
title_sort |
survival prognostic factors in patients with acute myeloid leukemia using machine learning techniques. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2021-01-01 |
description |
This paper identifies prognosis factors for survival in patients with acute myeloid leukemia (AML) using machine learning techniques. We have integrated machine learning with feature selection methods and have compared their performances to identify the most suitable factors in assessing the survival of AML patients. Here, six data mining algorithms including Decision Tree, Random Forrest, Logistic Regression, Naive Bayes, W-Bayes Net, and Gradient Boosted Tree (GBT) are employed for the detection model and implemented using the common data mining tool RapidMiner and open-source R package. To improve the predictive ability of our model, a set of features were selected by employing multiple feature selection methods. The accuracy of classification was obtained using 10-fold cross-validation for the various combinations of the feature selection methods and machine learning algorithms. The performance of the models was assessed by various measurement indexes including accuracy, kappa, sensitivity, specificity, positive predictive value, negative predictive value, and area under the ROC curve (AUC). Our results showed that GBT with an accuracy of 85.17%, AUC of 0.930, and the feature selection via the Relief algorithm has the best performance in predicting the survival rate of AML patients. |
url |
https://doi.org/10.1371/journal.pone.0254976 |
work_keys_str_mv |
AT keyvankarami survivalprognosticfactorsinpatientswithacutemyeloidleukemiausingmachinelearningtechniques AT mahboubehakbari survivalprognosticfactorsinpatientswithacutemyeloidleukemiausingmachinelearningtechniques AT mohammadtahermoradi survivalprognosticfactorsinpatientswithacutemyeloidleukemiausingmachinelearningtechniques AT bijansoleymani survivalprognosticfactorsinpatientswithacutemyeloidleukemiausingmachinelearningtechniques AT hosseinfallahi survivalprognosticfactorsinpatientswithacutemyeloidleukemiausingmachinelearningtechniques |
_version_ |
1721224027845951488 |