Predictive model for the 5-year survival status of osteosarcoma patients based on the SEER database and XGBoost algorithm
Abstract Osteosarcoma is the most common bone malignancy, with the highest incidence in children and adolescents. Survival rate prediction is important for improving prognosis and planning therapy. However, there is still no prediction model with a high accuracy rate for osteosarcoma. Therefore, we...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Nature Publishing Group
2021-03-01
|
Series: | Scientific Reports |
Online Access: | https://doi.org/10.1038/s41598-021-85223-4 |
id |
doaj-4fc0f872ca3141acb0ce6f8ff31106a9 |
---|---|
record_format |
Article |
spelling |
doaj-4fc0f872ca3141acb0ce6f8ff31106a92021-03-11T12:15:23ZengNature Publishing GroupScientific Reports2045-23222021-03-011111910.1038/s41598-021-85223-4Predictive model for the 5-year survival status of osteosarcoma patients based on the SEER database and XGBoost algorithmJiuzhou Jiang0Hao Pan1Mobai Li2Bao Qian3Xianfeng Lin4Shunwu Fan5Department of Orthopaedic Surgery, Sir Run Run Shaw Hospital, Medical College of Zhejiang UniversityDepartment of Orthopaedics, The First Affiliated Hospital of Wenzhou Medical UniversityDepartment of Orthopaedic Surgery, Sir Run Run Shaw Hospital, Medical College of Zhejiang UniversityDepartment of Orthopaedic Surgery, Sir Run Run Shaw Hospital, Medical College of Zhejiang UniversityDepartment of Orthopaedic Surgery, Sir Run Run Shaw Hospital, Medical College of Zhejiang UniversityDepartment of Orthopaedic Surgery, Sir Run Run Shaw Hospital, Medical College of Zhejiang UniversityAbstract Osteosarcoma is the most common bone malignancy, with the highest incidence in children and adolescents. Survival rate prediction is important for improving prognosis and planning therapy. However, there is still no prediction model with a high accuracy rate for osteosarcoma. Therefore, we aimed to construct an artificial intelligence (AI) model for predicting the 5-year survival of osteosarcoma patients by using extreme gradient boosting (XGBoost), a large-scale machine-learning algorithm. We identified cases of osteosarcoma in the Surveillance, Epidemiology, and End Results (SEER) Research Database and excluded substandard samples. The study population was 835 and was divided into the training set (n = 668) and validation set (n = 167). Characteristics selected via survival analyses were used to construct the model. Receiver operating characteristic (ROC) curve and decision curve analyses were performed to evaluate the prediction. The accuracy of the prediction model was excellent both in the training set (area under the ROC curve [AUC] = 0.977) and the validation set (AUC = 0.911). Decision curve analyses proved the model could be used to support clinical decisions. XGBoost is an effective algorithm for predicting 5-year survival of osteosarcoma patients. Our prediction model had excellent accuracy and is therefore useful in clinical settings.https://doi.org/10.1038/s41598-021-85223-4 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Jiuzhou Jiang Hao Pan Mobai Li Bao Qian Xianfeng Lin Shunwu Fan |
spellingShingle |
Jiuzhou Jiang Hao Pan Mobai Li Bao Qian Xianfeng Lin Shunwu Fan Predictive model for the 5-year survival status of osteosarcoma patients based on the SEER database and XGBoost algorithm Scientific Reports |
author_facet |
Jiuzhou Jiang Hao Pan Mobai Li Bao Qian Xianfeng Lin Shunwu Fan |
author_sort |
Jiuzhou Jiang |
title |
Predictive model for the 5-year survival status of osteosarcoma patients based on the SEER database and XGBoost algorithm |
title_short |
Predictive model for the 5-year survival status of osteosarcoma patients based on the SEER database and XGBoost algorithm |
title_full |
Predictive model for the 5-year survival status of osteosarcoma patients based on the SEER database and XGBoost algorithm |
title_fullStr |
Predictive model for the 5-year survival status of osteosarcoma patients based on the SEER database and XGBoost algorithm |
title_full_unstemmed |
Predictive model for the 5-year survival status of osteosarcoma patients based on the SEER database and XGBoost algorithm |
title_sort |
predictive model for the 5-year survival status of osteosarcoma patients based on the seer database and xgboost algorithm |
publisher |
Nature Publishing Group |
series |
Scientific Reports |
issn |
2045-2322 |
publishDate |
2021-03-01 |
description |
Abstract Osteosarcoma is the most common bone malignancy, with the highest incidence in children and adolescents. Survival rate prediction is important for improving prognosis and planning therapy. However, there is still no prediction model with a high accuracy rate for osteosarcoma. Therefore, we aimed to construct an artificial intelligence (AI) model for predicting the 5-year survival of osteosarcoma patients by using extreme gradient boosting (XGBoost), a large-scale machine-learning algorithm. We identified cases of osteosarcoma in the Surveillance, Epidemiology, and End Results (SEER) Research Database and excluded substandard samples. The study population was 835 and was divided into the training set (n = 668) and validation set (n = 167). Characteristics selected via survival analyses were used to construct the model. Receiver operating characteristic (ROC) curve and decision curve analyses were performed to evaluate the prediction. The accuracy of the prediction model was excellent both in the training set (area under the ROC curve [AUC] = 0.977) and the validation set (AUC = 0.911). Decision curve analyses proved the model could be used to support clinical decisions. XGBoost is an effective algorithm for predicting 5-year survival of osteosarcoma patients. Our prediction model had excellent accuracy and is therefore useful in clinical settings. |
url |
https://doi.org/10.1038/s41598-021-85223-4 |
work_keys_str_mv |
AT jiuzhoujiang predictivemodelforthe5yearsurvivalstatusofosteosarcomapatientsbasedontheseerdatabaseandxgboostalgorithm AT haopan predictivemodelforthe5yearsurvivalstatusofosteosarcomapatientsbasedontheseerdatabaseandxgboostalgorithm AT mobaili predictivemodelforthe5yearsurvivalstatusofosteosarcomapatientsbasedontheseerdatabaseandxgboostalgorithm AT baoqian predictivemodelforthe5yearsurvivalstatusofosteosarcomapatientsbasedontheseerdatabaseandxgboostalgorithm AT xianfenglin predictivemodelforthe5yearsurvivalstatusofosteosarcomapatientsbasedontheseerdatabaseandxgboostalgorithm AT shunwufan predictivemodelforthe5yearsurvivalstatusofosteosarcomapatientsbasedontheseerdatabaseandxgboostalgorithm |
_version_ |
1724224574701174784 |