Predictive model for the 5-year survival status of osteosarcoma patients based on the SEER database and XGBoost algorithm

Abstract Osteosarcoma is the most common bone malignancy, with the highest incidence in children and adolescents. Survival rate prediction is important for improving prognosis and planning therapy. However, there is still no prediction model with a high accuracy rate for osteosarcoma. Therefore, we...

Full description

Bibliographic Details
Main Authors: Jiuzhou Jiang, Hao Pan, Mobai Li, Bao Qian, Xianfeng Lin, Shunwu Fan
Format: Article
Language:English
Published: Nature Publishing Group 2021-03-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-021-85223-4
id doaj-4fc0f872ca3141acb0ce6f8ff31106a9
record_format Article
spelling doaj-4fc0f872ca3141acb0ce6f8ff31106a92021-03-11T12:15:23ZengNature Publishing GroupScientific Reports2045-23222021-03-011111910.1038/s41598-021-85223-4Predictive model for the 5-year survival status of osteosarcoma patients based on the SEER database and XGBoost algorithmJiuzhou Jiang0Hao Pan1Mobai Li2Bao Qian3Xianfeng Lin4Shunwu Fan5Department of Orthopaedic Surgery, Sir Run Run Shaw Hospital, Medical College of Zhejiang UniversityDepartment of Orthopaedics, The First Affiliated Hospital of Wenzhou Medical UniversityDepartment of Orthopaedic Surgery, Sir Run Run Shaw Hospital, Medical College of Zhejiang UniversityDepartment of Orthopaedic Surgery, Sir Run Run Shaw Hospital, Medical College of Zhejiang UniversityDepartment of Orthopaedic Surgery, Sir Run Run Shaw Hospital, Medical College of Zhejiang UniversityDepartment of Orthopaedic Surgery, Sir Run Run Shaw Hospital, Medical College of Zhejiang UniversityAbstract Osteosarcoma is the most common bone malignancy, with the highest incidence in children and adolescents. Survival rate prediction is important for improving prognosis and planning therapy. However, there is still no prediction model with a high accuracy rate for osteosarcoma. Therefore, we aimed to construct an artificial intelligence (AI) model for predicting the 5-year survival of osteosarcoma patients by using extreme gradient boosting (XGBoost), a large-scale machine-learning algorithm. We identified cases of osteosarcoma in the Surveillance, Epidemiology, and End Results (SEER) Research Database and excluded substandard samples. The study population was 835 and was divided into the training set (n = 668) and validation set (n = 167). Characteristics selected via survival analyses were used to construct the model. Receiver operating characteristic (ROC) curve and decision curve analyses were performed to evaluate the prediction. The accuracy of the prediction model was excellent both in the training set (area under the ROC curve [AUC] = 0.977) and the validation set (AUC = 0.911). Decision curve analyses proved the model could be used to support clinical decisions. XGBoost is an effective algorithm for predicting 5-year survival of osteosarcoma patients. Our prediction model had excellent accuracy and is therefore useful in clinical settings.https://doi.org/10.1038/s41598-021-85223-4
collection DOAJ
language English
format Article
sources DOAJ
author Jiuzhou Jiang
Hao Pan
Mobai Li
Bao Qian
Xianfeng Lin
Shunwu Fan
spellingShingle Jiuzhou Jiang
Hao Pan
Mobai Li
Bao Qian
Xianfeng Lin
Shunwu Fan
Predictive model for the 5-year survival status of osteosarcoma patients based on the SEER database and XGBoost algorithm
Scientific Reports
author_facet Jiuzhou Jiang
Hao Pan
Mobai Li
Bao Qian
Xianfeng Lin
Shunwu Fan
author_sort Jiuzhou Jiang
title Predictive model for the 5-year survival status of osteosarcoma patients based on the SEER database and XGBoost algorithm
title_short Predictive model for the 5-year survival status of osteosarcoma patients based on the SEER database and XGBoost algorithm
title_full Predictive model for the 5-year survival status of osteosarcoma patients based on the SEER database and XGBoost algorithm
title_fullStr Predictive model for the 5-year survival status of osteosarcoma patients based on the SEER database and XGBoost algorithm
title_full_unstemmed Predictive model for the 5-year survival status of osteosarcoma patients based on the SEER database and XGBoost algorithm
title_sort predictive model for the 5-year survival status of osteosarcoma patients based on the seer database and xgboost algorithm
publisher Nature Publishing Group
series Scientific Reports
issn 2045-2322
publishDate 2021-03-01
description Abstract Osteosarcoma is the most common bone malignancy, with the highest incidence in children and adolescents. Survival rate prediction is important for improving prognosis and planning therapy. However, there is still no prediction model with a high accuracy rate for osteosarcoma. Therefore, we aimed to construct an artificial intelligence (AI) model for predicting the 5-year survival of osteosarcoma patients by using extreme gradient boosting (XGBoost), a large-scale machine-learning algorithm. We identified cases of osteosarcoma in the Surveillance, Epidemiology, and End Results (SEER) Research Database and excluded substandard samples. The study population was 835 and was divided into the training set (n = 668) and validation set (n = 167). Characteristics selected via survival analyses were used to construct the model. Receiver operating characteristic (ROC) curve and decision curve analyses were performed to evaluate the prediction. The accuracy of the prediction model was excellent both in the training set (area under the ROC curve [AUC] = 0.977) and the validation set (AUC = 0.911). Decision curve analyses proved the model could be used to support clinical decisions. XGBoost is an effective algorithm for predicting 5-year survival of osteosarcoma patients. Our prediction model had excellent accuracy and is therefore useful in clinical settings.
url https://doi.org/10.1038/s41598-021-85223-4
work_keys_str_mv AT jiuzhoujiang predictivemodelforthe5yearsurvivalstatusofosteosarcomapatientsbasedontheseerdatabaseandxgboostalgorithm
AT haopan predictivemodelforthe5yearsurvivalstatusofosteosarcomapatientsbasedontheseerdatabaseandxgboostalgorithm
AT mobaili predictivemodelforthe5yearsurvivalstatusofosteosarcomapatientsbasedontheseerdatabaseandxgboostalgorithm
AT baoqian predictivemodelforthe5yearsurvivalstatusofosteosarcomapatientsbasedontheseerdatabaseandxgboostalgorithm
AT xianfenglin predictivemodelforthe5yearsurvivalstatusofosteosarcomapatientsbasedontheseerdatabaseandxgboostalgorithm
AT shunwufan predictivemodelforthe5yearsurvivalstatusofosteosarcomapatientsbasedontheseerdatabaseandxgboostalgorithm
_version_ 1724224574701174784