Retrospective Study on the Influencing Factors and Prediction of Hospitalization Expenses for Chronic Renal Failure in China Based on Random Forest and LASSO Regression

Aim: With the improvement in people's living standards, the incidence of chronic renal failure (CRF) is increasing annually. The increase in the number of patients with CRF has significantly increased pressure on China's medical budget. Predicting hospitalization expenses for CRF can provi...

Full description

Bibliographic Details
Main Authors: Pingping Dai, Weifu Chang, Zirui Xin, Haiwei Cheng, Wei Ouyang, Aijing Luo
Format: Article
Language:English
Published: Frontiers Media S.A. 2021-06-01
Series:Frontiers in Public Health
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fpubh.2021.678276/full
id doaj-1b62513781234b83b8ab757817fc04c4
record_format Article
spelling doaj-1b62513781234b83b8ab757817fc04c42021-06-15T06:37:56ZengFrontiers Media S.A.Frontiers in Public Health2296-25652021-06-01910.3389/fpubh.2021.678276678276Retrospective Study on the Influencing Factors and Prediction of Hospitalization Expenses for Chronic Renal Failure in China Based on Random Forest and LASSO RegressionPingping Dai0Pingping Dai1Weifu Chang2Zirui Xin3Zirui Xin4Haiwei Cheng5Wei Ouyang6Wei Ouyang7Aijing Luo8Key Laboratory of Medical Information Research, Third Xiangya Hospital, Central South University, Changsha, ChinaDepartment of Medical Information, School of Life Science, Central South University, Changsha, ChinaKey Laboratory of Medical Information Research, Third Xiangya Hospital, Central South University, Changsha, ChinaKey Laboratory of Medical Information Research, Third Xiangya Hospital, Central South University, Changsha, ChinaDepartment of Medical Information, School of Life Science, Central South University, Changsha, ChinaDepartment of Sociology, Central South University, Changsha, ChinaKey Laboratory of Medical Information Research, Third Xiangya Hospital, Central South University, Changsha, ChinaDepartment of Medical Information, School of Life Science, Central South University, Changsha, ChinaSecond Xiangya Hospital, Central South University, Changsha, ChinaAim: With the improvement in people's living standards, the incidence of chronic renal failure (CRF) is increasing annually. The increase in the number of patients with CRF has significantly increased pressure on China's medical budget. Predicting hospitalization expenses for CRF can provide guidance for effective allocation and control of medical costs. The purpose of this study was to use the random forest (RF) method and least absolute shrinkage and selection operator (LASSO) regression to predict personal hospitalization expenses of hospitalized patients with CRF and to evaluate related influencing factors.Methods: The data set was collected from the first page of data of the medical records of three tertiary first-class hospitals for the whole year of 2016. Factors influencing hospitalization expenses for CRF were analyzed. Random forest and least absolute shrinkage and selection operator regression models were used to establish a prediction model for the hospitalization expenses of patients with CRF, and comparisons and evaluations were carried out.Results: For CRF inpatients, statistically significant differences in hospitalization expenses were found for major procedures, medical payment method, hospitalization frequency, length of stay, number of other diagnoses, and number of procedures. The R2 of LASSO regression model and RF regression model are 0.6992 and 0.7946, respectively. The mean absolute error (MAE) and root mean square error (RMSE) of the LASSO regression model were 0.0268 and 0.043, respectively, and the MAE and RMSE of the RF prediction model were 0.0171 and 0.0355, respectively. In the RF model, and the weight of length of stay was the highest (0.730).Conclusions: The hospitalization expenses of patients with CRF are most affected by length of stay. The RF prediction model is superior to the LASSO regression model and can be used to predict the hospitalization expenses of patients with CRF. Health administration departments may consider formulating accurate individualized hospitalization expense reimbursement mechanisms accordingly.https://www.frontiersin.org/articles/10.3389/fpubh.2021.678276/fullrandom forestLASSO regressionchronic renal failurehospitalization costsinfluencing factorsprediction
collection DOAJ
language English
format Article
sources DOAJ
author Pingping Dai
Pingping Dai
Weifu Chang
Zirui Xin
Zirui Xin
Haiwei Cheng
Wei Ouyang
Wei Ouyang
Aijing Luo
spellingShingle Pingping Dai
Pingping Dai
Weifu Chang
Zirui Xin
Zirui Xin
Haiwei Cheng
Wei Ouyang
Wei Ouyang
Aijing Luo
Retrospective Study on the Influencing Factors and Prediction of Hospitalization Expenses for Chronic Renal Failure in China Based on Random Forest and LASSO Regression
Frontiers in Public Health
random forest
LASSO regression
chronic renal failure
hospitalization costs
influencing factors
prediction
author_facet Pingping Dai
Pingping Dai
Weifu Chang
Zirui Xin
Zirui Xin
Haiwei Cheng
Wei Ouyang
Wei Ouyang
Aijing Luo
author_sort Pingping Dai
title Retrospective Study on the Influencing Factors and Prediction of Hospitalization Expenses for Chronic Renal Failure in China Based on Random Forest and LASSO Regression
title_short Retrospective Study on the Influencing Factors and Prediction of Hospitalization Expenses for Chronic Renal Failure in China Based on Random Forest and LASSO Regression
title_full Retrospective Study on the Influencing Factors and Prediction of Hospitalization Expenses for Chronic Renal Failure in China Based on Random Forest and LASSO Regression
title_fullStr Retrospective Study on the Influencing Factors and Prediction of Hospitalization Expenses for Chronic Renal Failure in China Based on Random Forest and LASSO Regression
title_full_unstemmed Retrospective Study on the Influencing Factors and Prediction of Hospitalization Expenses for Chronic Renal Failure in China Based on Random Forest and LASSO Regression
title_sort retrospective study on the influencing factors and prediction of hospitalization expenses for chronic renal failure in china based on random forest and lasso regression
publisher Frontiers Media S.A.
series Frontiers in Public Health
issn 2296-2565
publishDate 2021-06-01
description Aim: With the improvement in people's living standards, the incidence of chronic renal failure (CRF) is increasing annually. The increase in the number of patients with CRF has significantly increased pressure on China's medical budget. Predicting hospitalization expenses for CRF can provide guidance for effective allocation and control of medical costs. The purpose of this study was to use the random forest (RF) method and least absolute shrinkage and selection operator (LASSO) regression to predict personal hospitalization expenses of hospitalized patients with CRF and to evaluate related influencing factors.Methods: The data set was collected from the first page of data of the medical records of three tertiary first-class hospitals for the whole year of 2016. Factors influencing hospitalization expenses for CRF were analyzed. Random forest and least absolute shrinkage and selection operator regression models were used to establish a prediction model for the hospitalization expenses of patients with CRF, and comparisons and evaluations were carried out.Results: For CRF inpatients, statistically significant differences in hospitalization expenses were found for major procedures, medical payment method, hospitalization frequency, length of stay, number of other diagnoses, and number of procedures. The R2 of LASSO regression model and RF regression model are 0.6992 and 0.7946, respectively. The mean absolute error (MAE) and root mean square error (RMSE) of the LASSO regression model were 0.0268 and 0.043, respectively, and the MAE and RMSE of the RF prediction model were 0.0171 and 0.0355, respectively. In the RF model, and the weight of length of stay was the highest (0.730).Conclusions: The hospitalization expenses of patients with CRF are most affected by length of stay. The RF prediction model is superior to the LASSO regression model and can be used to predict the hospitalization expenses of patients with CRF. Health administration departments may consider formulating accurate individualized hospitalization expense reimbursement mechanisms accordingly.
topic random forest
LASSO regression
chronic renal failure
hospitalization costs
influencing factors
prediction
url https://www.frontiersin.org/articles/10.3389/fpubh.2021.678276/full
work_keys_str_mv AT pingpingdai retrospectivestudyontheinfluencingfactorsandpredictionofhospitalizationexpensesforchronicrenalfailureinchinabasedonrandomforestandlassoregression
AT pingpingdai retrospectivestudyontheinfluencingfactorsandpredictionofhospitalizationexpensesforchronicrenalfailureinchinabasedonrandomforestandlassoregression
AT weifuchang retrospectivestudyontheinfluencingfactorsandpredictionofhospitalizationexpensesforchronicrenalfailureinchinabasedonrandomforestandlassoregression
AT ziruixin retrospectivestudyontheinfluencingfactorsandpredictionofhospitalizationexpensesforchronicrenalfailureinchinabasedonrandomforestandlassoregression
AT ziruixin retrospectivestudyontheinfluencingfactorsandpredictionofhospitalizationexpensesforchronicrenalfailureinchinabasedonrandomforestandlassoregression
AT haiweicheng retrospectivestudyontheinfluencingfactorsandpredictionofhospitalizationexpensesforchronicrenalfailureinchinabasedonrandomforestandlassoregression
AT weiouyang retrospectivestudyontheinfluencingfactorsandpredictionofhospitalizationexpensesforchronicrenalfailureinchinabasedonrandomforestandlassoregression
AT weiouyang retrospectivestudyontheinfluencingfactorsandpredictionofhospitalizationexpensesforchronicrenalfailureinchinabasedonrandomforestandlassoregression
AT aijingluo retrospectivestudyontheinfluencingfactorsandpredictionofhospitalizationexpensesforchronicrenalfailureinchinabasedonrandomforestandlassoregression
_version_ 1721376916169031680