Application of Data Science Technology on Research of Circulatory System Disease Prediction Based on a Prospective Cohort

Chronic diseases represented by circulatory diseases have gradually become the main types of diseases affecting the health of our population. Establishing a circulatory system disease prediction model to predict the occurrence of diseases and controlling them is of great significance to the health o...

Full description

Bibliographic Details
Main Authors: Haijing Tang, Guo Chen, Yu Kang, Xu Yang
Format: Article
Language:English
Published: MDPI AG 2018-10-01
Series:Algorithms
Subjects:
Online Access:http://www.mdpi.com/1999-4893/11/10/162
id doaj-5031360709f449389cf6253275b31b49
record_format Article
spelling doaj-5031360709f449389cf6253275b31b492020-11-24T23:30:13ZengMDPI AGAlgorithms1999-48932018-10-01111016210.3390/a11100162a11100162Application of Data Science Technology on Research of Circulatory System Disease Prediction Based on a Prospective CohortHaijing Tang0Guo Chen1Yu Kang2Xu Yang3School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, ChinaSchool of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, ChinaSchool of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, ChinaSchool of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, ChinaChronic diseases represented by circulatory diseases have gradually become the main types of diseases affecting the health of our population. Establishing a circulatory system disease prediction model to predict the occurrence of diseases and controlling them is of great significance to the health of our population. This article is based on the prospective population cohort data of chronic diseases in China, based on the existing medical cohort studies, the Kaplan–Meier method was used for feature selection, and the traditional medical analysis model represented by the Cox proportional hazards model was used and introduced. Support vector machine research methods in machine learning establish circulatory system disease prediction models. This paper also attempts to introduce the proportion of the explanation variation (PEV) and the shrinkage factor to improve the Cox proportional hazards model; and the use of Particle Swarm Optimization (PSO) algorithm to optimize the parameters of SVM model. Finally, the experimental verification of the above prediction models is carried out. This paper uses the model training time, Accuracy rate(ACC), the area under curve (AUC)of the Receiver Operator Characteristic curve (ROC) and other forecasting indicators. The experimental results show that the PSO-SVM-CSDPC disease prediction model and the S-Cox-CSDPC circulation system disease prediction model have the advantages of fast model solving speed, accurate prediction results and strong generalization ability, which are helpful for the intervention and control of chronic diseases.http://www.mdpi.com/1999-4893/11/10/162population cohortcirculatory system diseaseCox proportional hazards modelsupport vector machinedisease prediction
collection DOAJ
language English
format Article
sources DOAJ
author Haijing Tang
Guo Chen
Yu Kang
Xu Yang
spellingShingle Haijing Tang
Guo Chen
Yu Kang
Xu Yang
Application of Data Science Technology on Research of Circulatory System Disease Prediction Based on a Prospective Cohort
Algorithms
population cohort
circulatory system disease
Cox proportional hazards model
support vector machine
disease prediction
author_facet Haijing Tang
Guo Chen
Yu Kang
Xu Yang
author_sort Haijing Tang
title Application of Data Science Technology on Research of Circulatory System Disease Prediction Based on a Prospective Cohort
title_short Application of Data Science Technology on Research of Circulatory System Disease Prediction Based on a Prospective Cohort
title_full Application of Data Science Technology on Research of Circulatory System Disease Prediction Based on a Prospective Cohort
title_fullStr Application of Data Science Technology on Research of Circulatory System Disease Prediction Based on a Prospective Cohort
title_full_unstemmed Application of Data Science Technology on Research of Circulatory System Disease Prediction Based on a Prospective Cohort
title_sort application of data science technology on research of circulatory system disease prediction based on a prospective cohort
publisher MDPI AG
series Algorithms
issn 1999-4893
publishDate 2018-10-01
description Chronic diseases represented by circulatory diseases have gradually become the main types of diseases affecting the health of our population. Establishing a circulatory system disease prediction model to predict the occurrence of diseases and controlling them is of great significance to the health of our population. This article is based on the prospective population cohort data of chronic diseases in China, based on the existing medical cohort studies, the Kaplan–Meier method was used for feature selection, and the traditional medical analysis model represented by the Cox proportional hazards model was used and introduced. Support vector machine research methods in machine learning establish circulatory system disease prediction models. This paper also attempts to introduce the proportion of the explanation variation (PEV) and the shrinkage factor to improve the Cox proportional hazards model; and the use of Particle Swarm Optimization (PSO) algorithm to optimize the parameters of SVM model. Finally, the experimental verification of the above prediction models is carried out. This paper uses the model training time, Accuracy rate(ACC), the area under curve (AUC)of the Receiver Operator Characteristic curve (ROC) and other forecasting indicators. The experimental results show that the PSO-SVM-CSDPC disease prediction model and the S-Cox-CSDPC circulation system disease prediction model have the advantages of fast model solving speed, accurate prediction results and strong generalization ability, which are helpful for the intervention and control of chronic diseases.
topic population cohort
circulatory system disease
Cox proportional hazards model
support vector machine
disease prediction
url http://www.mdpi.com/1999-4893/11/10/162
work_keys_str_mv AT haijingtang applicationofdatasciencetechnologyonresearchofcirculatorysystemdiseasepredictionbasedonaprospectivecohort
AT guochen applicationofdatasciencetechnologyonresearchofcirculatorysystemdiseasepredictionbasedonaprospectivecohort
AT yukang applicationofdatasciencetechnologyonresearchofcirculatorysystemdiseasepredictionbasedonaprospectivecohort
AT xuyang applicationofdatasciencetechnologyonresearchofcirculatorysystemdiseasepredictionbasedonaprospectivecohort
_version_ 1725542240018759680