Applying Hybrid Data Preprocessing Methods in Stroke Prediction

碩士 === 國立臺灣科技大學 === 工業管理系 === 101 === Stroke has always been highlighted as a big threat of health in the worldwide. Brain image examination and ultrasound are some alternatives to discover stroke disease. Data mining has been used widely in many areas, including medical industry. The uses of data m...

Full description

Bibliographic Details
Main Authors: Muhammad Rieza, 慕哈曼
Other Authors: Chao Ou-Yang
Format: Others
Language:en_US
Published: 2013
Online Access:http://ndltd.ncl.edu.tw/handle/32906847799465722877
id ndltd-TW-101NTUS5041112
record_format oai_dc
spelling ndltd-TW-101NTUS50411122016-03-21T04:28:02Z http://ndltd.ncl.edu.tw/handle/32906847799465722877 Applying Hybrid Data Preprocessing Methods in Stroke Prediction 發展一多階數資料探勘方法建立腦中風風險預測模型 Muhammad Rieza 慕哈曼 碩士 國立臺灣科技大學 工業管理系 101 Stroke has always been highlighted as a big threat of health in the worldwide. Brain image examination and ultrasound are some alternatives to discover stroke disease. Data mining has been used widely in many areas, including medical industry. The uses of data mining methods allow doctors to make prediction of certain diseases. Therefore, in this research, hybrid model integrating imbalance data preprocessing, feature selection, and back propagation network, support vector machine, decision tree for stroke prediction. The dataset used is brain examination data which collected from 2004 to 2011. However, highly imbalance dataset available can impact the performance of prediction as well as feature selected. The study firstly “rebalance” the dataset by comparing sampling methods; Random Under Sampling by Age and RUSboost. In addition, important features of balance training dataset would be selected by information gain and stepwise regression analysis. Towards the end, selected features would be processed using Back Propagation Network, Support Vector Machine and Decision Tree to predict the stroke. These hybrid methods may assist doctor to provide some possibilities information to the patient. Chao Ou-Yang 歐陽超 2013 學位論文 ; thesis 62 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺灣科技大學 === 工業管理系 === 101 === Stroke has always been highlighted as a big threat of health in the worldwide. Brain image examination and ultrasound are some alternatives to discover stroke disease. Data mining has been used widely in many areas, including medical industry. The uses of data mining methods allow doctors to make prediction of certain diseases. Therefore, in this research, hybrid model integrating imbalance data preprocessing, feature selection, and back propagation network, support vector machine, decision tree for stroke prediction. The dataset used is brain examination data which collected from 2004 to 2011. However, highly imbalance dataset available can impact the performance of prediction as well as feature selected. The study firstly “rebalance” the dataset by comparing sampling methods; Random Under Sampling by Age and RUSboost. In addition, important features of balance training dataset would be selected by information gain and stepwise regression analysis. Towards the end, selected features would be processed using Back Propagation Network, Support Vector Machine and Decision Tree to predict the stroke. These hybrid methods may assist doctor to provide some possibilities information to the patient.
author2 Chao Ou-Yang
author_facet Chao Ou-Yang
Muhammad Rieza
慕哈曼
author Muhammad Rieza
慕哈曼
spellingShingle Muhammad Rieza
慕哈曼
Applying Hybrid Data Preprocessing Methods in Stroke Prediction
author_sort Muhammad Rieza
title Applying Hybrid Data Preprocessing Methods in Stroke Prediction
title_short Applying Hybrid Data Preprocessing Methods in Stroke Prediction
title_full Applying Hybrid Data Preprocessing Methods in Stroke Prediction
title_fullStr Applying Hybrid Data Preprocessing Methods in Stroke Prediction
title_full_unstemmed Applying Hybrid Data Preprocessing Methods in Stroke Prediction
title_sort applying hybrid data preprocessing methods in stroke prediction
publishDate 2013
url http://ndltd.ncl.edu.tw/handle/32906847799465722877
work_keys_str_mv AT muhammadrieza applyinghybriddatapreprocessingmethodsinstrokeprediction
AT mùhāmàn applyinghybriddatapreprocessingmethodsinstrokeprediction
AT muhammadrieza fāzhǎnyīduōjiēshùzīliàotànkānfāngfǎjiànlìnǎozhōngfēngfēngxiǎnyùcèmóxíng
AT mùhāmàn fāzhǎnyīduōjiēshùzīliàotànkānfāngfǎjiànlìnǎozhōngfēngfēngxiǎnyùcèmóxíng
_version_ 1718209436145680384