A hybrid time-series model based on EMD-feature selection and Random Forest method for medical data forecasting

碩士 === 元培醫事科技大學 === 資訊管理系數位創新管理碩士班 === 106 === It is very important for hospital managers to allocate emergency department (ED) resources efficiently. Forecasting is a vital activity that instructs decision makers, in related research fields, such as industrial scientific planning, economic, and hea...

Full description

Bibliographic Details
Main Authors: TAN, KUO-YEN, 譚國彥
Other Authors: WEI, LIANG-YING
Format: Others
Language:zh-TW
Published: 2018
Online Access:http://ndltd.ncl.edu.tw/handle/2f8b8u
id ndltd-TW-106YUST0620004
record_format oai_dc
spelling ndltd-TW-106YUST06200042019-06-27T05:28:26Z http://ndltd.ncl.edu.tw/handle/2f8b8u A hybrid time-series model based on EMD-feature selection and Random Forest method for medical data forecasting 基於EMD特徵選擇和隨機森林法的混合時間序列模型應用於醫療數據預測 TAN, KUO-YEN 譚國彥 碩士 元培醫事科技大學 資訊管理系數位創新管理碩士班 106 It is very important for hospital managers to allocate emergency department (ED) resources efficiently. Forecasting is a vital activity that instructs decision makers, in related research fields, such as industrial scientific planning, economic, and healthcare. Scientists have applied time series methods to daily patient number forecasting at ED. Traditional time series models usually use single variable for forecasting, but noises caused by weather conditions change and environment factors would be included in raw data. Low forecasting performance would be generated because of using complicated raw data in time series models. Further, traditional time series models can not be utilized in all datasets because statistics models need meet statistical assumptions. Multi-attribute data will usually produce high-dimensional data and increase the computational complexity in datamining procedure. For overcoming these drawbacks above, this study proposes a hybrid time series random-forest model based on AR (autogressive) empirical mode decomposition (EMD). Proposed model utilizes EMD to decompose complicated raw data into correlations frequency components, and uses feature section method to reduce high-dimensional input data generated by EMD. Then, this study combines random forest method that can surmount the limitations of statistical methods (data need to obey some mathematical distribution) to forecast daily patient volumes. In order to verification, daily patient volumes in emergency are collected as experimental datasets to evaluate proposed model. Experimental results illustrate that proposed surpasses the listing models. WEI, LIANG-YING 魏良穎 2018 學位論文 ; thesis 19 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 元培醫事科技大學 === 資訊管理系數位創新管理碩士班 === 106 === It is very important for hospital managers to allocate emergency department (ED) resources efficiently. Forecasting is a vital activity that instructs decision makers, in related research fields, such as industrial scientific planning, economic, and healthcare. Scientists have applied time series methods to daily patient number forecasting at ED. Traditional time series models usually use single variable for forecasting, but noises caused by weather conditions change and environment factors would be included in raw data. Low forecasting performance would be generated because of using complicated raw data in time series models. Further, traditional time series models can not be utilized in all datasets because statistics models need meet statistical assumptions. Multi-attribute data will usually produce high-dimensional data and increase the computational complexity in datamining procedure. For overcoming these drawbacks above, this study proposes a hybrid time series random-forest model based on AR (autogressive) empirical mode decomposition (EMD). Proposed model utilizes EMD to decompose complicated raw data into correlations frequency components, and uses feature section method to reduce high-dimensional input data generated by EMD. Then, this study combines random forest method that can surmount the limitations of statistical methods (data need to obey some mathematical distribution) to forecast daily patient volumes. In order to verification, daily patient volumes in emergency are collected as experimental datasets to evaluate proposed model. Experimental results illustrate that proposed surpasses the listing models.
author2 WEI, LIANG-YING
author_facet WEI, LIANG-YING
TAN, KUO-YEN
譚國彥
author TAN, KUO-YEN
譚國彥
spellingShingle TAN, KUO-YEN
譚國彥
A hybrid time-series model based on EMD-feature selection and Random Forest method for medical data forecasting
author_sort TAN, KUO-YEN
title A hybrid time-series model based on EMD-feature selection and Random Forest method for medical data forecasting
title_short A hybrid time-series model based on EMD-feature selection and Random Forest method for medical data forecasting
title_full A hybrid time-series model based on EMD-feature selection and Random Forest method for medical data forecasting
title_fullStr A hybrid time-series model based on EMD-feature selection and Random Forest method for medical data forecasting
title_full_unstemmed A hybrid time-series model based on EMD-feature selection and Random Forest method for medical data forecasting
title_sort hybrid time-series model based on emd-feature selection and random forest method for medical data forecasting
publishDate 2018
url http://ndltd.ncl.edu.tw/handle/2f8b8u
work_keys_str_mv AT tankuoyen ahybridtimeseriesmodelbasedonemdfeatureselectionandrandomforestmethodformedicaldataforecasting
AT tánguóyàn ahybridtimeseriesmodelbasedonemdfeatureselectionandrandomforestmethodformedicaldataforecasting
AT tankuoyen jīyúemdtèzhēngxuǎnzéhésuíjīsēnlínfǎdehùnhéshíjiānxùlièmóxíngyīngyòngyúyīliáoshùjùyùcè
AT tánguóyàn jīyúemdtèzhēngxuǎnzéhésuíjīsēnlínfǎdehùnhéshíjiānxùlièmóxíngyīngyòngyúyīliáoshùjùyùcè
AT tankuoyen hybridtimeseriesmodelbasedonemdfeatureselectionandrandomforestmethodformedicaldataforecasting
AT tánguóyàn hybridtimeseriesmodelbasedonemdfeatureselectionandrandomforestmethodformedicaldataforecasting
_version_ 1719212515881123840