Applying the Support Vector Regression to the Missing Value Problems
碩士 === 華梵大學 === 資訊管理學系碩士班 === 98 === Data Mining is now widespread used for many enterprises. There could be missing data from paperwork to electronic system because of human error or out–of–date information. Usually these data might be deleted or using average value, 0 and mode value to fill the mi...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2010
|
Online Access: | http://ndltd.ncl.edu.tw/handle/02368330022191248233 |
id |
ndltd-TW-098HCHT0396063 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-098HCHT03960632015-10-13T18:20:57Z http://ndltd.ncl.edu.tw/handle/02368330022191248233 Applying the Support Vector Regression to the Missing Value Problems 應用支援向量迴歸於遺漏值問題 Hsi–An Chen 陳璽安 碩士 華梵大學 資訊管理學系碩士班 98 Data Mining is now widespread used for many enterprises. There could be missing data from paperwork to electronic system because of human error or out–of–date information. Usually these data might be deleted or using average value, 0 and mode value to fill the missing values, but this can only applicable for fewer data. It will certainly affect the accuracy of data and ultimately unable to provide reliable information to the user. This thesis use open datasets in the test. It use some data with missing values at random from the open datasets, then use average value, 0, Back–propagation Network (BPN) and Support Vector Regression (SVR) to analyze numerical backfill. Finally this thesis use regression tree to analyze the comparisons. The result shows that anticipation value by using SVR has the closest average error to the original value for missing value. Zne–Jung Lee 李仁鐘 2010 學位論文 ; thesis 51 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 華梵大學 === 資訊管理學系碩士班 === 98 === Data Mining is now widespread used for many enterprises. There could be missing data from paperwork to electronic system because of human error or out–of–date information. Usually these data might be deleted or using average value, 0 and mode value to fill the missing values, but this can only applicable for fewer data. It will certainly affect the accuracy of data and ultimately unable to provide reliable information to the user.
This thesis use open datasets in the test. It use some data with missing values at random from the open datasets, then use average value, 0, Back–propagation Network (BPN) and Support Vector Regression (SVR) to analyze numerical backfill. Finally this thesis use regression tree to analyze the comparisons. The result shows that anticipation value by using SVR has the closest average error to the original value for missing value.
|
author2 |
Zne–Jung Lee |
author_facet |
Zne–Jung Lee Hsi–An Chen 陳璽安 |
author |
Hsi–An Chen 陳璽安 |
spellingShingle |
Hsi–An Chen 陳璽安 Applying the Support Vector Regression to the Missing Value Problems |
author_sort |
Hsi–An Chen |
title |
Applying the Support Vector Regression to the Missing Value Problems |
title_short |
Applying the Support Vector Regression to the Missing Value Problems |
title_full |
Applying the Support Vector Regression to the Missing Value Problems |
title_fullStr |
Applying the Support Vector Regression to the Missing Value Problems |
title_full_unstemmed |
Applying the Support Vector Regression to the Missing Value Problems |
title_sort |
applying the support vector regression to the missing value problems |
publishDate |
2010 |
url |
http://ndltd.ncl.edu.tw/handle/02368330022191248233 |
work_keys_str_mv |
AT hsianchen applyingthesupportvectorregressiontothemissingvalueproblems AT chénxǐān applyingthesupportvectorregressiontothemissingvalueproblems AT hsianchen yīngyòngzhīyuánxiàngliànghuíguīyúyílòuzhíwèntí AT chénxǐān yīngyòngzhīyuánxiàngliànghuíguīyúyílòuzhíwèntí |
_version_ |
1718029833778233344 |