Completion of Quantitative Missing Values Using Principal Component Analysis Method
碩士 === 元智大學 === 資訊管理研究所 === 88 === Missing value is an important issue for the analysts during the process of data mining process. The problem is what kind of methods will be suitable to complete those missing values. A current approach filling the missing values in decision trees is usin...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2000
|
Online Access: | http://ndltd.ncl.edu.tw/handle/19881724073002520891 |
id |
ndltd-TW-088YZU00396012 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-088YZU003960122016-01-29T04:19:40Z http://ndltd.ncl.edu.tw/handle/19881724073002520891 Completion of Quantitative Missing Values Using Principal Component Analysis Method 以主成份分析法處理定量資料缺失值問題 Choa, syie-yi 趙士儀 碩士 元智大學 資訊管理研究所 88 Missing value is an important issue for the analysts during the process of data mining process. The problem is what kind of methods will be suitable to complete those missing values. A current approach filling the missing values in decision trees is using the most common value. This value can be chosen either from the whole data set or from data sets constructed for the classification task. An alternative method, MVC (Missing Values Completion), uses the association rules, discovered with RAR (Robust Association Rules) to mine databases containing multiple missing values, allows to use it for the missing values problem. However, the MVC method is not suitable for mining the quantitative association rules. Our approach is using the concept of PCA (Principal Component Analysis) and eigenvectors figuring out the principal components from quantitative attributes, and using the PCs (Principal Components) handling the missing values problem. The study demonstrated that the principal component analysis model can be used to administer a more reliable and valid values then the regression, with enough and representative data records. For the application, this approach is not only enhancing the validity of the data cleaning progress in KDD, but also predicting correctly the missing values in new data records that is according to the historic data records. Chien-Lung Chan 詹前隆 2000 學位論文 ; thesis 79 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 元智大學 === 資訊管理研究所 === 88 === Missing value is an important issue for the analysts during the process of data mining process. The problem is what kind of methods will be suitable to complete those missing values.
A current approach filling the missing values in decision trees is using the most common value. This value can be chosen either from the whole data set or from data sets constructed for the classification task. An alternative method, MVC (Missing Values Completion), uses the association rules, discovered with RAR (Robust Association Rules) to mine databases containing multiple missing values, allows to use it for the missing values problem. However, the MVC method is not suitable for mining the quantitative association rules.
Our approach is using the concept of PCA (Principal Component Analysis) and eigenvectors figuring out the principal components from quantitative attributes, and using the PCs (Principal Components) handling the missing values problem.
The study demonstrated that the principal component analysis model can be used to administer a more reliable and valid values then the regression, with enough and representative data records.
For the application, this approach is not only enhancing the validity of the data cleaning progress in KDD, but also predicting correctly the missing values in new data records that is according to the historic data records.
|
author2 |
Chien-Lung Chan |
author_facet |
Chien-Lung Chan Choa, syie-yi 趙士儀 |
author |
Choa, syie-yi 趙士儀 |
spellingShingle |
Choa, syie-yi 趙士儀 Completion of Quantitative Missing Values Using Principal Component Analysis Method |
author_sort |
Choa, syie-yi |
title |
Completion of Quantitative Missing Values Using Principal Component Analysis Method |
title_short |
Completion of Quantitative Missing Values Using Principal Component Analysis Method |
title_full |
Completion of Quantitative Missing Values Using Principal Component Analysis Method |
title_fullStr |
Completion of Quantitative Missing Values Using Principal Component Analysis Method |
title_full_unstemmed |
Completion of Quantitative Missing Values Using Principal Component Analysis Method |
title_sort |
completion of quantitative missing values using principal component analysis method |
publishDate |
2000 |
url |
http://ndltd.ncl.edu.tw/handle/19881724073002520891 |
work_keys_str_mv |
AT choasyieyi completionofquantitativemissingvaluesusingprincipalcomponentanalysismethod AT zhàoshìyí completionofquantitativemissingvaluesusingprincipalcomponentanalysismethod AT choasyieyi yǐzhǔchéngfènfēnxīfǎchùlǐdìngliàngzīliàoquēshīzhíwèntí AT zhàoshìyí yǐzhǔchéngfènfēnxīfǎchùlǐdìngliàngzīliàoquēshīzhíwèntí |
_version_ |
1718169620737687552 |