A Reverse Nearest Neighbor Based Instance Selection Algorithm
碩士 === 國立臺灣科技大學 === 資訊工程系 === 99 === Data reduction is to extract a subset from a dataset. The advantage of data reduction is decreasing the requirement of storage. Using the subset as training data is possible to maintain classification accuracy; sometimes, it can be further improved because of eli...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2011
|
Online Access: | http://ndltd.ncl.edu.tw/handle/67336232094961102183 |
id |
ndltd-TW-099NTUS5392055 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-099NTUS53920552015-10-13T20:13:37Z http://ndltd.ncl.edu.tw/handle/67336232094961102183 A Reverse Nearest Neighbor Based Instance Selection Algorithm 一個基於反向最近鄰居的資料挑選方法 Shu-ming Hsu 許書銘 碩士 國立臺灣科技大學 資訊工程系 99 Data reduction is to extract a subset from a dataset. The advantage of data reduction is decreasing the requirement of storage. Using the subset as training data is possible to maintain classification accuracy; sometimes, it can be further improved because of eliminating noises. The key is how to choose representative samples while ignoring noises at the same time. Many instance selection algorithms are based on Nearest Neighbor decision rule (NN). Some of these algorithms select samples based on two strategies, incremental and decremental. The first type of algorithms selects some instances as samples and iteratively adds instances which do not have the same class label with their nearest sample to the sample set. The second type of algorithms gradually removes instances based on its own strategies. However, we propose an algorithm based on Reverse Nearest Neighbor (RNN), called Reverse Nearest Neighbor Reduction (RNNR). RNNR selects samples which can represent other instances in the same class. In addition, RNNR does not need to iteratively scan a dataset which takes much processing time. Experimental results show that RNNR generally achieves higher accuracy, selects fewer samples and takes less processing time than comparators. Bi-ru Dai 戴碧如 2011 學位論文 ; thesis 30 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺灣科技大學 === 資訊工程系 === 99 === Data reduction is to extract a subset from a dataset. The advantage of data reduction is decreasing the requirement of storage. Using the subset as training data is possible to maintain classification accuracy; sometimes, it can be further improved because of eliminating noises. The key is how to choose representative samples while ignoring noises at the same time. Many instance selection algorithms are based on Nearest Neighbor decision rule (NN). Some of these algorithms select samples based on two strategies, incremental and decremental. The first type of algorithms selects some instances as samples and iteratively adds instances which do not have the same class label with their nearest sample to the sample set. The second type of algorithms gradually removes instances based on its own strategies. However, we propose an algorithm based on Reverse Nearest Neighbor (RNN), called Reverse Nearest Neighbor Reduction (RNNR). RNNR selects samples which can represent other instances in the same class. In addition, RNNR does not need to iteratively scan a dataset which takes much processing time. Experimental results show that RNNR generally achieves higher accuracy, selects fewer samples and takes less processing time than comparators.
|
author2 |
Bi-ru Dai |
author_facet |
Bi-ru Dai Shu-ming Hsu 許書銘 |
author |
Shu-ming Hsu 許書銘 |
spellingShingle |
Shu-ming Hsu 許書銘 A Reverse Nearest Neighbor Based Instance Selection Algorithm |
author_sort |
Shu-ming Hsu |
title |
A Reverse Nearest Neighbor Based Instance Selection Algorithm |
title_short |
A Reverse Nearest Neighbor Based Instance Selection Algorithm |
title_full |
A Reverse Nearest Neighbor Based Instance Selection Algorithm |
title_fullStr |
A Reverse Nearest Neighbor Based Instance Selection Algorithm |
title_full_unstemmed |
A Reverse Nearest Neighbor Based Instance Selection Algorithm |
title_sort |
reverse nearest neighbor based instance selection algorithm |
publishDate |
2011 |
url |
http://ndltd.ncl.edu.tw/handle/67336232094961102183 |
work_keys_str_mv |
AT shuminghsu areversenearestneighborbasedinstanceselectionalgorithm AT xǔshūmíng areversenearestneighborbasedinstanceselectionalgorithm AT shuminghsu yīgèjīyúfǎnxiàngzuìjìnlínjūdezīliàotiāoxuǎnfāngfǎ AT xǔshūmíng yīgèjīyúfǎnxiàngzuìjìnlínjūdezīliàotiāoxuǎnfāngfǎ AT shuminghsu reversenearestneighborbasedinstanceselectionalgorithm AT xǔshūmíng reversenearestneighborbasedinstanceselectionalgorithm |
_version_ |
1718045449461432320 |