A Study on mining association rules with Incremental updates
碩士 === 逢甲大學 === 資訊工程所 === 90 === Although many efficient algorithms have been proposed for the discovery of association rules and the greater part of these algorithms can obtain good performance. But there is a serious problem in these algorithms. As long as the original database changed and then th...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2002
|
Online Access: | http://ndltd.ncl.edu.tw/handle/pw465k |
id |
ndltd-TW-090FCU05392041 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-090FCU053920412018-05-11T04:19:33Z http://ndltd.ncl.edu.tw/handle/pw465k A Study on mining association rules with Incremental updates 漸進式異動資料的關聯法則挖掘之研究 Chung-Yung Wu 吳忠勇 碩士 逢甲大學 資訊工程所 90 Although many efficient algorithms have been proposed for the discovery of association rules and the greater part of these algorithms can obtain good performance. But there is a serious problem in these algorithms. As long as the original database changed and then the only choice is to rerun the static algorithm of association rules mining once. Unfortunately the database will have a high probability to vary in the practical application. The problem that we call incremental mining of association rules is focusing on how to quickly update the association rules of the updated database and it is interesting for many people. In this thesis, in order to efficiently update the frequent itemsets of the updated database, we presented two efficient algorithms to handle the incremental maintenance of association rules. For convenience we call the two algorithms EIM-A and EIM-G simply. EIM-A is focusing on only there are new data added in the original database. It only needs to scan the original database less then once and using the frequent itemsets that contain in the knowledge database to create filter condition. By the condition filter the candidate itemsets that large in incremental data. The best case is never to scan the original database. The worst case is just using the large itemset generated from incremental data to rescan the original database. EIM-G not only handle add data but also data is deleted from original database. EIM-G utilizes the minimal infrequent itemset that contained in the knowledge database to generate new candidate itemsets. EIM-G also maintain the hashing table that in the knowledge database in order to avoid the number of candidate itemsets to be too huge. The experiment results will prove the peroformance of EIM-G. EIM-G will have a good performance especially when the number of original database is much large then the number of incremental data. Don-Lin Yang 楊東麟 2002 學位論文 ; thesis 101 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 逢甲大學 === 資訊工程所 === 90 === Although many efficient algorithms have been proposed for the discovery of association rules and the greater part of these algorithms can obtain good performance. But there is a serious problem in these algorithms. As long as the original database changed and then the only choice is to rerun the static algorithm of association rules mining once. Unfortunately the database will have a high probability to vary in the practical application. The problem that we call incremental mining of association rules is focusing on how to quickly update the association rules of the updated database and it is interesting for many people. In this thesis, in order to efficiently update the frequent itemsets of the updated database, we presented two efficient algorithms to handle the incremental maintenance of association rules. For convenience we call the two algorithms EIM-A and EIM-G simply.
EIM-A is focusing on only there are new data added in the original database. It only needs to scan the original database less then once and using the frequent itemsets that contain in the knowledge database to create filter condition. By the condition filter the candidate itemsets that large in incremental data. The best case is never to scan the original database. The worst case is just using the large itemset generated from incremental data to rescan the original database. EIM-G not only handle add data but also data is deleted from original database. EIM-G utilizes the minimal infrequent itemset that contained in the knowledge database to generate new candidate itemsets. EIM-G also maintain the hashing table that in the knowledge database in order to avoid the number of candidate itemsets to be too huge. The experiment results will prove the peroformance of EIM-G. EIM-G will have a good performance especially when the number of original database is much large then the number of incremental data.
|
author2 |
Don-Lin Yang |
author_facet |
Don-Lin Yang Chung-Yung Wu 吳忠勇 |
author |
Chung-Yung Wu 吳忠勇 |
spellingShingle |
Chung-Yung Wu 吳忠勇 A Study on mining association rules with Incremental updates |
author_sort |
Chung-Yung Wu |
title |
A Study on mining association rules with Incremental updates |
title_short |
A Study on mining association rules with Incremental updates |
title_full |
A Study on mining association rules with Incremental updates |
title_fullStr |
A Study on mining association rules with Incremental updates |
title_full_unstemmed |
A Study on mining association rules with Incremental updates |
title_sort |
study on mining association rules with incremental updates |
publishDate |
2002 |
url |
http://ndltd.ncl.edu.tw/handle/pw465k |
work_keys_str_mv |
AT chungyungwu astudyonminingassociationruleswithincrementalupdates AT wúzhōngyǒng astudyonminingassociationruleswithincrementalupdates AT chungyungwu jiànjìnshìyìdòngzīliàodeguānliánfǎzéwājuézhīyánjiū AT wúzhōngyǒng jiànjìnshìyìdòngzīliàodeguānliánfǎzéwājuézhīyánjiū AT chungyungwu studyonminingassociationruleswithincrementalupdates AT wúzhōngyǒng studyonminingassociationruleswithincrementalupdates |
_version_ |
1718635996407398400 |