A Study on mining association rules with Incremental updates

碩士 === 逢甲大學 === 資訊工程所 === 90 === Although many efficient algorithms have been proposed for the discovery of association rules and the greater part of these algorithms can obtain good performance. But there is a serious problem in these algorithms. As long as the original database changed and then th...

Full description

Bibliographic Details
Main Authors: Chung-Yung Wu, 吳忠勇
Other Authors: Don-Lin Yang
Format: Others
Language:zh-TW
Published: 2002
Online Access:http://ndltd.ncl.edu.tw/handle/pw465k
id ndltd-TW-090FCU05392041
record_format oai_dc
spelling ndltd-TW-090FCU053920412018-05-11T04:19:33Z http://ndltd.ncl.edu.tw/handle/pw465k A Study on mining association rules with Incremental updates 漸進式異動資料的關聯法則挖掘之研究 Chung-Yung Wu 吳忠勇 碩士 逢甲大學 資訊工程所 90 Although many efficient algorithms have been proposed for the discovery of association rules and the greater part of these algorithms can obtain good performance. But there is a serious problem in these algorithms. As long as the original database changed and then the only choice is to rerun the static algorithm of association rules mining once. Unfortunately the database will have a high probability to vary in the practical application. The problem that we call incremental mining of association rules is focusing on how to quickly update the association rules of the updated database and it is interesting for many people. In this thesis, in order to efficiently update the frequent itemsets of the updated database, we presented two efficient algorithms to handle the incremental maintenance of association rules. For convenience we call the two algorithms EIM-A and EIM-G simply. EIM-A is focusing on only there are new data added in the original database. It only needs to scan the original database less then once and using the frequent itemsets that contain in the knowledge database to create filter condition. By the condition filter the candidate itemsets that large in incremental data. The best case is never to scan the original database. The worst case is just using the large itemset generated from incremental data to rescan the original database. EIM-G not only handle add data but also data is deleted from original database. EIM-G utilizes the minimal infrequent itemset that contained in the knowledge database to generate new candidate itemsets. EIM-G also maintain the hashing table that in the knowledge database in order to avoid the number of candidate itemsets to be too huge. The experiment results will prove the peroformance of EIM-G. EIM-G will have a good performance especially when the number of original database is much large then the number of incremental data. Don-Lin Yang 楊東麟 2002 學位論文 ; thesis 101 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 逢甲大學 === 資訊工程所 === 90 === Although many efficient algorithms have been proposed for the discovery of association rules and the greater part of these algorithms can obtain good performance. But there is a serious problem in these algorithms. As long as the original database changed and then the only choice is to rerun the static algorithm of association rules mining once. Unfortunately the database will have a high probability to vary in the practical application. The problem that we call incremental mining of association rules is focusing on how to quickly update the association rules of the updated database and it is interesting for many people. In this thesis, in order to efficiently update the frequent itemsets of the updated database, we presented two efficient algorithms to handle the incremental maintenance of association rules. For convenience we call the two algorithms EIM-A and EIM-G simply. EIM-A is focusing on only there are new data added in the original database. It only needs to scan the original database less then once and using the frequent itemsets that contain in the knowledge database to create filter condition. By the condition filter the candidate itemsets that large in incremental data. The best case is never to scan the original database. The worst case is just using the large itemset generated from incremental data to rescan the original database. EIM-G not only handle add data but also data is deleted from original database. EIM-G utilizes the minimal infrequent itemset that contained in the knowledge database to generate new candidate itemsets. EIM-G also maintain the hashing table that in the knowledge database in order to avoid the number of candidate itemsets to be too huge. The experiment results will prove the peroformance of EIM-G. EIM-G will have a good performance especially when the number of original database is much large then the number of incremental data.
author2 Don-Lin Yang
author_facet Don-Lin Yang
Chung-Yung Wu
吳忠勇
author Chung-Yung Wu
吳忠勇
spellingShingle Chung-Yung Wu
吳忠勇
A Study on mining association rules with Incremental updates
author_sort Chung-Yung Wu
title A Study on mining association rules with Incremental updates
title_short A Study on mining association rules with Incremental updates
title_full A Study on mining association rules with Incremental updates
title_fullStr A Study on mining association rules with Incremental updates
title_full_unstemmed A Study on mining association rules with Incremental updates
title_sort study on mining association rules with incremental updates
publishDate 2002
url http://ndltd.ncl.edu.tw/handle/pw465k
work_keys_str_mv AT chungyungwu astudyonminingassociationruleswithincrementalupdates
AT wúzhōngyǒng astudyonminingassociationruleswithincrementalupdates
AT chungyungwu jiànjìnshìyìdòngzīliàodeguānliánfǎzéwājuézhīyánjiū
AT wúzhōngyǒng jiànjìnshìyìdòngzīliàodeguānliánfǎzéwājuézhīyánjiū
AT chungyungwu studyonminingassociationruleswithincrementalupdates
AT wúzhōngyǒng studyonminingassociationruleswithincrementalupdates
_version_ 1718635996407398400