A Study on mining association rules with Incremental updates

碩士 === 逢甲大學 === 資訊工程所 === 90 === Although many efficient algorithms have been proposed for the discovery of association rules and the greater part of these algorithms can obtain good performance. But there is a serious problem in these algorithms. As long as the original database changed and then th...

Full description

Bibliographic Details
Main Authors:	Chung-Yung Wu, 吳忠勇
Other Authors:	Don-Lin Yang
Format:	Others
Language:	zh-TW
Published:	2002
Online Access:	http://ndltd.ncl.edu.tw/handle/pw465k

id	ndltd-TW-090FCU05392041
record_format	oai_dc
spelling	ndltd-TW-090FCU053920412018-05-11T04:19:33Z http://ndltd.ncl.edu.tw/handle/pw465k A Study on mining association rules with Incremental updates 漸進式異動資料的關聯法則挖掘之研究 Chung-Yung Wu 吳忠勇碩士逢甲大學資訊工程所 90 Although many efficient algorithms have been proposed for the discovery of association rules and the greater part of these algorithms can obtain good performance. But there is a serious problem in these algorithms. As long as the original database changed and then the only choice is to rerun the static algorithm of association rules mining once. Unfortunately the database will have a high probability to vary in the practical application. The problem that we call incremental mining of association rules is focusing on how to quickly update the association rules of the updated database and it is interesting for many people. In this thesis, in order to efficiently update the frequent itemsets of the updated database, we presented two efficient algorithms to handle the incremental maintenance of association rules. For convenience we call the two algorithms EIM-A and EIM-G simply. EIM-A is focusing on only there are new data added in the original database. It only needs to scan the original database less then once and using the frequent itemsets that contain in the knowledge database to create filter condition. By the condition filter the candidate itemsets that large in incremental data. The best case is never to scan the original database. The worst case is just using the large itemset generated from incremental data to rescan the original database. EIM-G not only handle add data but also data is deleted from original database. EIM-G utilizes the minimal infrequent itemset that contained in the knowledge database to generate new candidate itemsets. EIM-G also maintain the hashing table that in the knowledge database in order to avoid the number of candidate itemsets to be too huge. The experiment results will prove the peroformance of EIM-G. EIM-G will have a good performance especially when the number of original database is much large then the number of incremental data. Don-Lin Yang 楊東麟 2002 學位論文 ; thesis 101 zh-TW
collection	NDLTD
language	zh-TW
format	Others
sources	NDLTD
description	碩士 === 逢甲大學 === 資訊工程所 === 90 === Although many efficient algorithms have been proposed for the discovery of association rules and the greater part of these algorithms can obtain good performance. But there is a serious problem in these algorithms. As long as the original database changed and then the only choice is to rerun the static algorithm of association rules mining once. Unfortunately the database will have a high probability to vary in the practical application. The problem that we call incremental mining of association rules is focusing on how to quickly update the association rules of the updated database and it is interesting for many people. In this thesis, in order to efficiently update the frequent itemsets of the updated database, we presented two efficient algorithms to handle the incremental maintenance of association rules. For convenience we call the two algorithms EIM-A and EIM-G simply. EIM-A is focusing on only there are new data added in the original database. It only needs to scan the original database less then once and using the frequent itemsets that contain in the knowledge database to create filter condition. By the condition filter the candidate itemsets that large in incremental data. The best case is never to scan the original database. The worst case is just using the large itemset generated from incremental data to rescan the original database. EIM-G not only handle add data but also data is deleted from original database. EIM-G utilizes the minimal infrequent itemset that contained in the knowledge database to generate new candidate itemsets. EIM-G also maintain the hashing table that in the knowledge database in order to avoid the number of candidate itemsets to be too huge. The experiment results will prove the peroformance of EIM-G. EIM-G will have a good performance especially when the number of original database is much large then the number of incremental data.
author2	Don-Lin Yang
author_facet	Don-Lin Yang Chung-Yung Wu 吳忠勇
author	Chung-Yung Wu 吳忠勇
spellingShingle	Chung-Yung Wu 吳忠勇 A Study on mining association rules with Incremental updates
author_sort	Chung-Yung Wu
title	A Study on mining association rules with Incremental updates
title_short	A Study on mining association rules with Incremental updates
title_full	A Study on mining association rules with Incremental updates
title_fullStr	A Study on mining association rules with Incremental updates
title_full_unstemmed	A Study on mining association rules with Incremental updates
title_sort	study on mining association rules with incremental updates
publishDate	2002
url	http://ndltd.ncl.edu.tw/handle/pw465k
work_keys_str_mv	AT chungyungwu astudyonminingassociationruleswithincrementalupdates AT wúzhōngyǒng astudyonminingassociationruleswithincrementalupdates AT chungyungwu jiànjìnshìyìdòngzīliàodeguānliánfǎzéwājuézhīyánjiū AT wúzhōngyǒng jiànjìnshìyìdòngzīliàodeguānliánfǎzéwājuézhīyánjiū AT chungyungwu studyonminingassociationruleswithincrementalupdates AT wúzhōngyǒng studyonminingassociationruleswithincrementalupdates
_version_	1718635996407398400

A Study on mining association rules with Incremental updates

Similar Items