Searching Maximal Frequent Itemsets using Random Two-Way Pruning
碩士 === 南台科技大學 === 資訊管理系 === 92 === We propose two novel approaches for searching maximal frequent itemsets. They applied the concepts of genetic algorithm and random pruning on searching maximal frequent itemsets from transaction databases. The natures of these two algorithms make it possible to loc...
Main Author: | |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2004
|
Online Access: | http://ndltd.ncl.edu.tw/handle/90895980239900776702 |
id |
ndltd-TW-092STUT0396039 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-092STUT03960392016-11-22T04:12:28Z http://ndltd.ncl.edu.tw/handle/90895980239900776702 Searching Maximal Frequent Itemsets using Random Two-Way Pruning 應用隨機雙向的修剪機制尋找最大高頻項目集 楊哲綜 碩士 南台科技大學 資訊管理系 92 We propose two novel approaches for searching maximal frequent itemsets. They applied the concepts of genetic algorithm and random pruning on searching maximal frequent itemsets from transaction databases. The natures of these two algorithms make it possible to locate the frequent itemsets on their searching trees directly and free from enumerating all the frequent level by level. The first one is called by GAMax (Genetic Algorithm Based Searching of Maximal Frequent Itemsets). GAMax based on the concepts of finding optimal solutions to find the maximal frequent itemsets in transaction database. It is an alternative for solving pattern generation in association rules mining. The objective is to identify maximal frequent itemsets in lexicographic tree, needless to enumerate all the frequent itemsets level by level. Through well-defined fitness functions and operators, GAMax manipulates the specific meaningful codes in search space iteratively to determine the itemsets. This research transits the problems of generate the maximal frequent itemsets of association rules mining into optimization problems and implement using genetic algorithm. The other is called by RP_MFI (Searching Maximal Frequent Itemsets with Random Pruning). It was developed through improving the performance of GAMax. To accelerate the convergence of chromosomes and avoid the duplicate results of convergence are the points of improvements. RP_MFI applied the random pruning and two-way pruning to achieve better performance than GAMax. The experimental results of RP_MFI prove its efficiency. It is the new concepts to searching randomly and also expected to open new visions in combining genetic algorithm and data mining. They are both the flourishing areas of research and promising in current. 黃仁鵬 2004 學位論文 ; thesis 58 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 南台科技大學 === 資訊管理系 === 92 === We propose two novel approaches for searching maximal frequent itemsets. They applied the concepts of genetic algorithm and random pruning on searching maximal frequent itemsets from transaction databases. The natures of these two algorithms make it possible to locate the frequent itemsets on their searching trees directly and free from enumerating all the frequent level by level.
The first one is called by GAMax (Genetic Algorithm Based Searching of Maximal Frequent Itemsets). GAMax based on the concepts of finding optimal solutions to find the maximal frequent itemsets in transaction database. It is an alternative for solving pattern generation in association rules mining. The objective is to identify maximal frequent itemsets in lexicographic tree, needless to enumerate all the frequent itemsets level by level. Through well-defined fitness functions and operators, GAMax manipulates the specific meaningful codes in search space iteratively to determine the itemsets. This research transits the problems of generate the maximal frequent itemsets of association rules mining into optimization problems and implement using genetic algorithm.
The other is called by RP_MFI (Searching Maximal Frequent Itemsets with Random Pruning). It was developed through improving the performance of GAMax. To accelerate the convergence of chromosomes and avoid the duplicate results of convergence are the points of improvements. RP_MFI applied the random pruning and two-way pruning to achieve better performance than GAMax. The experimental results of RP_MFI prove its efficiency.
It is the new concepts to searching randomly and also expected to open new visions in combining genetic algorithm and data mining. They are both the flourishing areas of research and promising in current.
|
author2 |
黃仁鵬 |
author_facet |
黃仁鵬 楊哲綜 |
author |
楊哲綜 |
spellingShingle |
楊哲綜 Searching Maximal Frequent Itemsets using Random Two-Way Pruning |
author_sort |
楊哲綜 |
title |
Searching Maximal Frequent Itemsets using Random Two-Way Pruning |
title_short |
Searching Maximal Frequent Itemsets using Random Two-Way Pruning |
title_full |
Searching Maximal Frequent Itemsets using Random Two-Way Pruning |
title_fullStr |
Searching Maximal Frequent Itemsets using Random Two-Way Pruning |
title_full_unstemmed |
Searching Maximal Frequent Itemsets using Random Two-Way Pruning |
title_sort |
searching maximal frequent itemsets using random two-way pruning |
publishDate |
2004 |
url |
http://ndltd.ncl.edu.tw/handle/90895980239900776702 |
work_keys_str_mv |
AT yángzhézōng searchingmaximalfrequentitemsetsusingrandomtwowaypruning AT yángzhézōng yīngyòngsuíjīshuāngxiàngdexiūjiǎnjīzhìxúnzhǎozuìdàgāopínxiàngmùjí |
_version_ |
1718396128804732928 |