Mining Frequent Itemsets from Uncertain Database
碩士 === 銘傳大學 === 資訊工程學系碩士班 === 101 === Mining frequent itemsets from the transaction database is order to find often purchased the combination of products. In other words, contain this itemset transactions reached a user-defined threshold and the transaction record of the transaction purchase those p...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2013
|
Online Access: | http://ndltd.ncl.edu.tw/handle/22190984293586456359 |
id |
ndltd-TW-101MCU05392013 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-101MCU053920132017-03-24T05:09:26Z http://ndltd.ncl.edu.tw/handle/22190984293586456359 Mining Frequent Itemsets from Uncertain Database 從不確定資料庫探勘頻繁項目集 Bei-Chuan Yang 楊倍權 碩士 銘傳大學 資訊工程學系碩士班 101 Mining frequent itemsets from the transaction database is order to find often purchased the combination of products. In other words, contain this itemset transactions reached a user-defined threshold and the transaction record of the transaction purchase those product items. We can combine frequent itemset for promotional merchandise to achieve the purpose of increasing sales. Mining frequent itemsets currently most efficient way is to use FP-Tree structure and the transaction database only records whether the product was purchased. However, in some applications, we cannot determine whether the event occurred. For example, at hospital’s diagnostic records database, the doctor will diagnose the patient may have symptoms of what.A data represents the patient may be suffering from what disease symptoms.Each symptoms recorded physician consider possibility, it is usually use a probability value to represent.This type of database called uncertain databases.If a set of items appearing in an uncertain probability of a database reaches a user-defined threshold.This represents the itemset''s items have high probability occur simultaneously.This itemset is called frequent itemset. For example, if thecoughandrunny nosewhilethe probability of occurrenceis high,when apatient hassymptoms of cough, we can determinewhichwill beaccompanied bya runny nosesymptoms, the doctor torefer you to dopreventativedosing. However, it is difficult to find useful information by setting thresholds to data mining.It might have toset differentthresholdsto find theportfolio of projectsmeetuser requirements.However, it is waste time to re-establish FP-Tree when the threshold is been reset.So this paper we propose a method by using similar FP-Tree structure from the database to find uncertain frequent itemsets.If the threshold has changed, our method does not re-create the FP-Tree can directly update the previously identified frequent itemsets.But if without consider thresholds to build the FP-Tree, the database must store all the items.In order to reduceFP-Treeoccupied by thestorage space,we also compression FP-Tree,and directly from the compressed FP-Tree to mine frequent itemsets. Show-Jane Yen 顏秀珍 2013 學位論文 ; thesis 46 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 銘傳大學 === 資訊工程學系碩士班 === 101 === Mining frequent itemsets from the transaction database is order to find often purchased the combination of products. In other words, contain this itemset transactions reached a user-defined threshold and the transaction record of the transaction purchase those product items. We can combine frequent itemset for promotional merchandise to achieve the purpose of increasing sales.
Mining frequent itemsets currently most efficient way is to use FP-Tree structure and the transaction database only records whether the product was purchased. However, in some applications, we cannot determine whether the event occurred. For example, at hospital’s diagnostic records database, the doctor will diagnose the patient may have symptoms of what.A data represents the patient may be suffering from what disease symptoms.Each symptoms recorded physician consider possibility, it is usually use a probability value to represent.This type of database called uncertain databases.If a set of items appearing in an uncertain probability of a database reaches a user-defined threshold.This represents the itemset''s items have high probability occur simultaneously.This itemset is called frequent itemset. For example, if thecoughandrunny nosewhilethe probability of occurrenceis high,when apatient hassymptoms of cough, we can determinewhichwill beaccompanied bya runny nosesymptoms, the doctor torefer you to dopreventativedosing.
However, it is difficult to find useful information by setting thresholds to data mining.It might have toset differentthresholdsto find theportfolio of projectsmeetuser requirements.However, it is waste time to re-establish FP-Tree when the threshold is been reset.So this paper we propose a method by using similar FP-Tree structure from the database to find uncertain frequent itemsets.If the threshold has changed, our method does not re-create the FP-Tree can directly update the previously identified frequent itemsets.But if without consider thresholds to build the FP-Tree, the database must store all the items.In order to reduceFP-Treeoccupied by thestorage space,we also compression FP-Tree,and directly from the compressed FP-Tree to mine frequent itemsets.
|
author2 |
Show-Jane Yen |
author_facet |
Show-Jane Yen Bei-Chuan Yang 楊倍權 |
author |
Bei-Chuan Yang 楊倍權 |
spellingShingle |
Bei-Chuan Yang 楊倍權 Mining Frequent Itemsets from Uncertain Database |
author_sort |
Bei-Chuan Yang |
title |
Mining Frequent Itemsets from Uncertain Database |
title_short |
Mining Frequent Itemsets from Uncertain Database |
title_full |
Mining Frequent Itemsets from Uncertain Database |
title_fullStr |
Mining Frequent Itemsets from Uncertain Database |
title_full_unstemmed |
Mining Frequent Itemsets from Uncertain Database |
title_sort |
mining frequent itemsets from uncertain database |
publishDate |
2013 |
url |
http://ndltd.ncl.edu.tw/handle/22190984293586456359 |
work_keys_str_mv |
AT beichuanyang miningfrequentitemsetsfromuncertaindatabase AT yángbèiquán miningfrequentitemsetsfromuncertaindatabase AT beichuanyang cóngbùquèdìngzīliàokùtànkānpínfánxiàngmùjí AT yángbèiquán cóngbùquèdìngzīliàokùtànkānpínfánxiàngmùjí |
_version_ |
1718434407124041728 |