A Study on Efficient Algorithms for Partial Periodic Pattern Mining

Doctoral === National Cheng Kung University === Institute of Manufacturing Information and Systems === 102 === Data mining techniques have been widely used in a variety of data-analysis applications in recent years. Periodic pattern mining, which finds useful rules or patterns in a single long-term time series, has become a very popular research topic. In real-lif...

Bibliographic Details
Main Authors: Kung-Jiuan Yang, 楊恭娟
Other Authors: 陳裕民
Format: Others
Language: en_US
Published: 2013
Online Access: http://ndltd.ncl.edu.tw/handle/66344711128060776432
id ndltd-TW-102NCKU0621002
record_format oai_dc
spelling ndltd-TW-102NCKU0621002 2017-01-27T04:12:12Z http://ndltd.ncl.edu.tw/handle/66344711128060776432 A Study on Efficient Algorithms for Partial Periodic Pattern Mining 高效性部份週期樣式探勘演算法之研究 Kung-Jiuan Yang 楊恭娟 Doctoral National Cheng Kung University Institute of Manufacturing Information and Systems 102 陳裕民 洪宗貝 2013 thesis 110 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description Doctoral === National Cheng Kung University === Institute of Manufacturing Information and Systems === 102 === Data mining techniques have been widely used in a variety of data-analysis applications in recent years. Periodic pattern mining, which finds useful rules or patterns in a single long-term time series, has become a very popular research topic. In real-life applications, "partial" periodic pattern mining is more flexible than "full" periodic pattern mining, mainly because the events at some time positions in a pattern can be uncertain. However, because partial periodic pattern mining can ignore the events at some time positions in a period, it has to generate a large number of candidate patterns. Developing efficient partial periodic pattern mining algorithms that reduce time cost is therefore a critical issue. In addition, because most studies of partial periodic pattern mining consider only the supports of items in period segments, many useful patterns with low frequency but high significance in event sequence data may not be found. Hence, this dissertation proposes two efficient projection-based mining algorithms and introduces two new problems: weighted partial periodic pattern mining (WPPP) and partial periodic pattern mining with multiple minimum constraints (PPPMM). For traditional partial periodic pattern mining, two algorithms, PPA (Projection-based Pattern Mining Approach) and PRA (Pruning Redundancy Approach), are proposed to improve execution efficiency in finding partial periodic patterns from a single event sequence with single or multiple events at a time point. Unlike the PPA algorithm, which uses no additional strategies, the PRA algorithm adopts two effective strategies, pruning and filtering, to eliminate a large number of candidates during mining.
The experimental results on several synthetic and real datasets show that the proposed approaches achieve up to a 70% performance improvement compared with the traditional MSA (Max-Subpattern Hit Set) algorithm. For WPPP, because the downward-closure property does not hold, an effective upper-bound model is developed to address this; it uses the maximum weight of all events in a period segment as the upper bound of any sub-pattern in that segment. Based on this model, a two-phase mining approach, PWA (Projection-based Weighted Mining Approach), is presented to complete the WPPP mining task. For PPPMM, an efficient two-phase mining approach, PAMMS (Projection-based Mining Approach with Multiple Minimum Supports), is proposed. In particular, because the downward-closure property does not hold in PPPMM either, the minimum constraint value of all events is used during mining to avoid losing information. Finally, the experimental results demonstrate the performance of both PWA and PAMMS in terms of pruning effectiveness and execution efficiency on synthetic and real datasets.
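The two ideas summarized in the description, counting the support of a partial periodic pattern over period segments and bounding a weighted pattern by the maximum event weight in each segment, can be illustrated with a minimal Python sketch. It is not the PPA, PRA, or PWA algorithm of the dissertation. The function names, the `*` wildcard, the one-event-per-time-point sequence, and the rule that a pattern's weight is the average weight of its fixed events are all assumptions made for illustration.

```python
from typing import List, Mapping, Sequence

WILDCARD = "*"  # assumed marker for a time position whose event is ignored


def split_into_segments(sequence: Sequence[str], period: int) -> List[Sequence[str]]:
    """Cut the event sequence into consecutive, complete period segments."""
    count = len(sequence) // period
    return [sequence[i * period:(i + 1) * period] for i in range(count)]


def matches(pattern: Sequence[str], segment: Sequence[str]) -> bool:
    """A segment matches when every non-wildcard position agrees."""
    return all(p == WILDCARD or p == e for p, e in zip(pattern, segment))


def support(pattern: Sequence[str], sequence: Sequence[str], period: int) -> int:
    """Number of period segments that match the pattern."""
    if len(pattern) != period:
        raise ValueError("pattern length must equal the period")
    return sum(matches(pattern, seg) for seg in split_into_segments(sequence, period))


def weighted_support(pattern: Sequence[str], sequence: Sequence[str],
                     period: int, weights: Mapping[str, float]) -> float:
    """Assumed measure: average weight of the fixed events times the support."""
    fixed = [e for e in pattern if e != WILDCARD]
    if not fixed:
        return 0.0
    average = sum(weights[e] for e in fixed) / len(fixed)
    return average * support(pattern, sequence, period)


def weighted_upper_bound(pattern: Sequence[str], sequence: Sequence[str],
                         period: int, weights: Mapping[str, float]) -> float:
    """Sum, over matching segments, of the largest event weight in the segment.

    A pattern's fixed events all occur in every segment it matches, so their
    average weight cannot exceed that segment's maximum weight.
    """
    if len(pattern) != period:
        raise ValueError("pattern length must equal the period")
    return sum(
        max(weights[e] for e in seg)
        for seg in split_into_segments(sequence, period)
        if matches(pattern, seg)
    )
```

For example, with the sequence `abcadcabdabc` and period 3, the segments are `abc`, `adc`, `abd`, and `abc`, so the pattern `a*c` matches three of the four segments. Under these assumptions the bound never falls below the weighted support, and it can only shrink as a pattern gains fixed positions, which is what makes it usable for pruning in a first phase before exact weights are checked in a second.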
author2 陳裕民
author_facet 陳裕民
Kung-Jiuan Yang
楊恭娟
author Kung-Jiuan Yang
楊恭娟
spellingShingle Kung-Jiuan Yang
楊恭娟
A Study on Efficient Algorithms for Partial Periodic Pattern Mining
author_sort Kung-Jiuan Yang
title A Study on Efficient Algorithms for Partial Periodic Pattern Mining
title_short A Study on Efficient Algorithms for Partial Periodic Pattern Mining
title_full A Study on Efficient Algorithms for Partial Periodic Pattern Mining
title_fullStr A Study on Efficient Algorithms for Partial Periodic Pattern Mining
title_full_unstemmed A Study on Efficient Algorithms for Partial Periodic Pattern Mining
title_sort study on efficient algorithms for partial periodic pattern mining
publishDate 2013
url http://ndltd.ncl.edu.tw/handle/66344711128060776432
work_keys_str_mv AT kungjiuanyang astudyonefficientalgorithmsforpartialperiodicpatternmining
AT yánggōngjuān astudyonefficientalgorithmsforpartialperiodicpatternmining
AT kungjiuanyang gāoxiàoxìngbùfènzhōuqīyàngshìtànkānyǎnsuànfǎzhīyánjiū
AT yánggōngjuān gāoxiàoxìngbùfènzhōuqīyàngshìtànkānyǎnsuànfǎzhīyánjiū
AT kungjiuanyang studyonefficientalgorithmsforpartialperiodicpatternmining
AT yánggōngjuān studyonefficientalgorithmsforpartialperiodicpatternmining
_version_ 1718410280906522624