Mining Repeating Pattern with Gap Constraint

碩士 === 國立交通大學 === 資訊科學與工程研究所 === 97 === Previous studies on mining repeating patterns focus on discovering sub-strings which appear frequently in a long string, converted from the music. An example of such repeating pattern is ”if the stock price of companies A and B both goes up on day one, the sto...

Full description

Bibliographic Details
Main Authors: Shin-Yi Chiu, 邱欣怡
Other Authors: Huang, Jiun-Long
Format: Others
Language:en_US
Published: 2009
Online Access:http://ndltd.ncl.edu.tw/handle/76961904112342784685
id ndltd-TW-097NCTU5394040
record_format oai_dc
spelling ndltd-TW-097NCTU53940402015-10-13T14:53:16Z http://ndltd.ncl.edu.tw/handle/76961904112342784685 Mining Repeating Pattern with Gap Constraint 容許間距之近似重覆樣式探勘 Shin-Yi Chiu 邱欣怡 碩士 國立交通大學 資訊科學與工程研究所 97 Previous studies on mining repeating patterns focus on discovering sub-strings which appear frequently in a long string, converted from the music. An example of such repeating pattern is ”if the stock price of companies A and B both goes up on day one, the stock price of company C will go up on exactly day fifth.” But the problem proposed by Tung gives too much limitation for mining repeating patterns from set sequence, the potential frequent patterns can not be found due to the frequencies distrusted. Hence, in our paper we define a new pattern, which allows the gap between two adjacent sets, and propose an algorithm, G-Apriori, to discover the repeating patterns with gap constraint from a set sequence. G-Apriori algorithm generates candidates and counts the frequency of these candidates by scanning the database. In order to avoid scanning the database so many times, the algorithm, GwI-Apriori is proposed to solve the problem. In GwI-Apriori method, it designs an index list, which contains the start position (SP) and end position (EP) list, for recording the positions of the frequent patterns. Besides, the GwI-Apriori also takes the additional strategy for pruning the searching space among the index lists. By using the index lists, the GwI-Apriori only scans the database once and computes the frequency of frequent patterns through the index lists. The experimental results show that the GwI-Apriori performs much better than G-Apriori. Huang, Jiun-Long Chen, Jing-Ying 黃俊龍 陳俊穎 2009 學位論文 ; thesis 33 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立交通大學 === 資訊科學與工程研究所 === 97 === Previous studies on mining repeating patterns focus on discovering sub-strings which appear frequently in a long string, converted from the music. An example of such repeating pattern is ”if the stock price of companies A and B both goes up on day one, the stock price of company C will go up on exactly day fifth.” But the problem proposed by Tung gives too much limitation for mining repeating patterns from set sequence, the potential frequent patterns can not be found due to the frequencies distrusted. Hence, in our paper we define a new pattern, which allows the gap between two adjacent sets, and propose an algorithm, G-Apriori, to discover the repeating patterns with gap constraint from a set sequence. G-Apriori algorithm generates candidates and counts the frequency of these candidates by scanning the database. In order to avoid scanning the database so many times, the algorithm, GwI-Apriori is proposed to solve the problem. In GwI-Apriori method, it designs an index list, which contains the start position (SP) and end position (EP) list, for recording the positions of the frequent patterns. Besides, the GwI-Apriori also takes the additional strategy for pruning the searching space among the index lists. By using the index lists, the GwI-Apriori only scans the database once and computes the frequency of frequent patterns through the index lists. The experimental results show that the GwI-Apriori performs much better than G-Apriori.
author2 Huang, Jiun-Long
author_facet Huang, Jiun-Long
Shin-Yi Chiu
邱欣怡
author Shin-Yi Chiu
邱欣怡
spellingShingle Shin-Yi Chiu
邱欣怡
Mining Repeating Pattern with Gap Constraint
author_sort Shin-Yi Chiu
title Mining Repeating Pattern with Gap Constraint
title_short Mining Repeating Pattern with Gap Constraint
title_full Mining Repeating Pattern with Gap Constraint
title_fullStr Mining Repeating Pattern with Gap Constraint
title_full_unstemmed Mining Repeating Pattern with Gap Constraint
title_sort mining repeating pattern with gap constraint
publishDate 2009
url http://ndltd.ncl.edu.tw/handle/76961904112342784685
work_keys_str_mv AT shinyichiu miningrepeatingpatternwithgapconstraint
AT qiūxīnyí miningrepeatingpatternwithgapconstraint
AT shinyichiu róngxǔjiānjùzhījìnshìzhòngfùyàngshìtànkān
AT qiūxīnyí róngxǔjiānjùzhījìnshìzhòngfùyàngshìtànkān
_version_ 1717760675411918848