An Exact String Matching Problem Using Data Encoding Scheme
Master's === National Chi Nan University (國立暨南國際大學) === Department of Computer Science and Information Engineering (資訊工程學系) === 96 === The traditional exact string matching problem is to find all locations of a pattern string of length m in a text of length n. Here we propose a new encoding method to shorten both the pattern and the text by substituting the substring between a speci...
Main Authors: | Hui-Min Chen 陳慧敏 |
---|---|
Other Authors: | R.C.T. Lee 李家同 |
Format: | Others |
Language: | en_US |
Published: | 2008 |
Online Access: | http://ndltd.ncl.edu.tw/handle/99302180071058684639 |
id |
ndltd-TW-096NCNU0392021 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-096NCNU0392021 2016-05-16T04:09:53Z http://ndltd.ncl.edu.tw/handle/99302180071058684639 An Exact String Matching Problem Using Data Encoding Scheme 使用編碼技術解決完整字串比對問題 (Solving the Exact String Matching Problem Using an Encoding Technique) Hui-Min Chen 陳慧敏 Master's National Chi Nan University (國立暨南國際大學) Department of Computer Science and Information Engineering (資訊工程學系) 96 R.C.T. Lee 李家同 2008 學位論文 (degree thesis) ; thesis 50 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others |
sources |
NDLTD |
description |
Master's === National Chi Nan University (國立暨南國際大學) === Department of Computer Science and Information Engineering (資訊工程學系) === 96 === The traditional exact string matching problem is to find all locations of a pattern string of length m in a text of length n. Here we propose a new encoding method that shortens both the pattern and the text by replacing each substring between occurrences of a special character with its length, in O(m+n) time. We then apply an exact matching algorithm to the encoded pattern and text. With this encoding, the pattern and text can be shortened to about 2/|Σ| times their original lengths, and in practice the method performs better than 2/|Σ|. For instance, for an English-sentence pattern of length 50 and a text of length 200000, on average the pattern is shortened to 6% of its original length and the text to 12.4% of its original length. Thus, the exact matching can be done in much less time.
|
author2 |
R.C.T. Lee |
author_facet |
R.C.T. Lee Hui-Min Chen 陳慧敏 |
author |
Hui-Min Chen 陳慧敏 |
title |
An Exact String Matching Problem Using Data Encoding Scheme |
publishDate |
2008 |
url |
http://ndltd.ncl.edu.tw/handle/99302180071058684639 |
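
The encoding idea in the abstract might be sketched as follows. This is only one reading of the record's description, not the thesis's actual algorithm: it assumes the "special character" is a single symbol, that each substring lying between two consecutive occurrences of it is replaced by its length, and that every candidate found on the encoded sequences is then verified against the original text (the encoding discards the characters themselves). The names `special_positions`, `gap_lengths`, and `find_all` are hypothetical.

```python
from typing import List


def special_positions(s: str, special: str) -> List[int]:
    """Indices at which the (assumed single) special character occurs in s."""
    return [i for i, ch in enumerate(s) if ch == special]


def gap_lengths(positions: List[int]) -> List[int]:
    """Length of each substring strictly between consecutive special characters."""
    return [b - a - 1 for a, b in zip(positions, positions[1:])]


def find_all(pattern: str, text: str, special: str) -> List[int]:
    """Return every start index of pattern in text, in increasing order."""
    m, n = len(pattern), len(text)
    ppos = special_positions(pattern, special)
    if len(ppos) < 2:
        # The pattern has no gap to encode: fall back to a plain scan.
        return [i for i in range(n - m + 1) if text.startswith(pattern, i)]

    tpos = special_positions(text, special)
    pgaps, tgaps = gap_lengths(ppos), gap_lengths(tpos)
    k = len(pgaps)

    hits = []
    # Naive matching on the (much shorter) encoded sequences; any exact
    # matching algorithm could be substituted for this loop.
    for i in range(len(tgaps) - k + 1):
        if tgaps[i:i + k] != pgaps:
            continue
        # Align the pattern's first special character with text position tpos[i].
        start = tpos[i] - ppos[0]
        # The encoding is lossy, so confirm the candidate on the original strings.
        if start >= 0 and text.startswith(pattern, start):
            hits.append(start)
    return hits
```

For example, `find_all("abcab", "xabcabcabz", "a")` compares the encoded pattern `[2]` against the encoded text `[2, 2]` and, after verification, reports both occurrences. Because candidates are verified one by one, this sketch does not by itself guarantee the running time claimed in the abstract.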