An Exact String Matching Problem Using Data Encoding Scheme

碩士 === 國立暨南國際大學 === 資訊工程學系 === 96 === The traditional exact string matching problem is to find all locations of a pattern string with length m in a text with length n. Here we propose a new encoding method to shorten the both lengths of pattern and text by substituting the substring between a speci...

Full description

Bibliographic Details
Main Authors: Hui-Min Chen, 陳慧敏
Other Authors: R.C.T. Lee
Format: Others
Language:en_US
Published: 2008
Online Access:http://ndltd.ncl.edu.tw/handle/99302180071058684639
id ndltd-TW-096NCNU0392021
record_format oai_dc
spelling ndltd-TW-096NCNU03920212016-05-16T04:09:53Z http://ndltd.ncl.edu.tw/handle/99302180071058684639 An Exact String Matching Problem Using Data Encoding Scheme 使用編碼技術解決完整字串比對問題 Hui-Min Chen 陳慧敏 碩士 國立暨南國際大學 資訊工程學系 96 The traditional exact string matching problem is to find all locations of a pattern string with length m in a text with length n. Here we propose a new encoding method to shorten the both lengths of pattern and text by substituting the substring between a special character for its length in O(m+n). Then we use an exact matching algorithm to solve the exact string matching problem on the encoding pattern and text. As can be seen、by using the encoding method、the pattern and text can be shortened about 2/|Σ| times the lengths of the original ones. In practice、it performs better than 2/|Σ|. For instance、for an English sentence pattern whose length is 50 and a text whose length is 200000、in average、the pattern is shortened to 6% of its original length and the text is shortened to 12.4% of its original length. Thus、the exact matching can be done in a much shorter time. R.C.T. Lee 李家同 2008 學位論文 ; thesis 50 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立暨南國際大學 === 資訊工程學系 === 96 === The traditional exact string matching problem is to find all locations of a pattern string with length m in a text with length n. Here we propose a new encoding method to shorten the both lengths of pattern and text by substituting the substring between a special character for its length in O(m+n). Then we use an exact matching algorithm to solve the exact string matching problem on the encoding pattern and text. As can be seen、by using the encoding method、the pattern and text can be shortened about 2/|Σ| times the lengths of the original ones. In practice、it performs better than 2/|Σ|. For instance、for an English sentence pattern whose length is 50 and a text whose length is 200000、in average、the pattern is shortened to 6% of its original length and the text is shortened to 12.4% of its original length. Thus、the exact matching can be done in a much shorter time.
author2 R.C.T. Lee
author_facet R.C.T. Lee
Hui-Min Chen
陳慧敏
author Hui-Min Chen
陳慧敏
spellingShingle Hui-Min Chen
陳慧敏
An Exact String Matching Problem Using Data Encoding Scheme
author_sort Hui-Min Chen
title An Exact String Matching Problem Using Data Encoding Scheme
title_short An Exact String Matching Problem Using Data Encoding Scheme
title_full An Exact String Matching Problem Using Data Encoding Scheme
title_fullStr An Exact String Matching Problem Using Data Encoding Scheme
title_full_unstemmed An Exact String Matching Problem Using Data Encoding Scheme
title_sort exact string matching problem using data encoding scheme
publishDate 2008
url http://ndltd.ncl.edu.tw/handle/99302180071058684639
work_keys_str_mv AT huiminchen anexactstringmatchingproblemusingdataencodingscheme
AT chénhuìmǐn anexactstringmatchingproblemusingdataencodingscheme
AT huiminchen shǐyòngbiānmǎjìshùjiějuéwánzhěngzìchuànbǐduìwèntí
AT chénhuìmǐn shǐyòngbiānmǎjìshùjiějuéwánzhěngzìchuànbǐduìwèntí
AT huiminchen exactstringmatchingproblemusingdataencodingscheme
AT chénhuìmǐn exactstringmatchingproblemusingdataencodingscheme
_version_ 1718268254470799360