Part-of-Speech Filtering Method in Information Retrieval
碩士 === 華梵大學 === 資訊管理學系碩士班 === 97 === In this paper, a word-based indexing method with part-of-speech filtering is proposed for information retrieval. Three different filtering methods: noun, noun-verb and noun-verb-adjective-adverb are used to compare with the traditional word-based method and the b...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2009
|
Online Access: | http://ndltd.ncl.edu.tw/handle/79541714918077090492 |
id |
ndltd-TW-097HCHT0396055 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-097HCHT03960552015-11-20T04:22:38Z http://ndltd.ncl.edu.tw/handle/79541714918077090492 Part-of-Speech Filtering Method in Information Retrieval 利用詞性過濾之資訊檢索技術 Kai-Wen Yang 楊凱雯 碩士 華梵大學 資訊管理學系碩士班 97 In this paper, a word-based indexing method with part-of-speech filtering is proposed for information retrieval. Three different filtering methods: noun, noun-verb and noun-verb-adjective-adverb are used to compare with the traditional word-based method and the bi-gram method. The experimental results show that the gap of the performance between the bigram method and the word-based method is small. In most cases, the performances of the word-based method with part-of-speech filtering are better than the bigram method. Compare to word-based method, the word-based method with part-of-speech filtering raises obviously the retrieval performances for both of the Headline field and all fields. But the word-based method with part-of-speech filtering shows 5% improvement in average for the retrieval of TEXT field. The word-based method and the word-based method with part-of-speech filtering have smaller indices and less retrieval time than the bigram method. Both methods are more suitable for the very-large scale information retrieval. Guo-Wei Bian 邊國維 2009 學位論文 ; thesis 45 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 華梵大學 === 資訊管理學系碩士班 === 97 === In this paper, a word-based indexing method with part-of-speech filtering is proposed for information retrieval. Three different filtering methods: noun, noun-verb and noun-verb-adjective-adverb are used to compare with the traditional word-based method and the bi-gram method.
The experimental results show that the gap of the performance between the bigram method and the word-based method is small. In most cases, the performances of the word-based method with part-of-speech filtering are better than the bigram method.
Compare to word-based method, the word-based method with part-of-speech filtering raises obviously the retrieval performances for both of the Headline field and all fields. But the word-based method with part-of-speech filtering shows 5% improvement in average for the retrieval of TEXT field.
The word-based method and the word-based method with part-of-speech filtering have smaller indices and less retrieval time than the bigram method. Both methods are more suitable for the very-large scale information retrieval.
|
author2 |
Guo-Wei Bian |
author_facet |
Guo-Wei Bian Kai-Wen Yang 楊凱雯 |
author |
Kai-Wen Yang 楊凱雯 |
spellingShingle |
Kai-Wen Yang 楊凱雯 Part-of-Speech Filtering Method in Information Retrieval |
author_sort |
Kai-Wen Yang |
title |
Part-of-Speech Filtering Method in Information Retrieval |
title_short |
Part-of-Speech Filtering Method in Information Retrieval |
title_full |
Part-of-Speech Filtering Method in Information Retrieval |
title_fullStr |
Part-of-Speech Filtering Method in Information Retrieval |
title_full_unstemmed |
Part-of-Speech Filtering Method in Information Retrieval |
title_sort |
part-of-speech filtering method in information retrieval |
publishDate |
2009 |
url |
http://ndltd.ncl.edu.tw/handle/79541714918077090492 |
work_keys_str_mv |
AT kaiwenyang partofspeechfilteringmethodininformationretrieval AT yángkǎiwén partofspeechfilteringmethodininformationretrieval AT kaiwenyang lìyòngcíxìngguòlǜzhīzīxùnjiǎnsuǒjìshù AT yángkǎiwén lìyòngcíxìngguòlǜzhīzīxùnjiǎnsuǒjìshù |
_version_ |
1718133180501852160 |