Part-of-Speech Filtering Method in Information Retrieval

碩士 === 華梵大學 === 資訊管理學系碩士班 === 97 === In this paper, a word-based indexing method with part-of-speech filtering is proposed for information retrieval. Three different filtering methods: noun, noun-verb and noun-verb-adjective-adverb are used to compare with the traditional word-based method and the b...

Full description

Bibliographic Details
Main Authors: Kai-Wen Yang, 楊凱雯
Other Authors: Guo-Wei Bian
Format: Others
Language:zh-TW
Published: 2009
Online Access:http://ndltd.ncl.edu.tw/handle/79541714918077090492
id ndltd-TW-097HCHT0396055
record_format oai_dc
spelling ndltd-TW-097HCHT03960552015-11-20T04:22:38Z http://ndltd.ncl.edu.tw/handle/79541714918077090492 Part-of-Speech Filtering Method in Information Retrieval 利用詞性過濾之資訊檢索技術 Kai-Wen Yang 楊凱雯 碩士 華梵大學 資訊管理學系碩士班 97 In this paper, a word-based indexing method with part-of-speech filtering is proposed for information retrieval. Three different filtering methods: noun, noun-verb and noun-verb-adjective-adverb are used to compare with the traditional word-based method and the bi-gram method. The experimental results show that the gap of the performance between the bigram method and the word-based method is small. In most cases, the performances of the word-based method with part-of-speech filtering are better than the bigram method. Compare to word-based method, the word-based method with part-of-speech filtering raises obviously the retrieval performances for both of the Headline field and all fields. But the word-based method with part-of-speech filtering shows 5% improvement in average for the retrieval of TEXT field. The word-based method and the word-based method with part-of-speech filtering have smaller indices and less retrieval time than the bigram method. Both methods are more suitable for the very-large scale information retrieval. Guo-Wei Bian 邊國維 2009 學位論文 ; thesis 45 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 華梵大學 === 資訊管理學系碩士班 === 97 === In this paper, a word-based indexing method with part-of-speech filtering is proposed for information retrieval. Three different filtering methods: noun, noun-verb and noun-verb-adjective-adverb are used to compare with the traditional word-based method and the bi-gram method. The experimental results show that the gap of the performance between the bigram method and the word-based method is small. In most cases, the performances of the word-based method with part-of-speech filtering are better than the bigram method. Compare to word-based method, the word-based method with part-of-speech filtering raises obviously the retrieval performances for both of the Headline field and all fields. But the word-based method with part-of-speech filtering shows 5% improvement in average for the retrieval of TEXT field. The word-based method and the word-based method with part-of-speech filtering have smaller indices and less retrieval time than the bigram method. Both methods are more suitable for the very-large scale information retrieval.
author2 Guo-Wei Bian
author_facet Guo-Wei Bian
Kai-Wen Yang
楊凱雯
author Kai-Wen Yang
楊凱雯
spellingShingle Kai-Wen Yang
楊凱雯
Part-of-Speech Filtering Method in Information Retrieval
author_sort Kai-Wen Yang
title Part-of-Speech Filtering Method in Information Retrieval
title_short Part-of-Speech Filtering Method in Information Retrieval
title_full Part-of-Speech Filtering Method in Information Retrieval
title_fullStr Part-of-Speech Filtering Method in Information Retrieval
title_full_unstemmed Part-of-Speech Filtering Method in Information Retrieval
title_sort part-of-speech filtering method in information retrieval
publishDate 2009
url http://ndltd.ncl.edu.tw/handle/79541714918077090492
work_keys_str_mv AT kaiwenyang partofspeechfilteringmethodininformationretrieval
AT yángkǎiwén partofspeechfilteringmethodininformationretrieval
AT kaiwenyang lìyòngcíxìngguòlǜzhīzīxùnjiǎnsuǒjìshù
AT yángkǎiwén lìyòngcíxìngguòlǜzhīzīxùnjiǎnsuǒjìshù
_version_ 1718133180501852160