Applying a Clustering Method to Improve the Users’ Search Performance in the Terrier IR Platform

碩士 === 輔仁大學 === 資訊管理學系碩士班 === 104 === The study aims to analyze users’ search performance and associated behaviors in a refined interface with the aid of the Terrier information retrieval (IR) platform. We used TREC-6 documents and topics provided by Text Retrieval Conference (TREC) as our evaluatio...

Full description

Bibliographic Details
Main Authors: CHAN, KAI-CHUN, 詹鎧駿
Other Authors: WU, I-CHIN
Format: Others
Language:zh-TW
Published: 2016
Online Access:http://ndltd.ncl.edu.tw/handle/75431649560537569031
id ndltd-TW-104FJU00396018
record_format oai_dc
spelling ndltd-TW-104FJU003960182017-09-03T04:25:17Z http://ndltd.ncl.edu.tw/handle/75431649560537569031 Applying a Clustering Method to Improve the Users’ Search Performance in the Terrier IR Platform 應用分群改善使用者於Terrier資訊檢索平台之 搜尋效益 CHAN, KAI-CHUN 詹鎧駿 碩士 輔仁大學 資訊管理學系碩士班 104 The study aims to analyze users’ search performance and associated behaviors in a refined interface with the aid of the Terrier information retrieval (IR) platform. We used TREC-6 documents and topics provided by Text Retrieval Conference (TREC) as our evaluation dataset. We adopted the language model (Hiemstra_LM) provided by the Terrier IR platform and tested different parameters of the model for retrieving, and then investigating, the effectiveness of the language model (Hiemstra_LM). Furthermore, we applied the LDA (Latent Dirichlet Allocation) clustering method to assist users to conduct search tasks. This research aims to know the effectiveness of the language model by evaluating the precision of the search results. In addition, it also aims to know if grouping documents by the clustering method can help users search efficiently. We recorded users’ search processes to analyze their behaviors. The evaluation results reveal that when the parameter λ (lambda) of the language model is set to 0.5, it can achieve the best retrieval results. In addition, the interface with the clustering function and term suggestions can improve users’ search performance in terms of precision metric. Moreover, although the difficulty of the tasks seems to influence users’ performance during the search process, those that used the interface with the clustering assistance achieved better performance compared to the users without the clustering function. Our preliminary evaluation results show that the clustering and term suggestion functions can improve the users’ search performance. We will conduct a large-scale experiment in the future to validate the results. WU, I-CHIN 吳怡瑾 2016 學位論文 ; thesis 71 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 輔仁大學 === 資訊管理學系碩士班 === 104 === The study aims to analyze users’ search performance and associated behaviors in a refined interface with the aid of the Terrier information retrieval (IR) platform. We used TREC-6 documents and topics provided by Text Retrieval Conference (TREC) as our evaluation dataset. We adopted the language model (Hiemstra_LM) provided by the Terrier IR platform and tested different parameters of the model for retrieving, and then investigating, the effectiveness of the language model (Hiemstra_LM). Furthermore, we applied the LDA (Latent Dirichlet Allocation) clustering method to assist users to conduct search tasks. This research aims to know the effectiveness of the language model by evaluating the precision of the search results. In addition, it also aims to know if grouping documents by the clustering method can help users search efficiently. We recorded users’ search processes to analyze their behaviors. The evaluation results reveal that when the parameter λ (lambda) of the language model is set to 0.5, it can achieve the best retrieval results. In addition, the interface with the clustering function and term suggestions can improve users’ search performance in terms of precision metric. Moreover, although the difficulty of the tasks seems to influence users’ performance during the search process, those that used the interface with the clustering assistance achieved better performance compared to the users without the clustering function. Our preliminary evaluation results show that the clustering and term suggestion functions can improve the users’ search performance. We will conduct a large-scale experiment in the future to validate the results.
author2 WU, I-CHIN
author_facet WU, I-CHIN
CHAN, KAI-CHUN
詹鎧駿
author CHAN, KAI-CHUN
詹鎧駿
spellingShingle CHAN, KAI-CHUN
詹鎧駿
Applying a Clustering Method to Improve the Users’ Search Performance in the Terrier IR Platform
author_sort CHAN, KAI-CHUN
title Applying a Clustering Method to Improve the Users’ Search Performance in the Terrier IR Platform
title_short Applying a Clustering Method to Improve the Users’ Search Performance in the Terrier IR Platform
title_full Applying a Clustering Method to Improve the Users’ Search Performance in the Terrier IR Platform
title_fullStr Applying a Clustering Method to Improve the Users’ Search Performance in the Terrier IR Platform
title_full_unstemmed Applying a Clustering Method to Improve the Users’ Search Performance in the Terrier IR Platform
title_sort applying a clustering method to improve the users’ search performance in the terrier ir platform
publishDate 2016
url http://ndltd.ncl.edu.tw/handle/75431649560537569031
work_keys_str_mv AT chankaichun applyingaclusteringmethodtoimprovetheuserssearchperformanceintheterrierirplatform
AT zhānkǎijùn applyingaclusteringmethodtoimprovetheuserssearchperformanceintheterrierirplatform
AT chankaichun yīngyòngfēnqúngǎishànshǐyòngzhěyúterrierzīxùnjiǎnsuǒpíngtáizhīsōuxúnxiàoyì
AT zhānkǎijùn yīngyòngfēnqúngǎishànshǐyòngzhěyúterrierzīxùnjiǎnsuǒpíngtáizhīsōuxúnxiàoyì
_version_ 1718525498392313856