Applying a Clustering Method to Improve the Users’ Search Performance in the Terrier IR Platform
碩士 === 輔仁大學 === 資訊管理學系碩士班 === 104 === The study aims to analyze users’ search performance and associated behaviors in a refined interface with the aid of the Terrier information retrieval (IR) platform. We used TREC-6 documents and topics provided by Text Retrieval Conference (TREC) as our evaluatio...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2016
|
Online Access: | http://ndltd.ncl.edu.tw/handle/75431649560537569031 |
id |
ndltd-TW-104FJU00396018 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-104FJU003960182017-09-03T04:25:17Z http://ndltd.ncl.edu.tw/handle/75431649560537569031 Applying a Clustering Method to Improve the Users’ Search Performance in the Terrier IR Platform 應用分群改善使用者於Terrier資訊檢索平台之 搜尋效益 CHAN, KAI-CHUN 詹鎧駿 碩士 輔仁大學 資訊管理學系碩士班 104 The study aims to analyze users’ search performance and associated behaviors in a refined interface with the aid of the Terrier information retrieval (IR) platform. We used TREC-6 documents and topics provided by Text Retrieval Conference (TREC) as our evaluation dataset. We adopted the language model (Hiemstra_LM) provided by the Terrier IR platform and tested different parameters of the model for retrieving, and then investigating, the effectiveness of the language model (Hiemstra_LM). Furthermore, we applied the LDA (Latent Dirichlet Allocation) clustering method to assist users to conduct search tasks. This research aims to know the effectiveness of the language model by evaluating the precision of the search results. In addition, it also aims to know if grouping documents by the clustering method can help users search efficiently. We recorded users’ search processes to analyze their behaviors. The evaluation results reveal that when the parameter λ (lambda) of the language model is set to 0.5, it can achieve the best retrieval results. In addition, the interface with the clustering function and term suggestions can improve users’ search performance in terms of precision metric. Moreover, although the difficulty of the tasks seems to influence users’ performance during the search process, those that used the interface with the clustering assistance achieved better performance compared to the users without the clustering function. Our preliminary evaluation results show that the clustering and term suggestion functions can improve the users’ search performance. We will conduct a large-scale experiment in the future to validate the results. WU, I-CHIN 吳怡瑾 2016 學位論文 ; thesis 71 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 輔仁大學 === 資訊管理學系碩士班 === 104 === The study aims to analyze users’ search performance and associated behaviors in a refined interface with the aid of the Terrier information retrieval (IR) platform. We used TREC-6 documents and topics provided by Text Retrieval Conference (TREC) as our evaluation dataset. We adopted the language model (Hiemstra_LM) provided by the Terrier IR platform and tested different parameters of the model for retrieving, and then investigating, the effectiveness of the language model (Hiemstra_LM). Furthermore, we applied the LDA (Latent Dirichlet Allocation) clustering method to assist users to conduct search tasks. This research aims to know the effectiveness of the language model by evaluating the precision of the search results. In addition, it also aims to know if grouping documents by the clustering method can help users search efficiently. We recorded users’ search processes to analyze their behaviors.
The evaluation results reveal that when the parameter λ (lambda) of the language model is set to 0.5, it can achieve the best retrieval results. In addition, the interface with the clustering function and term suggestions can improve users’ search performance in terms of precision metric. Moreover, although the difficulty of the tasks seems to influence users’ performance during the search process, those that used the interface with the clustering assistance achieved better performance compared to the users without the clustering function. Our preliminary evaluation results show that the clustering and term suggestion functions can improve the users’ search performance. We will conduct a large-scale experiment in the future to validate the results.
|
author2 |
WU, I-CHIN |
author_facet |
WU, I-CHIN CHAN, KAI-CHUN 詹鎧駿 |
author |
CHAN, KAI-CHUN 詹鎧駿 |
spellingShingle |
CHAN, KAI-CHUN 詹鎧駿 Applying a Clustering Method to Improve the Users’ Search Performance in the Terrier IR Platform |
author_sort |
CHAN, KAI-CHUN |
title |
Applying a Clustering Method to Improve the Users’ Search Performance in the Terrier IR Platform |
title_short |
Applying a Clustering Method to Improve the Users’ Search Performance in the Terrier IR Platform |
title_full |
Applying a Clustering Method to Improve the Users’ Search Performance in the Terrier IR Platform |
title_fullStr |
Applying a Clustering Method to Improve the Users’ Search Performance in the Terrier IR Platform |
title_full_unstemmed |
Applying a Clustering Method to Improve the Users’ Search Performance in the Terrier IR Platform |
title_sort |
applying a clustering method to improve the users’ search performance in the terrier ir platform |
publishDate |
2016 |
url |
http://ndltd.ncl.edu.tw/handle/75431649560537569031 |
work_keys_str_mv |
AT chankaichun applyingaclusteringmethodtoimprovetheuserssearchperformanceintheterrierirplatform AT zhānkǎijùn applyingaclusteringmethodtoimprovetheuserssearchperformanceintheterrierirplatform AT chankaichun yīngyòngfēnqúngǎishànshǐyòngzhěyúterrierzīxùnjiǎnsuǒpíngtáizhīsōuxúnxiàoyì AT zhānkǎijùn yīngyòngfēnqúngǎishànshǐyòngzhěyúterrierzīxùnjiǎnsuǒpíngtáizhīsōuxúnxiàoyì |
_version_ |
1718525498392313856 |