Cluster-based information retrieval modeling
Cluster-based information retrieval, an extension of information retrieval strategy, is based on the assumption that a document collection can be organized into a set of topics so that a user can enhance retrieval effectiveness. The cluster-based IR model assumes that queries can be associated with...
Main Author: | |
---|---|
Language: | English |
Published: |
2009
|
Online Access: | http://hdl.handle.net/2429/15435 |
id |
ndltd-LACETR-oai-collectionscanada.gc.ca-BVAU.2429-15435 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-LACETR-oai-collectionscanada.gc.ca-BVAU.2429-154352014-03-14T15:48:17Z Cluster-based information retrieval modeling Sze, Richard Cluster-based information retrieval, an extension of information retrieval strategy, is based on the assumption that a document collection can be organized into a set of topics so that a user can enhance retrieval effectiveness. The cluster-based IR model assumes that queries can be associated with clusters that contain high concentrations of relevant documents, and that such association can lead to gains in retrieval effectiveness. Earlier studies, however, have provided negative to mixed results for the performance of the model. Moreover, studies are lacking which investigate the potential of the model in situations where queries are manually associated with the appropriate clusters. The goal of this thesis is to provide evidence for the validity of the cluster-base IR model's effectiveness through conducting extensive empirical studies which explore alternative schemes of the model on a large scale and according to a well-accepted benchmark. Investigation shows that the cluster-based IR model has the potential to enhance retrieval effectiveness, and yet, alternative techniques fail to actually achieve enhanced effectiveness. 2009-11-21T20:26:56Z 2009-11-21T20:26:56Z 2004 2009-11-21T20:26:56Z 2004-05 Electronic Thesis or Dissertation http://hdl.handle.net/2429/15435 eng UBC Retrospective Theses Digitization Project [http://www.library.ubc.ca/archives/retro_theses/] |
collection |
NDLTD |
language |
English |
sources |
NDLTD |
description |
Cluster-based information retrieval, an extension of information retrieval strategy, is based on the assumption that a document collection can be organized into a set of topics so that a user can enhance retrieval effectiveness. The cluster-based IR model assumes that queries can be associated with clusters that contain high concentrations of relevant documents, and that such association can lead to gains in retrieval effectiveness. Earlier studies, however, have provided negative to mixed results for the performance of the model. Moreover, studies are lacking which investigate the potential of the model in situations where queries are manually associated with the appropriate clusters. The goal of this thesis is to provide evidence for the validity of the cluster-base IR model's effectiveness through conducting extensive empirical studies which explore alternative schemes of the model on a large scale and according to a well-accepted benchmark. Investigation shows that the cluster-based IR model has the potential to enhance retrieval effectiveness, and yet, alternative techniques fail to actually achieve enhanced effectiveness. |
author |
Sze, Richard |
spellingShingle |
Sze, Richard Cluster-based information retrieval modeling |
author_facet |
Sze, Richard |
author_sort |
Sze, Richard |
title |
Cluster-based information retrieval modeling |
title_short |
Cluster-based information retrieval modeling |
title_full |
Cluster-based information retrieval modeling |
title_fullStr |
Cluster-based information retrieval modeling |
title_full_unstemmed |
Cluster-based information retrieval modeling |
title_sort |
cluster-based information retrieval modeling |
publishDate |
2009 |
url |
http://hdl.handle.net/2429/15435 |
work_keys_str_mv |
AT szerichard clusterbasedinformationretrievalmodeling |
_version_ |
1716653291714117632 |