Algorithms and Models for Collaborative Filtering from Large Information Corpora
Main Author: | |
---|---|
Language: | English |
Published: |
University of Cincinnati / OhioLINK
2008
|
Subjects: | |
Online Access: | http://rave.ohiolink.edu/etdc/view?acc_num=ucin1220001182 |
id |
ndltd-OhioLink-oai-etd.ohiolink.edu-ucin1220001182 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-OhioLink-oai-etd.ohiolink.edu-ucin12200011822021-08-03T06:12:52Z Algorithms and Models for Collaborative Filtering from Large Information Corpora Strunjas, Svetlana Computer Science collaborative filtering collaborative partitioning clustering information retrieval <p>In this thesis we propose novel collaborative filtering approaches for large data sets. We also demonstrate how these collaborative approaches can be used for creating user recommendations for items, based upon preferences towards items that users demonstrated in the past.</p><p>We propose a framework, called a <i>collaborative partitioning</i> or CP for short, that is focused on finding a partition of a given set of items in order to maximize the number of partition-satisfied users. Two theoretical models for evaluating the quality of partitions are proposed. Both are introduced as bicriteria optimization problems with the percentage of satisfied users and the level of users satisfaction as the two optimization coefficients. As both of these bicriteria optimization problems are NP-hard, we propose Hierarchical Agglomerative Clustering - based approaches to compute approximations of their solutions. The results obtained by running the heuristic approaches on a real dataset show that the proposed approaches for CP have good results and find items partitions that are very close to a human-based genre partition for a given set. The genre partitions are partitions of items according to some human-created classifications. The results also show that the proposed heuristic approaches are a very good starting point in creating a top-k recommendation algorithms.</p><p>The second part of this thesis proposes a collaborative filtering framework for finding <i>seminal</i> and <i>seminally affected work</i> for sets of items. The concept of <i>seminal work</i> for a set of items is used to mark items released in the past that are highly correlated to some future sets of items in the terms of users preferences. Similarly, the <i>seminally affected work</i> is a concept that is used in this thesis to mark items that are highly correlated to some previously released (older) items in the terms of users preferences. In this approach, we translate item-item correlation into a correlation directed acyclic graph (DAG). Direction in the DAG is determined by a chronological ordering of items. We demonstrate and validate the proposed approach by applying it on the web-based system called MovieTrack. This system uses <i>seminal</i> and <i>seminally affected work</i> in movies to give movie recommendations to users. It is built by applying the previously proposed approach on a real data set of movie reviews released by Netflix.</p> 2008 English text University of Cincinnati / OhioLINK http://rave.ohiolink.edu/etdc/view?acc_num=ucin1220001182 http://rave.ohiolink.edu/etdc/view?acc_num=ucin1220001182 unrestricted This thesis or dissertation is protected by copyright: all rights reserved. It may not be copied or redistributed beyond the terms of applicable copyright laws. |
collection |
NDLTD |
language |
English |
sources |
NDLTD |
topic |
Computer Science collaborative filtering collaborative partitioning clustering information retrieval |
spellingShingle |
Computer Science collaborative filtering collaborative partitioning clustering information retrieval Strunjas, Svetlana Algorithms and Models for Collaborative Filtering from Large Information Corpora |
author |
Strunjas, Svetlana |
author_facet |
Strunjas, Svetlana |
author_sort |
Strunjas, Svetlana |
title |
Algorithms and Models for Collaborative Filtering from Large Information Corpora |
title_short |
Algorithms and Models for Collaborative Filtering from Large Information Corpora |
title_full |
Algorithms and Models for Collaborative Filtering from Large Information Corpora |
title_fullStr |
Algorithms and Models for Collaborative Filtering from Large Information Corpora |
title_full_unstemmed |
Algorithms and Models for Collaborative Filtering from Large Information Corpora |
title_sort |
algorithms and models for collaborative filtering from large information corpora |
publisher |
University of Cincinnati / OhioLINK |
publishDate |
2008 |
url |
http://rave.ohiolink.edu/etdc/view?acc_num=ucin1220001182 |
work_keys_str_mv |
AT strunjassvetlana algorithmsandmodelsforcollaborativefilteringfromlargeinformationcorpora |
_version_ |
1719432872077557760 |