Algorithms and Models for Collaborative Filtering from Large Information Corpora

Bibliographic Details
Main Author: Strunjas, Svetlana
Language:English
Published: University of Cincinnati / OhioLINK 2008
Subjects:
Online Access:http://rave.ohiolink.edu/etdc/view?acc_num=ucin1220001182
id ndltd-OhioLink-oai-etd.ohiolink.edu-ucin1220001182
record_format oai_dc
spelling ndltd-OhioLink-oai-etd.ohiolink.edu-ucin12200011822021-08-03T06:12:52Z Algorithms and Models for Collaborative Filtering from Large Information Corpora Strunjas, Svetlana Computer Science collaborative filtering collaborative partitioning clustering information retrieval <p>In this thesis we propose novel collaborative filtering approaches for large data sets. We also demonstrate how these collaborative approaches can be used for creating user recommendations for items, based upon preferences towards items that users demonstrated in the past.</p><p>We propose a framework, called a <i>collaborative partitioning</i> or CP for short, that is focused on finding a partition of a given set of items in order to maximize the number of partition-satisfied users. Two theoretical models for evaluating the quality of partitions are proposed. Both are introduced as bicriteria optimization problems with the percentage of satisfied users and the level of users satisfaction as the two optimization coefficients. As both of these bicriteria optimization problems are NP-hard, we propose Hierarchical Agglomerative Clustering - based approaches to compute approximations of their solutions. The results obtained by running the heuristic approaches on a real dataset show that the proposed approaches for CP have good results and find items partitions that are very close to a human-based genre partition for a given set. The genre partitions are partitions of items according to some human-created classifications. The results also show that the proposed heuristic approaches are a very good starting point in creating a top-k recommendation algorithms.</p><p>The second part of this thesis proposes a collaborative filtering framework for finding <i>seminal</i> and <i>seminally affected work</i> for sets of items. The concept of <i>seminal work</i> for a set of items is used to mark items released in the past that are highly correlated to some future sets of items in the terms of users preferences. Similarly, the <i>seminally affected work</i> is a concept that is used in this thesis to mark items that are highly correlated to some previously released (older) items in the terms of users preferences. In this approach, we translate item-item correlation into a correlation directed acyclic graph (DAG). Direction in the DAG is determined by a chronological ordering of items. We demonstrate and validate the proposed approach by applying it on the web-based system called MovieTrack. This system uses <i>seminal</i> and <i>seminally affected work</i> in movies to give movie recommendations to users. It is built by applying the previously proposed approach on a real data set of movie reviews released by Netflix.</p> 2008 English text University of Cincinnati / OhioLINK http://rave.ohiolink.edu/etdc/view?acc_num=ucin1220001182 http://rave.ohiolink.edu/etdc/view?acc_num=ucin1220001182 unrestricted This thesis or dissertation is protected by copyright: all rights reserved. It may not be copied or redistributed beyond the terms of applicable copyright laws.
collection NDLTD
language English
sources NDLTD
topic Computer Science
collaborative filtering
collaborative partitioning
clustering
information retrieval
spellingShingle Computer Science
collaborative filtering
collaborative partitioning
clustering
information retrieval
Strunjas, Svetlana
Algorithms and Models for Collaborative Filtering from Large Information Corpora
author Strunjas, Svetlana
author_facet Strunjas, Svetlana
author_sort Strunjas, Svetlana
title Algorithms and Models for Collaborative Filtering from Large Information Corpora
title_short Algorithms and Models for Collaborative Filtering from Large Information Corpora
title_full Algorithms and Models for Collaborative Filtering from Large Information Corpora
title_fullStr Algorithms and Models for Collaborative Filtering from Large Information Corpora
title_full_unstemmed Algorithms and Models for Collaborative Filtering from Large Information Corpora
title_sort algorithms and models for collaborative filtering from large information corpora
publisher University of Cincinnati / OhioLINK
publishDate 2008
url http://rave.ohiolink.edu/etdc/view?acc_num=ucin1220001182
work_keys_str_mv AT strunjassvetlana algorithmsandmodelsforcollaborativefilteringfromlargeinformationcorpora
_version_ 1719432872077557760