Quantifying and Improving Sales Diversity in Recommender Systems

Collaborative filtering approaches have produced some of the most accurate and personalized recommender systems to date by mining for similarities in large-scale datasets. However, despite their stellar performance in accuracy based metrics, researchers have demonstrated a propensity by such algorit...

Full description

Bibliographic Details
Main Author:	Antikacioglu, Arda
Format:	Others
Published:	Research Showcase @ CMU 2017
Online Access:	http://repository.cmu.edu/dissertations/959 http://repository.cmu.edu/cgi/viewcontent.cgi?article=1998&context=dissertations

id	ndltd-cmu.edu-oai-repository.cmu.edu-dissertations-1998
record_format	oai_dc
spelling	ndltd-cmu.edu-oai-repository.cmu.edu-dissertations-19982017-07-28T03:24:06Z Quantifying and Improving Sales Diversity in Recommender Systems Antikacioglu, Arda Collaborative filtering approaches have produced some of the most accurate and personalized recommender systems to date by mining for similarities in large-scale datasets. However, despite their stellar performance in accuracy based metrics, researchers have demonstrated a propensity by such algorithms to exaggerate the biases inherent in the data such as popularity or the affinity of users to certain kinds of content. Meanwhile, recommender systems have only grown in importance and have become an integral part of the internet ecosystem, with many users interacting with many recommender systems daily on e-commerce sites, social networks and apps. Therefore, the biases in recommender systems have come to critically impact a company’s bottom line, user satisfaction levels and public image, making it an imperative to develop recommendation diversification methods to explicitly counteract them. In this thesis we make three key contributions to the growing field of sales diversity, which aims to reduce popularity biases inherent in many collaborative filtering based recommender systems. First, we consider the problem of making item-item recommendations, with the goal of redundantly linking from popular items to less popular items in order to bring them more exposure on the web. Next, we consider to the setting of user-item recommendations, and develop a metric we call “discrepancy” to measure the distance between the recommendation distribution desired by a business and the distribution obtained by the recommender system, and develop algorithms to reduce discrepancy while maintaining high recommendation quality. Lastly, we turn our attention to item catalogs and user bases where items and users are clustered into disjoint or overlapping subgroups, and develop metrics to quantify the recommendation diversity experienced both by the users and the items. Our approaches to all three of these problems are unified under a framework of subgraph selection, the use of network flow problems for modeling, and a focus on providing either exact polynomial algorithms or efficient approximation algorithms with concrete performance guarantees. This stands in contrast with existing approaches, most of which are reranking based heuristics for which no performance guarantees can be given. In each of these settings, we augment our theoretical findings with an empirical evaluation on real life datasets from online retailers or standard recommender system datasets provided by Netflix and the MovieLens group, and show that our methods provide superior sales diversity value when compared with competing approaches. 2017-05-01T07:00:00Z text application/pdf http://repository.cmu.edu/dissertations/959 http://repository.cmu.edu/cgi/viewcontent.cgi?article=1998&context=dissertations Dissertations Research Showcase @ CMU
collection	NDLTD
format	Others
sources	NDLTD
description	Collaborative filtering approaches have produced some of the most accurate and personalized recommender systems to date by mining for similarities in large-scale datasets. However, despite their stellar performance in accuracy based metrics, researchers have demonstrated a propensity by such algorithms to exaggerate the biases inherent in the data such as popularity or the affinity of users to certain kinds of content. Meanwhile, recommender systems have only grown in importance and have become an integral part of the internet ecosystem, with many users interacting with many recommender systems daily on e-commerce sites, social networks and apps. Therefore, the biases in recommender systems have come to critically impact a company’s bottom line, user satisfaction levels and public image, making it an imperative to develop recommendation diversification methods to explicitly counteract them. In this thesis we make three key contributions to the growing field of sales diversity, which aims to reduce popularity biases inherent in many collaborative filtering based recommender systems. First, we consider the problem of making item-item recommendations, with the goal of redundantly linking from popular items to less popular items in order to bring them more exposure on the web. Next, we consider to the setting of user-item recommendations, and develop a metric we call “discrepancy” to measure the distance between the recommendation distribution desired by a business and the distribution obtained by the recommender system, and develop algorithms to reduce discrepancy while maintaining high recommendation quality. Lastly, we turn our attention to item catalogs and user bases where items and users are clustered into disjoint or overlapping subgroups, and develop metrics to quantify the recommendation diversity experienced both by the users and the items. Our approaches to all three of these problems are unified under a framework of subgraph selection, the use of network flow problems for modeling, and a focus on providing either exact polynomial algorithms or efficient approximation algorithms with concrete performance guarantees. This stands in contrast with existing approaches, most of which are reranking based heuristics for which no performance guarantees can be given. In each of these settings, we augment our theoretical findings with an empirical evaluation on real life datasets from online retailers or standard recommender system datasets provided by Netflix and the MovieLens group, and show that our methods provide superior sales diversity value when compared with competing approaches.
author	Antikacioglu, Arda
spellingShingle	Antikacioglu, Arda Quantifying and Improving Sales Diversity in Recommender Systems
author_facet	Antikacioglu, Arda
author_sort	Antikacioglu, Arda
title	Quantifying and Improving Sales Diversity in Recommender Systems
title_short	Quantifying and Improving Sales Diversity in Recommender Systems
title_full	Quantifying and Improving Sales Diversity in Recommender Systems
title_fullStr	Quantifying and Improving Sales Diversity in Recommender Systems
title_full_unstemmed	Quantifying and Improving Sales Diversity in Recommender Systems
title_sort	quantifying and improving sales diversity in recommender systems
publisher	Research Showcase @ CMU
publishDate	2017
url	http://repository.cmu.edu/dissertations/959 http://repository.cmu.edu/cgi/viewcontent.cgi?article=1998&context=dissertations
work_keys_str_mv	AT antikaciogluarda quantifyingandimprovingsalesdiversityinrecommendersystems
_version_	1718507619906224128

Quantifying and Improving Sales Diversity in Recommender Systems

Similar Items