Learning from interaction : models and applications

A large proportion of Machine Learning (ML) research focuses on designing algorithms that require minimal input from the human. However, ML algo- rithms are now widely used in various areas of engineering to design and build systems that interact with the human user and thus need to “learn” from thi...

Full description

Bibliographic Details
Main Author:	Glowacka, D.
Published:	University College London (University of London) 2012
Subjects:	004
Online Access:	http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.568291

id	ndltd-bl.uk-oai-ethos.bl.uk-568291
record_format	oai_dc
spelling	ndltd-bl.uk-oai-ethos.bl.uk-5682912015-12-03T03:29:38ZLearning from interaction : models and applicationsGlowacka, D.2012A large proportion of Machine Learning (ML) research focuses on designing algorithms that require minimal input from the human. However, ML algo- rithms are now widely used in various areas of engineering to design and build systems that interact with the human user and thus need to “learn” from this interaction. In this work, we concentrate on algorithms that learn from user interaction. A significant part of the dissertation is devoted to learning in the bandit setting. We propose a general framework for handling dependencies across arms, based on the new assumption that the mean-reward function is drawn from a Gaussian Process. Additionally, we propose an alternative method for arm selection using Thompson sampling and we apply the new algorithms to a grammar learning problem. In the remainder of the dissertation, we consider content-based image re- trieval in the case when the user is unable to specify the required content through tags or other image properties and so the system must extract infor- mation from the user through limited feedback. We present a novel Bayesian approach that uses latent random variables to model the systems imperfect knowledge about the users expected response to the images. An impor- tant aspect of the algorithm is the incorporation of an explicit exploration- exploitation strategy in the image sampling process. A second aspect of our algorithm is the way in which its knowledge of the target image is updated given user feedback. We considered a few algorithms to do so: variational Bayes, Gibbs sampling and a simple uniform update. We show in experi- ments that the simple uniform update performs best. The reason is because, unlike the uniform update, both variational Bayes and Gibbs sampling tend to focus on a small set of images aggressively.004University College London (University of London)http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.568291http://discovery.ucl.ac.uk/1369566/Electronic Thesis or Dissertation
collection	NDLTD
sources	NDLTD
topic	004
spellingShingle	004 Glowacka, D. Learning from interaction : models and applications
description	A large proportion of Machine Learning (ML) research focuses on designing algorithms that require minimal input from the human. However, ML algo- rithms are now widely used in various areas of engineering to design and build systems that interact with the human user and thus need to “learn” from this interaction. In this work, we concentrate on algorithms that learn from user interaction. A significant part of the dissertation is devoted to learning in the bandit setting. We propose a general framework for handling dependencies across arms, based on the new assumption that the mean-reward function is drawn from a Gaussian Process. Additionally, we propose an alternative method for arm selection using Thompson sampling and we apply the new algorithms to a grammar learning problem. In the remainder of the dissertation, we consider content-based image re- trieval in the case when the user is unable to specify the required content through tags or other image properties and so the system must extract infor- mation from the user through limited feedback. We present a novel Bayesian approach that uses latent random variables to model the systems imperfect knowledge about the users expected response to the images. An impor- tant aspect of the algorithm is the incorporation of an explicit exploration- exploitation strategy in the image sampling process. A second aspect of our algorithm is the way in which its knowledge of the target image is updated given user feedback. We considered a few algorithms to do so: variational Bayes, Gibbs sampling and a simple uniform update. We show in experi- ments that the simple uniform update performs best. The reason is because, unlike the uniform update, both variational Bayes and Gibbs sampling tend to focus on a small set of images aggressively.
author	Glowacka, D.
author_facet	Glowacka, D.
author_sort	Glowacka, D.
title	Learning from interaction : models and applications
title_short	Learning from interaction : models and applications
title_full	Learning from interaction : models and applications
title_fullStr	Learning from interaction : models and applications
title_full_unstemmed	Learning from interaction : models and applications
title_sort	learning from interaction : models and applications
publisher	University College London (University of London)
publishDate	2012
url	http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.568291
work_keys_str_mv	AT glowackad learningfrominteractionmodelsandapplications
_version_	1718141644026413056

Learning from interaction : models and applications

Similar Items