Regret bounds for Gaussian process bandits without observation noise

This thesis presents some statistical refinements of the bandits approach presented in [11] in the situation where there is no observation noise. We give an improved bound on the cumulative regret of the samples chosen by an algorithm that is related (though not identical) to the UCB algorithm of [1...

Full description

Bibliographic Details
Main Author:	Zoghi, Masrour
Language:	English
Published:	University of British Columbia 2012
Online Access:	http://hdl.handle.net/2429/42865

id	ndltd-UBC-oai-circle.library.ubc.ca-2429-42865
record_format	oai_dc
spelling	ndltd-UBC-oai-circle.library.ubc.ca-2429-428652018-01-05T17:25:59Z Regret bounds for Gaussian process bandits without observation noise Zoghi, Masrour This thesis presents some statistical refinements of the bandits approach presented in [11] in the situation where there is no observation noise. We give an improved bound on the cumulative regret of the samples chosen by an algorithm that is related (though not identical) to the UCB algorithm of [11] in a complementary setting. Given a function f on a domain D ⊆ R^d , sampled from a Gaussian process with an anisotropic kernel that is four times differentiable at 0, and a lattice L ⊆ D, we show that if the points in L are chosen for sampling using our branch-and-bound algorithm, the regret asymptotically decreases according to O(e^{τt/(ln t)^{d/4}}) with high probability, where t is the number of observations carried out so far and τ is a constant that depends on the objective function. Science, Faculty of Computer Science, Department of Graduate 2012-08-02T23:43:08Z 2012-08-02T23:43:08Z 2012 2012-11 Text Thesis/Dissertation http://hdl.handle.net/2429/42865 eng Attribution-NonCommercial-ShareAlike 3.0 Unported http://creativecommons.org/licenses/by-nc-sa/3.0/ University of British Columbia
collection	NDLTD
language	English
sources	NDLTD
description	This thesis presents some statistical refinements of the bandits approach presented in [11] in the situation where there is no observation noise. We give an improved bound on the cumulative regret of the samples chosen by an algorithm that is related (though not identical) to the UCB algorithm of [11] in a complementary setting. Given a function f on a domain D ⊆ R^d , sampled from a Gaussian process with an anisotropic kernel that is four times differentiable at 0, and a lattice L ⊆ D, we show that if the points in L are chosen for sampling using our branch-and-bound algorithm, the regret asymptotically decreases according to O(e^{τt/(ln t)^{d/4}}) with high probability, where t is the number of observations carried out so far and τ is a constant that depends on the objective function. === Science, Faculty of === Computer Science, Department of === Graduate
author	Zoghi, Masrour
spellingShingle	Zoghi, Masrour Regret bounds for Gaussian process bandits without observation noise
author_facet	Zoghi, Masrour
author_sort	Zoghi, Masrour
title	Regret bounds for Gaussian process bandits without observation noise
title_short	Regret bounds for Gaussian process bandits without observation noise
title_full	Regret bounds for Gaussian process bandits without observation noise
title_fullStr	Regret bounds for Gaussian process bandits without observation noise
title_full_unstemmed	Regret bounds for Gaussian process bandits without observation noise
title_sort	regret bounds for gaussian process bandits without observation noise
publisher	University of British Columbia
publishDate	2012
url	http://hdl.handle.net/2429/42865
work_keys_str_mv	AT zoghimasrour regretboundsforgaussianprocessbanditswithoutobservationnoise
_version_	1718583415476846592

Regret bounds for Gaussian process bandits without observation noise

Similar Items