Regret bounds for Gaussian process bandits without observation noise

This thesis presents some statistical refinements of the bandits approach presented in [11] in the situation where there is no observation noise. We give an improved bound on the cumulative regret of the samples chosen by an algorithm that is related (though not identical) to the UCB algorithm of [1...

Full description

Bibliographic Details
Main Author: Zoghi, Masrour
Language:English
Published: University of British Columbia 2012
Online Access:http://hdl.handle.net/2429/42865
id ndltd-UBC-oai-circle.library.ubc.ca-2429-42865
record_format oai_dc
spelling ndltd-UBC-oai-circle.library.ubc.ca-2429-428652018-01-05T17:25:59Z Regret bounds for Gaussian process bandits without observation noise Zoghi, Masrour This thesis presents some statistical refinements of the bandits approach presented in [11] in the situation where there is no observation noise. We give an improved bound on the cumulative regret of the samples chosen by an algorithm that is related (though not identical) to the UCB algorithm of [11] in a complementary setting. Given a function f on a domain D ⊆ R^d , sampled from a Gaussian process with an anisotropic kernel that is four times differentiable at 0, and a lattice L ⊆ D, we show that if the points in L are chosen for sampling using our branch-and-bound algorithm, the regret asymptotically decreases according to O(e^{τt/(ln t)^{d/4}}) with high probability, where t is the number of observations carried out so far and τ is a constant that depends on the objective function. Science, Faculty of Computer Science, Department of Graduate 2012-08-02T23:43:08Z 2012-08-02T23:43:08Z 2012 2012-11 Text Thesis/Dissertation http://hdl.handle.net/2429/42865 eng Attribution-NonCommercial-ShareAlike 3.0 Unported http://creativecommons.org/licenses/by-nc-sa/3.0/ University of British Columbia
collection NDLTD
language English
sources NDLTD
description This thesis presents some statistical refinements of the bandits approach presented in [11] in the situation where there is no observation noise. We give an improved bound on the cumulative regret of the samples chosen by an algorithm that is related (though not identical) to the UCB algorithm of [11] in a complementary setting. Given a function f on a domain D ⊆ R^d , sampled from a Gaussian process with an anisotropic kernel that is four times differentiable at 0, and a lattice L ⊆ D, we show that if the points in L are chosen for sampling using our branch-and-bound algorithm, the regret asymptotically decreases according to O(e^{τt/(ln t)^{d/4}}) with high probability, where t is the number of observations carried out so far and τ is a constant that depends on the objective function. === Science, Faculty of === Computer Science, Department of === Graduate
author Zoghi, Masrour
spellingShingle Zoghi, Masrour
Regret bounds for Gaussian process bandits without observation noise
author_facet Zoghi, Masrour
author_sort Zoghi, Masrour
title Regret bounds for Gaussian process bandits without observation noise
title_short Regret bounds for Gaussian process bandits without observation noise
title_full Regret bounds for Gaussian process bandits without observation noise
title_fullStr Regret bounds for Gaussian process bandits without observation noise
title_full_unstemmed Regret bounds for Gaussian process bandits without observation noise
title_sort regret bounds for gaussian process bandits without observation noise
publisher University of British Columbia
publishDate 2012
url http://hdl.handle.net/2429/42865
work_keys_str_mv AT zoghimasrour regretboundsforgaussianprocessbanditswithoutobservationnoise
_version_ 1718583415476846592