Regret bounds for Gaussian process bandits without observation noise
This thesis presents some statistical refinements of the bandits approach presented in [11] in the situation where there is no observation noise. We give an improved bound on the cumulative regret of the samples chosen by an algorithm that is related (though not identical) to the UCB algorithm of [1...
Main Author: | |
---|---|
Language: | English |
Published: |
University of British Columbia
2012
|
Online Access: | http://hdl.handle.net/2429/42865 |
id |
ndltd-UBC-oai-circle.library.ubc.ca-2429-42865 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-UBC-oai-circle.library.ubc.ca-2429-428652018-01-05T17:25:59Z Regret bounds for Gaussian process bandits without observation noise Zoghi, Masrour This thesis presents some statistical refinements of the bandits approach presented in [11] in the situation where there is no observation noise. We give an improved bound on the cumulative regret of the samples chosen by an algorithm that is related (though not identical) to the UCB algorithm of [11] in a complementary setting. Given a function f on a domain D ⊆ R^d , sampled from a Gaussian process with an anisotropic kernel that is four times differentiable at 0, and a lattice L ⊆ D, we show that if the points in L are chosen for sampling using our branch-and-bound algorithm, the regret asymptotically decreases according to O(e^{τt/(ln t)^{d/4}}) with high probability, where t is the number of observations carried out so far and τ is a constant that depends on the objective function. Science, Faculty of Computer Science, Department of Graduate 2012-08-02T23:43:08Z 2012-08-02T23:43:08Z 2012 2012-11 Text Thesis/Dissertation http://hdl.handle.net/2429/42865 eng Attribution-NonCommercial-ShareAlike 3.0 Unported http://creativecommons.org/licenses/by-nc-sa/3.0/ University of British Columbia |
collection |
NDLTD |
language |
English |
sources |
NDLTD |
description |
This thesis presents some statistical refinements of the bandits approach presented in [11] in the situation where there is no observation noise. We give an improved bound on the cumulative regret of the samples chosen by an algorithm that is related (though not identical) to the UCB algorithm of [11] in a complementary setting. Given a function f on a domain D ⊆ R^d , sampled from a Gaussian process with an anisotropic kernel that is four times differentiable at 0, and a lattice L ⊆ D, we show that if the points in L are chosen for sampling using our branch-and-bound algorithm, the regret asymptotically decreases according to O(e^{τt/(ln t)^{d/4}}) with high probability,
where t is the number of observations carried out so far and τ is a constant that depends on the objective function. === Science, Faculty of === Computer Science, Department of === Graduate |
author |
Zoghi, Masrour |
spellingShingle |
Zoghi, Masrour Regret bounds for Gaussian process bandits without observation noise |
author_facet |
Zoghi, Masrour |
author_sort |
Zoghi, Masrour |
title |
Regret bounds for Gaussian process bandits without observation noise |
title_short |
Regret bounds for Gaussian process bandits without observation noise |
title_full |
Regret bounds for Gaussian process bandits without observation noise |
title_fullStr |
Regret bounds for Gaussian process bandits without observation noise |
title_full_unstemmed |
Regret bounds for Gaussian process bandits without observation noise |
title_sort |
regret bounds for gaussian process bandits without observation noise |
publisher |
University of British Columbia |
publishDate |
2012 |
url |
http://hdl.handle.net/2429/42865 |
work_keys_str_mv |
AT zoghimasrour regretboundsforgaussianprocessbanditswithoutobservationnoise |
_version_ |
1718583415476846592 |