Beyond UCB: Optimal and Efficient Contextual Bandits with Regression Oracles

Beyond UCB: Optimal and Efficient Contextual Bandits with Regression Oracles

A fundamental challenge in contextual bandits is to develop flexible, general-purpose algorithms with computational requirements no worse than classical supervised learning tasks such as classification and regression. Algorithms based on regression have shown promising empirical success, but theoret...

Full description

Bibliographic Details
Main Authors:	Foster, Dylan J (Author), Rakhlin, Alexander (Author)
Format:	Article
Language:	English
Published:	2021-12-03T15:09:50Z.
Subjects:	Article
Online Access:	Get fulltext

Similar Items

Contextual Bandit Learning With Reward Oracles and Sampling Guidance in Multi-Agent Environments
by: Mike Li, et al.
Published: (2021-01-01)

Bayesian Contextual Bandits for Hyper Parameter Optimization
by: Guoxin Sui, et al.
Published: (2020-01-01)

Contextual bandits with cross-learning
by: Balseiro, Santiago, et al.
Published: (2021)

Contextual bandits with cross-learning
Published: (2021)

Non-linear contextual bandits
by: Chia, John
Published: (2012)

Non-linear contextual bandits
by: Chia, John
Published: (2012)

Non-linear contextual bandits
by: Chia, John
Published: (2012)

Contributions to Multi-Armed Bandits : Risk-Awareness and Sub-Sampling for Linear Contextual Bandits
by: Galichet, Nicolas
Published: (2015)

Study on Contextual Bandit Problem with Multiple Actions
by: Ya-Hsuan Chang, et al.
Published: (2013)

Pseudo-reward Algorithms for Linear Contextual Bandit Problems
by: Ku-Chun Chou, et al.
Published: (2013)

Constrained Contextual Bandit Learning for Adaptive Radar Waveform Selection
by: Buehrer, R.M, et al.
Published: (2022)

Contextual hierarchies in the Shang oracle-bone inscriptions
by: Fowler, Vernon Keith
Published: (2010)

Prerequisites for establishing a public human UCB SCB; assessment of public acceptance and resistance of UCB to HIV
by: Meissner-Roloff, Madelein
Published: (2013)

Data-driven evaluation of contextual bandit algorithms and applications to dynamic recommendation
by: Nicol, Olivier
Published: (2014)

UCB Library to preserve ag literature
Published: (1996-11-01)

StreamingBandit: Experimenting with Bandit Policies
by: Jules Kruijswijk, et al.
Published: (2020-08-01)

Bayesian sampling in contextual-bandit problems with extensions to unknown normal-form games
by: May, Benedict C.
Published: (2013)

Using Contextual Multi-Armed Bandit Algorithms for Recommending Investment in Stock Market
by: Zhi-hua Chien, et al.
Published: (2016)

Efficient Online Learning with Bandit Feedback
by: Liu, Fang
Published: (2020)

Weighted quantile regression and oracle model selection.
Published: (2009)

Grandpa Bandit
by: Dunn, Jasmine D.
Published: (2012)

Online Combinatorial Optimization under Bandit Feedback
by: Talebi Mazraeh Shahi, Mohammad Sadegh
Published: (2016)

Optimization as estimation with Gaussian processes in bandit settings
by: Wang, Zi, Ph.D. Massachusetts Institute of Technology
Published: (2016)

Bandit feedback in Classification and Multi-objective Optimization
by: Zhong, Hongliang
Published: (2016)

Sharp oracle inequalities in aggregation and shape restricted regression
by: Bellec, Pierre C.
Published: (2016)

hMSCs from UCB: Isolation, Characterization and Determination of Osmotic Properties for Optimal Cryopreservation
by: E. Casula, et al.
Published: (2015-05-01)

Women in Herodotus’ Oracles. A Look beyond the Pythia
by: Carmen Sánchez-Mañas
Published: (2018-06-01)

UCB-Based Route and Power Selection Optimization for SDN-Enabled Industrial IoT in Smart Grid
by: Chen, X., et al.
Published: (2022)

A case series on UCB seroprevalance of Human T-lymphotropic virus
by: S. Seal, et al.
Published: (2020-12-01)

On collective bandit behaviour
by: Lecheval, Valentin
Published: (2014)

Structured Stochastic Bandits
by: Magureanu, Stefan
Published: (2016)

Linearly parameterized bandits
by: Tsitsiklis, John N., et al.
Published: (2012)

Batched Bandit Problems
by: Perchet, Vianney, et al.
Published: (2015)

Architects, Bandits and Knights
by: Konstantin Lidin
Published: (2006-03-01)

Bandit processes with covariates
by: Liang, You.
Published: (2013)

Bandit processes with covariates
by: Liang, You.
Published: (2013)

Beyond the brotherhood: Skoal Bandits’ role in the evolution of marketing moist smokeless tobacco pouches
by: Yogi H. Hendlin, et al.
Published: (2017-12-01)

Honorable Bandit A Walk across Corsica
Published: (2007)

Oracle Multimaster Replication Maintance Optimization
by: Hakik Paci, et al.
Published: (2011-01-01)

Convex optimization using quantum oracles
by: Joran van Apeldoorn, et al.
Published: (2020-01-01)