Beyond UCB: Optimal and Efficient Contextual Bandits with Regression Oracles
A fundamental challenge in contextual bandits is to develop flexible, general-purpose algorithms with computational requirements no worse than classical supervised learning tasks such as classification and regression. Algorithms based on regression have shown promising empirical success, but theoret...
Main Authors: | Foster, Dylan J (Author), Rakhlin, Alexander (Author) |
---|---|
Format: | Article |
Language: | English |
Published: |
2021-12-03T15:09:50Z.
|
Subjects: | |
Online Access: | Get fulltext |
Similar Items
-
Contextual Bandit Learning With Reward Oracles and Sampling Guidance in Multi-Agent Environments
by: Mike Li, et al.
Published: (2021-01-01) -
Bayesian Contextual Bandits for Hyper Parameter Optimization
by: Guoxin Sui, et al.
Published: (2020-01-01) -
Contextual bandits with cross-learning
by: Balseiro, Santiago, et al.
Published: (2021) -
Contextual bandits with cross-learning
Published: (2021) -
Non-linear contextual bandits
by: Chia, John
Published: (2012)