Learning to Optimize Under Non-Stationarity

© 2019 by the author(s). We introduce algorithms that achieve state-of-the-art dynamic regret bounds for non-stationary linear stochastic bandit setting. It captures natural applications such as dynamic pricing and ads allocation in a changing environment. We show how the difficulty posed by the non...

Full description

Bibliographic Details
Main Authors: Cheung, Wang Chi (Author), Simchi-Levi, David (Author), Zhu, Ruihao (Author)
Format: Article
Language:English
Published: Elsevier BV, 2021-11-02T12:19:41Z.
Subjects:
Online Access:Get fulltext