Bandit algorithms with graphical feedback models and privacy awareness
This thesis focuses on two classes of learning problems in stochastic multi-armed bandits (MAB): graphical bandits and private bandits. Different from the basic MAB setting where the learning algorithm can only have one observation,for a bandit problem under a graphical feedback model,...
Main Author: | Hu, Bingshan |
---|---|
Other Authors: | Mehta, Nishant A. |
Format: | Others |
Language: | English en |
Published: |
2021
|
Online Access: | http://hdl.handle.net/1828/13411 |
Similar Items
-
Efficient Online Learning with Bandit Feedback
by: Liu, Fang
Published: (2020) -
Multi-armed bandits with unconventional feedback
by: Gajane, Pratik
Published: (2017) -
Contributions to Multi-Armed Bandits : Risk-Awareness and Sub-Sampling for Linear Contextual Bandits
by: Galichet, Nicolas
Published: (2015) -
Bandit feedback in Classification and Multi-objective Optimization
by: Zhong, Hongliang
Published: (2016) -
Online Combinatorial Optimization under Bandit Feedback
by: Talebi Mazraeh Shahi, Mohammad Sadegh
Published: (2016)