Risk-aware multi-armed bandit problem with application to portfolio selection

Sequential portfolio selection has attracted increasing interest in the machine learning and quantitative finance communities in recent years. As a mathematical framework for reinforcement learning policies, the stochastic multi-armed bandit problem addresses the primary difficulty in sequential dec...

Full description

Bibliographic Details
Main Authors: Xiaoguang Huo, Feng Fu
Format: Article
Language:English
Published: The Royal Society 2017-01-01
Series:Royal Society Open Science
Subjects:
Online Access:https://royalsocietypublishing.org/doi/pdf/10.1098/rsos.171377