Predicting optimal value functions by interpolating reward functions in scalarized multi-objective reinforcement learning

© 2020 IEEE. A common approach for defining a reward function for multi-objective reinforcement learning (MORL) problems is the weighted sum of the multiple objectives. The weights are then treated as design parameters dependent on the expertise (and preference) of the person performing the learning...

Full description

Bibliographic Details
Main Authors:	Kusari, Arpan (Author), How, Jonathan P. (Author)
Format:	Article
Language:	English
Published:	IEEE, 2021-10-28T15:57:58Z.
Subjects:	Article
Online Access:	Get fulltext

Internet

Get fulltext

Predicting optimal value functions by interpolating reward functions in scalarized multi-objective reinforcement learning

Internet

Similar Items