Multiparty dynamics and failure modes for machine learning and artificial intelligence

An important challenge for safety in machine learning and artificial intelligence systems is a set of related failures involving specification gaming, reward hacking, fragility to distributional shifts, and Goodhart’s or Campbell’s law. This paper presents additional failure modes for interactions w...

Full description

Bibliographic Details
Main Author: Manheim, D. (Author)
Format: Article
Language:English
Published: MDPI AG 2019
Subjects:
Online Access:View Fulltext in Publisher