Multiparty dynamics and failure modes for machine learning and artificial intelligence
An important challenge for safety in machine learning and artificial intelligence systems is a set of related failures involving specification gaming, reward hacking, fragility to distributional shifts, and Goodhart’s or Campbell’s law. This paper presents additional failure modes for interactions w...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2019
|
Subjects: | |
Online Access: | View Fulltext in Publisher |