Multiparty dynamics and failure modes for machine learning and artificial intelligence

An important challenge for safety in machine learning and artificial intelligence systems is a set of related failures involving specification gaming, reward hacking, fragility to distributional shifts, and Goodhart’s or Campbell’s law. This paper presents additional failure modes for interactions w...

Bibliographic Details
Main Author:	Manheim, D. (Author)
Format:	Article
Language:	English
Published:	MDPI AG 2019
Subjects:	Artificial intelligence safety; Goodhart’s Law; Multi-agent systems; Specification gaming
Online Access:	View Fulltext in Publisher
