Exploring the landscape of spatial robustness

Copyright 2019 by the author(s). The study of adversarial robustness has so far largely focused on perturbations bound in lvnorms. However, state-of-the-art models turn out to be also vulnerable to other, more natural classes of perturbations such as translations and rotations. In this work, we thor...

Full description

Bibliographic Details
Main Authors: Engstrom, Logan G. (Author), Tran, Brandon (Author), Tsipras, Dimitris (Author), Schmidt, Ludwig (Author), Madry, Aleksander (Author)
Other Authors: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science (Contributor)
Format: Article
Language:English
Published: MLResearch Press, 2021-04-06T15:52:40Z.
Subjects:
Online Access:Get fulltext
Description
Summary:Copyright 2019 by the author(s). The study of adversarial robustness has so far largely focused on perturbations bound in lvnorms. However, state-of-the-art models turn out to be also vulnerable to other, more natural classes of perturbations such as translations and rotations. In this work, we thoroughly investigate the vulnerability of neural network-based classifiers to rotations and translations. While data augmentation offers relatively small robustness, we use ideas from robust optimization and test-time input aggregation to significantly improve robustness. Finally we find that, in contrast to the ip-norm case, first-order methods cannot reliably find worst-case perturbations. This highlights spatial robustness as a fundamentally different setting requiring additional study.
NSF (Grants CCF-1553428, CNS-1413920, CCF-1553428 and CNS-1815221)