Poster
in
Workshop: Next Generation of AI Safety

Deciphering the Definition of Adversarial Robustness for post-hoc OOD Detectors

Peter Lorenz · Mario Fernandez · Jens Müller · Ullrich Koethe

Keywords: OOD Adversarial Examples post-hoc detectors

Project Page [ OpenReview]

Abstract

Detecting out-of-distribution (OOD) inputs is critical for safely deploying deep learning models in real-world scenarios. In recent years, many OOD detectors have been developed, and even the benchmarking has been standardized, i.e. OpenOOD. The number of post-hoc detectors is growing fast and showing an option to protect a pre-trained classifier against natural distribution shifts, claiming to be ready for real-world scenarios. However, its efficacy in handling adversarial examples has been neglected in the majority of studies. This paper investigates the adversarial robustness of the 16 post-hoc detectors on several evasion attacks and discuss a roadmap towards adversarial defense in OOD detectors.

Video

Chat is not available.