Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Next Generation of AI Safety

Can Language Models Safeguard Themselves, Instantly and For Free?

Dyah Adila · Changho Shin · Yijing Zhang · Frederic Sala

Abstract

Video

Chat is not available.