Defending Against Prompt Injection with a Few DefensiveTokens

Sizhe Chen · Yizhu Wang · Nicholas Carlini · Chawin Sitawarin · David Wagner

Abstract
