Skip to yearly menu bar Skip to main content


Poster

Safety Alignment of LMs via Non-cooperative Games

Anselm Paulus ⋅ Ilia Kulikov ⋅ Brandon Amos ⋅ REMI MUNOS ⋅ Ivan Evtimov ⋅ Kamalika Chaudhuri ⋅ Arman Zharmagambetov

Abstract

Log in and register to view live content