Poster

Chasing Moving Targets with Online Self-Play Reinforcement Learning for Safer Language Models

Mickel Liu ⋅ Liwei Jiang ⋅ Yancheng Liang ⋅ Simon Du ⋅ Yejin Choi ⋅ Tim Althoff ⋅ Natasha Jaques

Abstract
