Chasing Moving Targets with Online Self-Play Reinforcement Learning for Safer Language Models

Mickel Liu · Liwei Jiang · Yancheng Liang · Simon Du · Yejin Choi · Tim Althoff · Natasha Jaques

Abstract
