firstbacksecondback
4 Results
Workshop
|
RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback Yannick Metz · David Lindner · Raphaël Baur · Daniel Keim · Mennatallah El-Assady |
||
Workshop
|
Fri 19:00 |
Is RLHF More Difficult than Standard RL? Chi Jin |
|
Workshop
|
Sat 16:40 |
Beyond RLHF: A Human-Centered Approach to AI Development and Evaluation by Meredith Ringel Morris |
|
Workshop
|
Training Diffusion Models with Reinforcement Learning Kevin Black · Michael Janner · Yilun Du · Ilya Kostrikov · Sergey Levine |