Skip to yearly menu bar Skip to main content


Poster
in
Workshop: RLxF: RL from World Feedback

On-Policy Self-Distillation via Prompt Optimization

Jongho Park ⋅ Donghyun Lee ⋅ Matei Zaharia ⋅ Jason Lee

Abstract

Log in and register to view live content