Skip to yearly menu bar Skip to main content


Poster

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

Shumin Wang ⋅ Yuexiang Xie ⋅ Wenhao Zhang ⋅ Yuchang Sun ⋅ Yanxi Chen ⋅ Yaliang Li ⋅ Yanyong Zhang

Abstract

Log in and register to view live content