Skip to yearly menu bar Skip to main content


Poster

Demystifying Entropy Control in LLM RL Training: Theoretical Analysis and Dynamic Scheduling

Jingchu Gai ⋅ Guanning Zeng ⋅ Huaqing Zhang ⋅ Han Zhong ⋅ Yige Hong ⋅ Andrej Risteski ⋅ Aditi Raghunathan

Abstract

Log in and register to view live content