Skip to yearly menu bar Skip to main content


Poster

Reinforcement Fine-Tuning Naturally Mitigates Forgetting in Continual Post-Training

LAI Song ⋅ Haohan Zhao ⋅ Rong Feng ⋅ Changyi Ma ⋅ Wenzhuo Liu ⋅ Hongbo Zhao ⋅ Xi Lin ⋅ Dong Yi ⋅ Qingfu Zhang ⋅ Hongbin Liu ⋅ Gaofeng Meng ⋅ Fei Zhu

Abstract

Log in and register to view live content