

Poster

AReaL-DTA: Dynamic Tree Attention for Efficient Reinforcement Learning of Large Language Models

Jiarui Zhang ⋅ Yuchen Yang ⋅ Ran Yan ⋅ Zhiyu Mei ⋅ Liyuan Zhang ⋅ LiDaifeng ⋅ Wei Fu ⋅ Jiaxuan Gao ⋅ Shusheng Xu ⋅ Yi Wu ⋅ Binhang Yuan

Abstract
