Poster

TGPO: Efficient Policy Optimization through Sequence Anchor and Information Gating

Hang Ding ⋅ Dongqi Liu ⋅ Qiming Feng ⋅ Jian Li ⋅ Tong Lei ⋅ Jiafu Wu ⋅ Shuo Wang ⋅ Jiangning Zhang ⋅ Chengjie Wang ⋅ Yabiao Wang

Abstract
