Skip to yearly menu bar Skip to main content


Poster

Transformers with RL or SFT Provably Learn Sparse Boolean Functions, But Differently

Bochen Lyu ⋅ Yiyang Jia ⋅ Xiaohao Cai ⋅ Zhanxing Zhu

Abstract

Log in and register to view live content