Skip to yearly menu bar Skip to main content


How Do Nonlinear Transformers Acquire Generalization-Guaranteed CoT Ability?

Hongkang Li ⋅ Meng Wang ⋅ Songtao Lu ⋅ Xiaodong Cui ⋅ Pin-Yu Chen

Abstract

Chat is not available.