Skip to yearly menu bar Skip to main content


Oral

Improving Transformers with Dynamically Composable Multi-Head Attention

Da Xiao ⋅ Qingye Meng ⋅ Shengping Li ⋅ xingyuan yuan
2024 Oral

Abstract

Video

Chat is not available.