Skip to yearly menu bar Skip to main content


Scavenging Hyena: Distilling Transformers into Long Convolution Models

Tokiniaina Ralambomihanta ⋅ Shahrad Mohammadzadeh ⋅ Mohammad Sami Nur Islam ⋅ Wassim Jabbour ⋅ Laurence Liang

Abstract

Chat is not available.