Skip to yearly menu bar Skip to main content


Scavenging Hyena: Distilling Transformers into Long Convolution Models

Tokiniaina Ralambomihanta · Shahrad Mohammadzadeh · Mohammad Sami Nur Islam · Wassim Jabbour · Laurence Liang

Abstract

Chat is not available.