

Simple linear attention language models balance the recall-throughput tradeoff

Simran Arora · Sabri Eyuboglu · Michael Zhang · Aman Timalsina · Silas Alberti · Dylan Zinsley · James Zou · Atri Rudra · Christopher Re

Abstract
