Skip to yearly menu bar Skip to main content


Poster

ASTRA: Communication-Efficient Acceleration for Multi-Device Transformer Inference

Xiao Liu ⋅ Lijun Zhang ⋅ Deepak Ganesan ⋅ Hui Guan

Abstract

Log in and register to view live content