WINA: Weight Informed Neuron Activation for Accelerating Large Language Model Inference

Sihan Chen ⋅ Dan Zhao ⋅ Jongwoo Ko ⋅ Colby Banbury ⋅ Huiping Zhuang ⋅ Luming Liang ⋅ Tianyi Chen

Abstract
