Skip to yearly menu bar Skip to main content


WINA: Weight Informed Neuron Activation for Accelerating Large Language Model Inference

Sihan Chen · Dan Zhao · Jongwoo Ko · Colby Banbury · HUIPING ZHUANG · Luming Liang · Tianyi Chen

Abstract

Chat is not available.