Oral in Workshop: ES-FoMo III: 3rd Workshop on Efficient Systems for Foundation Models
Sat, Jul 19, 2025 • 11:00 AM – 11:15 AM PDT

zip2zip: Inference-Time Adaptive Vocabularies for Language Models via Token Compression

Saibo Geng · Nathan Thomas Elian Ranchin · Yunzhen Yao · Maxime Peyrard · Chris Wendler · Michael Gastpar · Robert West
