Skip to yearly menu bar Skip to main content


Poster

Gram2Token: Enabling Run-time GPU-Native Grammar-Constrained Decoding for LLMs

Hantao Hua ⋅ Jiming Su ⋅ hao tang ⋅ Yiping Yao ⋅ Feng Zhu

Abstract

Log in and register to view live content