Skip to yearly menu bar Skip to main content


Offloaded Reasoning: Efficient Inference for Large Language Models via Modular Reasoning and Refinement

Ishan Jindal ⋅ Jayant Taneja ⋅ Badrinath chandana ⋅ Vikas Kapur ⋅ SACHIN SHARMA

Abstract

Video

Chat is not available.