Offloaded Reasoning: Efficient Inference for Large Language Models via Modular Reasoning and Refinement

Ishan Jindal · Jayant Taneja · Badrinath chandana · Vikas Kapur · SACHIN SHARMA

Abstract
