Skip to yearly menu bar Skip to main content


Oral
in
Workshop: Tiny Titans: The next wave of On-Device Learning for Foundation Models (TTODLer-FM)
Fri, Jul 18, 2025 • 2:15 PM – 2:30 PM PDT

Offloaded Reasoning: Efficient Inference for Large Language Models via Modular Reasoning and Refinement

Ishan Jindal · Jayant Taneja · Badrinath chandana · Vikas Kapur · SACHIN SHARMA

Abstract

Video

Chat is not available.