Skip to yearly menu bar Skip to main content


Poster Tue, Jul 15, 2025 • 4:30 PM – 7:00 PM PDT

Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search

Maohao Shen · Guangtao Zeng · Zhenting Qi · Zhang-Wei Hong · Zhenfang Chen · Wei Lu · Gregory Wornell · Subhro Das · David Cox · Chuang Gan

Abstract

Lay Summary

Video

Chat is not available.