Skip to yearly menu bar Skip to main content


Poster
in
Workshop: 2nd Workshop on Test-Time Adaptation: Putting Updates to the Test (PUT)
Fri, Jul 18, 2025 • 11:15 AM – 12:00 PM PDT

Beyond Markovian: Reflective Exploration via Bayes-Adaptive RL for LLM Reasoning

Shenao Zhang · Yaqing Wang · Canoee Liu · Tianqi Liu · Peter Grabowski · Eugene Ie · Zhaoran Wang · Yunxuan Li

Abstract

Chat is not available.