Poster Wed, Jul 8, 2026 • 10:30 PM – 12:15 AM PDT HALL A #112

Recursive Monte-Carlo Tree Search

Benjamin Howard ⋅ Keith Frankston

Project Page

Abstract

We introduce a recursive AlphaZero style Monte--Carlo tree search algorithm, "RMCTS". It first generates the search tree using prior policies, and then recursively re-estimates action values by using the regularized optimal posterior policies from ``Monte--Carlo tree search as regularized policy optimization'' (Grill et al., 2020) at each node of the search tree, starting from the leaves and working back up to the root. We find that RMCTS matches or exceeds the quality of AlphaZero's MCTS-UCB in a tiny fraction of the time.