Skip to yearly menu bar Skip to main content


Poster

Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles

Bhrij Patel ⋅ Wesley A. Suttle ⋅ Alec Koppel ⋅ Vaneet Aggarwal ⋅ Brian Sadler ⋅ Dinesh Manocha ⋅ Amrit Singh Bedi
2024 Poster

Abstract

Chat is not available.