

Poster

What Reward Structure Enables Efficient Sparse-Reward RL? A Proof-of-Concept with Policy-Aware Matrix Completion

Ibne Farabi Shihab ⋅ Sanjeda Akter ⋅ Anuj Sharma

Abstract
