Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning

Tengyang Xie ⋅ Nan Jiang ⋅ Huan Wang ⋅ Caiming Xiong ⋅ Yu Bai

Abstract
