

Boosting Off-policy RL with Policy Representation and Policy-extended Value Function Approximator

Min Zhang ⋅ Jianye Hao ⋅ Hongyao Tang ⋅ Yan Zheng

Abstract
