Skip to yearly menu bar Skip to main content


Boosting Off-policy RL with Policy Representation and Policy-extended Value Function Approximator

Min Zhang · Jianye Hao · Hongyao Tang · Yan Zheng

Abstract

Chat is not available.