Skip to yearly menu bar Skip to main content


Reinforcing Question Answering Agents with Minimalist Policy Gradient Optimization

Yihong Wu ⋅ Liheng Ma ⋅ Muzhi Li ⋅ Jiaming Zhou ⋅ Ho-fung Leung ⋅ Jianye Hao ⋅ Irwin King ⋅ Yingxue Zhang ⋅ Jian-Yun Nie

Abstract

Chat is not available.