

Poster

Grounding Multi-Hop Reasoning in Structural Causal Models via Group Relative Policy Optimization

Yunhan Bu ⋅ quan zhang ⋅ Zhang Huaping ⋅ Guotong Geng ⋅ Chunxiao Gao ⋅ Askar Hamdulla ⋅ Juan Wang ⋅ Qiuchi Li ⋅ Baohua Zhang ⋅ Yunbo Cao ⋅ Zhunchen Luo ⋅ Shuai Lei

Abstract
