Skip to yearly menu bar Skip to main content


MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning

Chanwoo Park · Seungju Han · Xingzhi Guo · Asuman Ozdaglar · Kaiqing Zhang · Joo-Kyung Kim

Abstract

Video

Chat is not available.