Skip to yearly menu bar Skip to main content


Poster

Policy Filtration for RLHF to Mitigate Noise in Reward Models

Chuheng Zhang · Wei Shen · Li Zhao · Xuyun Zhang · Xiaolong Xu · Wanchun Dou · Jiang Bian
2025 Poster

Abstract

Lay Summary

Video

Chat is not available.