Skip to yearly menu bar Skip to main content


Poster

Efficient Preference Poisoning Attack on Offline RLHF

Chenye Yang ⋅ Weiyu Xu ⋅ Lifeng Lai

Abstract

Log in and register to view live content