Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Pluralistic Alignment Workshop

Helpful or Safe? UltraFeedback's Binarized Labels Encode a Value Tradeoff

Jingyi Zhang

Abstract

Log in and register to view live content