Skip to yearly menu bar Skip to main content


Poster

Reward Shaping for Inference-Time Alignment: A Stackelberg Game Perspective

Haichuan Wang ⋅ Tao Lin ⋅ Lingkai Kong ⋅ Ce Li ⋅ Hezi Jiang ⋅ Milind Tambe

Abstract

Log in and register to view live content