Skip to yearly menu bar Skip to main content


Poster

Q-Probe: A Lightweight Approach to Reward Maximization for Language Models

Kenneth Li ⋅ Samy Jelassi ⋅ Hugh Zhang ⋅ Sham Kakade ⋅ Martin Wattenberg ⋅ David Brandfonbrener
2024 Poster

Abstract

Chat is not available.