

Poster

Q-Probe: A Lightweight Approach to Reward Maximization for Language Models

Kenneth Li · Samy Jelassi · Hugh Zhang · Sham Kakade · Martin Wattenberg · David Brandfonbrener
2024 Poster

Abstract
