Skip to yearly menu bar Skip to main content


Poster Thu, Jul 17, 2025 • 11:00 AM – 1:30 PM PDT

Eliciting Language Model Behaviors with Investigator Agents

Xiang Li · Neil Chowdhury · Daniel Johnson · Tatsunori Hashimoto · Percy Liang · Sarah Schwettmann · Jacob Steinhardt

Abstract

Lay Summary

Video

Chat is not available.