Skip to yearly menu bar Skip to main content


Poster

Eliciting Language Model Behaviors with Investigator Agents

Xiang Li ⋅ Neil Chowdhury ⋅ Daniel Johnson ⋅ Tatsunori Hashimoto ⋅ Percy Liang ⋅ Sarah Schwettmann ⋅ Jacob Steinhardt
2025 Poster

Abstract

Lay Summary

Video

Chat is not available.