Skip to yearly menu bar Skip to main content


Oral
in
Workshop: ICML 2025 Workshop on Collaborative and Federated Agentic Workflows (CFAgentic @ ICML'25)
Sat, Jul 19, 2025 • 1:35 PM – 1:45 PM PDT

Generalizing Trust: Weak-to-Strong Trustworthiness in Language Models

Lillian Sun · Martin Pawelczyk · Zhenting Qi · Aounon Kumar · Himabindu Lakkaraju

Abstract

Video

Chat is not available.