Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Next Generation of AI Safety

A statistical framework for weak-to-strong generalization

Seamus Somerstep ⋅ Felipe Maia Polo ⋅ Moulinath Banerjee ⋅ Yaacov Ritov ⋅ Mikhail Yurochkin ⋅ Yuekai Sun

Abstract

Chat is not available.