One-Shot Safety Alignment for Large Language Models via Optimal Dualization

Xinmeng Huang ⋅ Shuo Li ⋅ Edgar Dobriban ⋅ Osbert Bastani ⋅ Hamed Hassani ⋅ Dongsheng Ding

Abstract
