Skip to yearly menu bar Skip to main content


Poster

POROver: Improving Safety and Reducing Overrefusal in Large Language Models with Overgeneration and Preference Optimization

Batuhan K. Karaman ⋅ ishmam zabir ⋅ Alon Benhaim ⋅ Vishrav Chaudhary ⋅ Mert Sabuncu ⋅ Xia Song
2025 Poster

Abstract

Lay Summary

Video

Chat is not available.