Skip to yearly menu bar Skip to main content


Poster

SafeCompass: Dynamic Chain-of-Thought Steering via Inference-Time Safety Signals

Zeyang Zhang ⋅ HAOTIAN XU ⋅ Linbao Li ⋅ Qi Sun ⋅ Xuebo Liu ⋅ YU LI ⋅ Cheng Zhuo

Abstract

Log in and register to view live content