Poster

Self-Captioning Multimodal Interaction Tuning: Amplifying Exploitable Redundancies for Robust Vision Language Models

Yuriel Ryan ⋅ Ip Man ⋅ Adriel Kuek ⋅ Paul Pu Liang ⋅ Roy Lee

Abstract
