Skip to yearly menu bar Skip to main content


Is a Good Description Worth a Thousand Pictures? Reducing Multimodal Alignment to Text-Based, Unimodal Alignment

Amin Memarian · Touraj Laleh · Irina Rish · Ardavan S. Nobandegani

Abstract

Video

Chat is not available.