

LAVA: Language Audio Vision Alignment for Pre-Training Transformers on Video Data

Sumanth Gurram · David Chan · Andy Fang · John Canny

Abstract
