LAVA: Language Audio Vision Alignment for Pre-Training Transformers on Video Data

Sumanth Gurram ⋅ David Chan ⋅ Andy Fang ⋅ John Canny

Abstract
