Skip to yearly menu bar Skip to main content


MET-Bench: Multimodal Entity Tracking for Evaluating the Limitations of Vision-Language and Reasoning Models

Vanya Cohen · Ray Mooney

Abstract

Chat is not available.