Skip to yearly menu bar Skip to main content


Poster

OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction

Huang Huang ⋅ Fangchen Liu ⋅ Letian Fu ⋅ Tingfan Wu ⋅ Mustafa Mukadam ⋅ Jitendra Malik ⋅ Ken Goldberg ⋅ Pieter Abbeel
2025 Poster

Abstract

Lay Summary

Video

Chat is not available.