Skip to yearly menu bar Skip to main content


Oral Wed, Jul 8, 2026 • 4:15 PM – 4:30 PM KST

From Pixels to Tokens: A Systematic Study of Latent Action Supervision for Vision-Language-Action Models

Yihan Lin ⋅ Haoyang Li ⋅ Yang Li ⋅ Haitao Shen ⋅ Yihan Zhao ⋅ Chao Shao ⋅ Jing Zhang

Abstract

Log in and register to view live content