

Poster

XR-1: Towards Versatile Vision-Language-Action Models via Learning Unified Vision-Motion Representations

Shichao Fan ⋅ Kun Wu ⋅ Zhengping Che ⋅ Xinhua Wang ⋅ Di Wu ⋅ Fei Liao ⋅ Ning Liu ⋅ Yixue Zhang ⋅ Zhen Zhao ⋅ Zhiyuan Xu ⋅ Meng Li ⋅ Qingjie Liu ⋅ Shanghang Zhang ⋅ Min Wan ⋅ Jian Tang

Abstract
