

Poster

Video-OPD: Efficient Post-Training of Multimodal Large Language Models for Temporal Video Grounding via On-Policy Distillation

Jiaze Li ⋅ Hao Yin ⋅ Haoran Xu ⋅ Boshen Xu ⋅ Wenhui Tan ⋅ Zewen He ⋅ Jianzhong Ju ⋅ Zhenbo Luo ⋅ Jian Luan

Abstract
