Poster
|
Wed 17:00
|
RLEG: Vision-Language Representation Learning with Diffusion-based Embedding Generation
Liming Zhao · Kecheng Zheng · Yun Zheng · Deli Zhao · Jingren Zhou
|
|
Workshop
|
|
Refined and Enriched Physics-based Captions For Unseen Dynamic Changes
|
|
Workshop
|
|
Refined and Enriched Physics-based Captions for Unseen Dynamic Changes
Hidetomo Sakaino
|
|
Workshop
|
|
Spotlight: Mobile UI Understanding using Vision-Language Models with a Focus
Gang Li · Yang Li
|
|
Poster
|
Thu 16:30
|
ILLUME: Rationalizing Vision-Language Models through Human Interactions
Manuel Brack · Patrick Schramowski · Björn Deiseroth · Kristian Kersting
|
|
Workshop
|
|
Identifying Implicit Social Biases in Vision-Language Models
Kimia Hamidieh · Haoran Zhang · Thomas Hartvigsen · Marzyeh Ghassemi
|
|
Poster
|
Wed 14:00
|
Distilling Internet-Scale Vision-Language Models into Embodied Agents
Theodore R Sumers · Kenneth Marino · Arun Ahuja · Rob Fergus · Ishita Dasgupta
|
|
Workshop
|
|
The Role of Linguistic Priors in Measuring Compositional Generalization of Vision-language Models
Chenwei Wu · Li Li · Stefano Ermon · Patrick Haffner · Rong Ge · Zaiwei Zhang
|
|
Workshop
|
|
What’s left can’t be right - The remaining positional incompetence of contrastive vision-language models
Nils Hoehing · Ellen Rushe · Anthony Ventresque
|
|
Poster
|
Wed 17:00
|
Continual Vision-Language Representation Learning with Off-Diagonal Information
zixuan ni · Longhui Wei · Siliang Tang · Yueting Zhuang · Qi Tian
|
|
Poster
|
Thu 16:30
|
UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers
Dachuan Shi · Chaofan Tao · Ying Jin · Zhendong Yang · Chun Yuan · Jiaqi Wang
|
|
Workshop
|
|
UOTA: Unsupervised Open-Set Task Adaptation Using a Vision-Language Foundation Model
Youngjo Min · Kwangrok Ryoo · Bumsoo Kim · Taesup Kim
|
|