Skip to yearly menu bar Skip to main content


Poster

PixCLIP: Towards Fine-grained Vision-Language Understanding via Any-granularity Pixel-Text Alignment

YiCheng Xiao ⋅ Yu Chen ⋅ Hao-Xuan Ma ⋅ Jiale Hong ⋅ Caorui Li ⋅ Lingxiang Wu ⋅ Haiyun Guo ⋅ Jinqiao Wang

Abstract

Log in and register to view live content