Skip to yearly menu bar Skip to main content


Poster

CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation

Chengzhuo Tong ⋅ Chang Mingkun ⋅ Shenglong Zhang ⋅ Yuran Wang ⋅ Cheng Liang ⋅ Zhizheng Zhao ⋅ Bohan Zeng ⋅ Yang Shi ⋅ Ruichuan An ⋅ Yifan Dai ⋅ Ziming Zhao ⋅ Guanbin Li ⋅ Pengfei Wan ⋅ Yuanxing Zhang ⋅ Wentao Zhang

Abstract

Log in and register to view live content