Skip to yearly menu bar Skip to main content


Poster

UniScale: Adaptive Unified Inference Scaling via Online Joint Optimization of Model Routing and Test-Time Scaling

Kaiyu Huang ⋅ Xingyu Wang ⋅ Mingze Kong ⋅ Zhubo Shi ⋅ Yuqian Hou ⋅ Hong Xu ⋅ Zhongxiang Dai ⋅ Minchen Yu ⋅ Qingjiang Shi

Abstract

Log in and register to view live content