Skip to yearly menu bar Skip to main content


Poster Wed, Jul 8, 2026 • 6:30 PM – 8:15 PM PDT HALL A #2712

Stabilizing MoE Reinforcement Learning by Aligning Training and Inference Routers

Wenhan Ma ⋅ Hailin Zhang ⋅ Liang Zhao ⋅ Yifan Song ⋅ Yudong Wang ⋅ Fuli Luo ⋅ Zhifang Sui

Abstract

Log in and register to view live content