

Poster Wed, Jul 8, 2026 • 2:30 PM – 4:15 PM KST Coex: HALL A

Identifying and Mitigating Errors in Gradient Aggregation of Distributed Data Parallel Training

Zhenheng Tang ⋅ Junlin Huang ⋅ Zichen TANG ⋅ Xueze Kang ⋅ Yuxin Wang ⋅ Peijie Dong ⋅ Shaohuai Shi ⋅ Xiaowen Chu ⋅ Bo Li

Abstract
