Skip to yearly menu bar Skip to main content


DiLoCo: Distributed Low-Communication Training of Language Models

Arthur Douillard ⋅ Qixuan Feng ⋅ Andrei Rusu ⋅ Rachita Chhaparia ⋅ Yani Donchev ⋅ Adhiguna Kuncoro ⋅ Marc'Aurelio Ranzato ⋅ Arthur Szlam ⋅ Jiajun Shen

Abstract

Chat is not available.