Skip to yearly menu bar Skip to main content


DiLoCo: Distributed Low-Communication Training of Language Models

Arthur Douillard · Qixuan Feng · Andrei Rusu · Rachita Chhaparia · Yani Donchev · Adhiguna Kuncoro · Marc'Aurelio Ranzato · Arthur Szlam · Jiajun Shen

Abstract

Chat is not available.