Skip to yearly menu bar Skip to main content


Poster

Catapults in SGD: spikes in the training loss and their impact on generalization through feature learning

Libin Zhu ⋅ Chaoyue Liu ⋅ Adityanarayanan Radhakrishnan ⋅ Misha Belkin
2024 Poster

Abstract

Chat is not available.