Skip to yearly menu bar Skip to main content


Poster

Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective

Wu Lin ⋅ Felix Dangel ⋅ Runa Eschenhagen ⋅ Juhan Bae ⋅ Richard E Turner ⋅ Alireza Makhzani
2024 Poster

Abstract

Chat is not available.