Skip to yearly menu bar Skip to main content


Gradient descent induces alignment between weights and the pre-activation tangents for deep non-linear networks

Daniel Beaglehole · Ioannis Mitliagkas · Atish Agarwala

Abstract

Chat is not available.