Skip to yearly menu bar Skip to main content


When does gradient descent with logistic loss interpolate using deep networks with smoothed ReLU activations?

Niladri Chatterji ⋅ Phil Long ⋅ Peter Bartlett

Abstract

Chat is not available.