10   Show all »
Toggle Poster Visibility
Oral
Tue Jun 11th 04:00 -- 04:20 PM @ Hall B
An Investigation into Neural Net Optimization via Hessian Eigenvalue Density
Behrooz Ghorbani · Shankar Krishnan · Ying Xiao
Oral
Tue Jun 11th 04:20 -- 04:25 PM @ Hall B
Differentiable Linearized ADMM
Xingyu Xie · Jianlong Wu · Guangcan Liu · Zhisheng Zhong · Zhouchen Lin
Oral
Tue Jun 11th 04:25 -- 04:30 PM @ Hall B
Adaptive Stochastic Natural Gradient Method for One-Shot Neural Architecture Search
Youhei Akimoto · Shinichi Shirakawa · Nozomu Yoshinari · Kento Uchida · Shota Saito · Kouhei Nishida
Oral
Tue Jun 11th 04:30 -- 04:35 PM @ Hall B
A Quantitative Analysis of the Effect of Batch Normalization on Gradient Descent
YongQiang Cai · Qianxiao Li · Zuowei Shen
Oral
Tue Jun 11th 04:35 -- 04:40 PM @ Hall B
The Effect of Network Width on Stochastic Gradient Descent and Generalization: an Empirical Study
Daniel Park · Jascha Sohl-Dickstein · Quoc Le · Samuel L Smith
Oral
Tue Jun 11th 04:40 -- 05:00 PM @ Hall B
AdaGrad stepsizes: sharp convergence over nonconvex landscapes
Rachel Ward · Xiaoxia Wu · Leon Bottou
Oral
Tue Jun 11th 05:00 -- 05:05 PM @ Hall B
Beyond Backprop: Online Alternating Minimization with Auxiliary Variables
Anna Choromanska · Benjamin Cowen · Sadhana Kumaravel · Ronny Luss · Mattia Rigotti · Irina Rish · Paolo DiAchille · Viatcheslav Gurev · Brian Kingsbury · Ravi Tejwani · Djallel Bouneffouf
Oral
Tue Jun 11th 05:05 -- 05:10 PM @ Hall B
SWALP : Stochastic Weight Averaging in Low Precision Training
Guandao Yang · Tianyi Zhang · Polina Kirichenko · Junwen Bai · Andrew Wilson · Christopher De Sa
Oral
Tue Jun 11th 05:10 -- 05:15 PM @ Hall B
Efficient optimization of loops and limits with randomized telescoping sums
Alex Beatson · Ryan P Adams
Oral
Tue Jun 11th 05:15 -- 05:20 PM @ Hall B
Self-similar Epochs: Value in arrangement
Eliav Buchnik · Edith Cohen · Avinatan Hasidim · Yossi Matias