Skip to yearly menu bar Skip to main content


(9 events)   Timezone:  
Show all
Toggle Poster Visibility
Oral
Fri Jul 28 10:00 AM -- 10:08 AM (KST) @ Meeting Room 313 None
Mimetic Initialization of Self-Attention Layers
Asher Trockman · Zico Kolter
[ PDF
Oral
Fri Jul 28 10:08 AM -- 10:16 AM (KST) @ Meeting Room 313 None
Difference of submodular minimization via DC programming
Marwa El Halabi · George Orfanides · Tim Hoheisel
[ Slides [ PDF
Oral
Fri Jul 28 10:16 AM -- 10:24 AM (KST) @ Meeting Room 313 None
Simplex Random Features
Isaac Reid · Krzysztof Choromanski · Valerii Likhosherstov · Adrian Weller
[ PDF
Oral
Fri Jul 28 10:24 AM -- 10:32 AM (KST) @ Meeting Room 313 None
Patch-level Routing in Mixture-of-Experts is Provably Sample-efficient for Convolutional Neural Networks
Mohammed Nowaz Rabbani Chowdhury · Shuai Zhang · Meng Wang · Sijia Liu · Pin-Yu Chen
[ Slides [ PDF
Oral
Fri Jul 28 10:32 AM -- 10:40 AM (KST) @ Meeting Room 313 None
Tilted Sparse Additive Models
Yingjie Wang · Hong Chen · Weifeng Liu · Fengxiang He · Tieliang Gong · YouCheng Fu · Dacheng Tao
[ PDF
Oral
Fri Jul 28 10:40 AM -- 10:48 AM (KST) @ Meeting Room 313 None
Dynamic Regularized Sharpness Aware Minimization in Federated Learning: Approaching Global Consistency and Smooth Landscape
Yan Sun · Li Shen · Shixiang Chen · Liang Ding · Dacheng Tao
[ PDF
Oral
Fri Jul 28 10:48 AM -- 10:56 AM (KST) @ Meeting Room 313 None
Hyena Hierarchy: Towards Larger Convolutional Language Models
Michael Poli · Stefano Massaroli · Eric Nguyen · Daniel Y Fu · Tri Dao · Stephen Baccus · Yoshua Bengio · Stefano Ermon · Christopher Re
[ PDF
Oral
Fri Jul 28 10:56 AM -- 11:04 AM (KST) @ Meeting Room 313 None
Direct Parameterization of Lipschitz-Bounded Deep Networks
Ruigang Wang · Ian Manchester
[ PDF
Oral
Fri Jul 28 11:12 AM -- 11:20 AM (KST) @ Meeting Room 313 None
Subsample Ridge Ensembles: Equivalences and Generalized Cross-Validation
Jin-Hong Du · Pratik Patil · Arun Kuchibhotla
[ Slides [ PDF