Skip to yearly menu bar Skip to main content


(9 events)   Timezone:  
Show all
Toggle Poster Visibility
Oral
Thu Jul 27 06:00 PM -- 06:08 PM (PDT) @ Meeting Room 313 None
Mimetic Initialization of Self-Attention Layers
Asher Trockman · Zico Kolter
[ PDF
Oral
Thu Jul 27 06:08 PM -- 06:16 PM (PDT) @ Meeting Room 313 None
Difference of submodular minimization via DC programming
Marwa El Halabi · George Orfanides · Tim Hoheisel
[ Slides [ PDF
Oral
Thu Jul 27 06:16 PM -- 06:24 PM (PDT) @ Meeting Room 313 None
Simplex Random Features
Isaac Reid · Krzysztof Choromanski · Valerii Likhosherstov · Adrian Weller
[ PDF
Oral
Thu Jul 27 06:24 PM -- 06:32 PM (PDT) @ Meeting Room 313 None
Patch-level Routing in Mixture-of-Experts is Provably Sample-efficient for Convolutional Neural Networks
Mohammed Nowaz Rabbani Chowdhury · Shuai Zhang · Meng Wang · Sijia Liu · Pin-Yu Chen
[ Slides [ PDF
Oral
Thu Jul 27 06:32 PM -- 06:40 PM (PDT) @ Meeting Room 313 None
Tilted Sparse Additive Models
Yingjie Wang · Hong Chen · Weifeng Liu · Fengxiang He · Tieliang Gong · YouCheng Fu · Dacheng Tao
[ PDF
Oral
Thu Jul 27 06:40 PM -- 06:48 PM (PDT) @ Meeting Room 313 None
Dynamic Regularized Sharpness Aware Minimization in Federated Learning: Approaching Global Consistency and Smooth Landscape
Yan Sun · Li Shen · Shixiang Chen · Liang Ding · Dacheng Tao
[ PDF
Oral
Thu Jul 27 06:48 PM -- 06:56 PM (PDT) @ Meeting Room 313 None
Hyena Hierarchy: Towards Larger Convolutional Language Models
Michael Poli · Stefano Massaroli · Eric Nguyen · Daniel Y Fu · Tri Dao · Stephen Baccus · Yoshua Bengio · Stefano Ermon · Christopher Re
[ PDF
Oral
Thu Jul 27 06:56 PM -- 07:04 PM (PDT) @ Meeting Room 313 None
Direct Parameterization of Lipschitz-Bounded Deep Networks
Ruigang Wang · Ian Manchester
[ PDF
Oral
Thu Jul 27 07:12 PM -- 07:20 PM (PDT) @ Meeting Room 313 None
Subsample Ridge Ensembles: Equivalences and Generalized Cross-Validation
Jin-Hong Du · Pratik Patil · Arun Kuchibhotla
[ Slides [ PDF