Type | Time | Title | Authors
Spotlight | Thu 19:40 | Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech Translation | Renjie Zheng · Junkun Chen · Mingbo Ma · Liang Huang
Spotlight | Thu 20:30 | On Lower Bounds for Standard and Robust Gaussian Process Bandit Optimization | Xu Cai · Jonathan Scarlett
Poster | Thu 21:00 | Few-shot Language Coordination by Modeling Theory of Mind | Hao Zhu · Graham Neubig · Yonatan Bisk
Poster | Tue 9:00 | Principal Component Hierarchy for Sparse Quadratic Programs | Robbie Vreugdenhil · Viet Anh Nguyen · Armin Eftekhari · Peyman Mohajerin Esfahani
Oral | Thu 19:20 | Mixed Cross Entropy Loss for Neural Machine Translation | Haoran Li · Wei Lu
Poster | Thu 21:00 | Mixed Cross Entropy Loss for Neural Machine Translation | Haoran Li · Wei Lu
Spotlight | Thu 17:35 | BASE Layers: Simplifying Training of Large, Sparse Models | Mike Lewis · Shruti Bhosale · Tim Dettmers · Naman Goyal · Luke Zettlemoyer
Spotlight | Thu 7:45 | EL-Attention: Memory Efficient Lossless Attention for Generation | Yu Yan · Jiusheng Chen · Weizhen Qi · Nikhil Bhendawade · Yeyun Gong · Nan Duan · Ruofei Zhang
Poster | Thu 21:00 | Straight to the Gradient: Learning to Use Novel Tokens for Neural Text Generation | Xiang Lin · Simeng Han · Shafiq Joty
Oral | Thu 17:00 | Straight to the Gradient: Learning to Use Novel Tokens for Neural Text Generation | Xiang Lin · Simeng Han · Shafiq Joty
Poster | Thu 21:00 | BASE Layers: Simplifying Training of Large, Sparse Models | Mike Lewis · Shruti Bhosale · Tim Dettmers · Naman Goyal · Luke Zettlemoyer
Poster | Thu 9:00 | EL-Attention: Memory Efficient Lossless Attention for Generation | Yu Yan · Jiusheng Chen · Weizhen Qi · Nikhil Bhendawade · Yeyun Gong · Nan Duan · Ruofei Zhang