firstbacksecondback
225 Results
Workshop
|
A Simple and Effective Pruning Approach for Large Language Models Mingjie Sun · Zhuang Liu · Anna Bair · Zico Kolter |
||
Workshop
|
Plan, Eliminate, and Track --- Language Models are Good Teachers for Embodied Agents. Yue Wu · So Yeon Min · Yonatan Bisk · Ruslan Salakhutdinov · Amos Azaria · Yuanzhi Li · Tom Mitchell · Shrimai Prabhumoye |
||
Workshop
|
Don't trust your eyes: on the (un)reliability of feature visualizations |
||
Workshop
|
Continual Pre-Training of Large Language Models: How to re-warm your model? Kshitij Gupta · Benjamin Thérien · Adam Ibrahim · Mats Richter · Quentin Anthony · Eugene Belilovsky · Timothée Lesort · Irina Rish |
||
Poster
|
Wed 17:00 |
Towards credible visual model interpretation with path attribution Naveed Akhtar · Mohammad Jalwana |
|
Workshop
|
Teach GPT To Phish Ashwinee Panda · Zhengming Zhang · Yaoqing Yang · Prateek Mittal |
||
Poster
|
Wed 14:00 |
Less is More: Task-aware Layer-wise Distillation for Language Model Compression Chen Liang · Simiao Zuo · Qingru Zhang · Pengcheng He · Weizhu Chen · Tuo Zhao |
|
Workshop
|
Can Public Large Language Models Help Private Cross-device Federated Learning? Boxin Wang · Yibo J. Zhang · Yuan Cao · Bo Li · Hugh B McMahan · Sewoong Oh · Zheng Xu · Manzil Zaheer |
||
Workshop
|
RewriteLM: An Instruction-Tuned Large Language Model for Text Rewriting Liangchen Luo · Lei Shu · Jayakumar Hoskere · Yun Zhu · Canoee Liu · Simon Tong · Jindong Chen · Lei Meng |
||
Poster
|
Wed 17:00 |
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models Guangxuan Xiao · Ji Lin · Mickael Seznec · Hao Wu · Julien Demouth · Song Han |
|
Workshop
|
DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining Sang Michael Xie · Hieu Pham · Xuanyi Dong · Nan Du · Hanxiao Liu · Yifeng Lu · Percy Liang · Quoc Le · Tengyu Ma · Adams Wei Yu |
||
Workshop
|
Are Emergent Abilities of Large Language Models a Mirage? Rylan Schaeffer · Brando Miranda · Sanmi Koyejo |