Workshop
|
|
Introducing Vision into Large Language Models Expands Attack Surfaces and Failure Implications
|
|
Workshop
|
Fri 15:15
|
EPITOME: Experimental Protocol Inventory for Theory Of Mind Evaluation
Cameron Jones · Sean Trott · Ben Bergen
|
|
Workshop
|
|
DiversiGATE: A Comprehensive Framework for Reliable Large Language Models
Shima Imani · Ali Beyram · Harsh Shrivastava
|
|
Workshop
|
|
DiversiGATE: A Comprehensive Framework for Reliable Large Language Models
Shima Imani · Ali Beyram · Harsh Shrivastava
|
|
Workshop
|
|
Large Language Models for Code: Security Hardening and Adversarial Testing
Jingxuan He · Martin Vechev
|
|
Workshop
|
|
Why do universal adversarial attacks work on large language models?: Geometry might be the answer
Varshini Subhash · Anna Bialas · Siddharth Swaroop · Weiwei Pan · Finale Doshi-Velez
|
|
Workshop
|
|
Baselines for Identifying Watermarked Large Language Models
Leonard Tang · Gavin Uberti · Tom Shlomi
|
|
Workshop
|
Fri 15:15
|
Preference Proxies: Evaluating Large Language Models in capturing Human Preferences in Human-AI Tasks
Mudit Verma · Siddhant Bhambri · Subbarao Kambhampati
|
|
Workshop
|
|
The Unseen A+ Student: Navigating the Impact of Large Language Models in the Classroom
Matyáš Boháček
|
|
Workshop
|
|
Why do universal adversarial attacks work on large language models?: Geometry might be the answer
|
|
Poster
|
Thu 13:30
|
Guiding Pretraining in Reinforcement Learning with Large Language Models
Yuqing Du · Olivia Watkins · Zihan Wang · Cédric Colas · Trevor Darrell · Pieter Abbeel · Abhishek Gupta · Jacob Andreas
|
|
Oral
|
Wed 19:08
|
Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling
Stella Biderman · Hailey Schoelkopf · Quentin Anthony · Herbie Bradley · Kyle O'Brien · Eric Hallahan · Mohammad Aflah Khan · Shivanshu Purohit · USVSN Sai Prashanth · Edward Raff · Aviya Skowron · Lintang Sutawika · Oskar van der Wal
|
|