Skip to yearly menu bar Skip to main content


Search All 2023 Events
 

227 Results

<<   <   Page 4 of 19   >   >>
Workshop
Introducing Vision into Large Language Models Expands Attack Surfaces and Failure Implications
Workshop
Fri 15:15 EPITOME: Experimental Protocol Inventory for Theory Of Mind Evaluation
Cameron Jones · Sean Trott · Ben Bergen
Workshop
DiversiGATE: A Comprehensive Framework for Reliable Large Language Models
Shima Imani · Ali Beyram · Harsh Shrivastava
Workshop
DiversiGATE: A Comprehensive Framework for Reliable Large Language Models
Shima Imani · Ali Beyram · Harsh Shrivastava
Workshop
Large Language Models for Code: Security Hardening and Adversarial Testing
Jingxuan He · Martin Vechev
Workshop
Why do universal adversarial attacks work on large language models?: Geometry might be the answer
Varshini Subhash · Anna Bialas · Siddharth Swaroop · Weiwei Pan · Finale Doshi-Velez
Workshop
Baselines for Identifying Watermarked Large Language Models
Leonard Tang · Gavin Uberti · Tom Shlomi
Workshop
Fri 15:15 Preference Proxies: Evaluating Large Language Models in capturing Human Preferences in Human-AI Tasks
Mudit Verma · Siddhant Bhambri · Subbarao Kambhampati
Workshop
The Unseen A+ Student: Navigating the Impact of Large Language Models in the Classroom
Matyáš Boháček
Workshop
Why do universal adversarial attacks work on large language models?: Geometry might be the answer
Poster
Thu 13:30 Guiding Pretraining in Reinforcement Learning with Large Language Models
Yuqing Du · Olivia Watkins · Zihan Wang · Cédric Colas · Trevor Darrell · Pieter Abbeel · Abhishek Gupta · Jacob Andreas
Oral
Wed 19:08 Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling
Stella Biderman · Hailey Schoelkopf · Quentin Anthony · Herbie Bradley · Kyle O'Brien · Eric Hallahan · Mohammad Aflah Khan · Shivanshu Purohit · USVSN Sai Prashanth · Edward Raff · Aviya Skowron · Lintang Sutawika · Oskar van der Wal