Skip to yearly menu bar Skip to main content


(4 events)   Timezone:  
Show all
Toggle Poster Visibility
Oral
Wed Jul 24 07:30 AM -- 07:45 AM (PDT) @ Hall A2 None
Stealing part of a production language model
Nicholas Carlini · Daniel Paleka · Krishnamurthy Dvijotham · Thomas Steinke · Jonathan Hayase · A. Feder Cooper · Katherine Lee · Matthew Jagielski · Milad Nasr · Arthur Conmy · Eric Wallace · David Rolnick · Florian Tramer
Oral
Wed Jul 24 07:45 AM -- 08:00 AM (PDT) @ Hall A2 None
Trained Random Forests Completely Reveal your Dataset
Julien Ferry · Ricardo Fukasawa · Timothée Pascal · Thibaut Vidal
[ Slides
Oral
Wed Jul 24 08:00 AM -- 08:15 AM (PDT) @ Hall A2 None
AI Control: Improving Safety Despite Intentional Subversion
Ryan Greenblatt · Buck Shlegeris · Kshitij Sachan · Fabien Roger
Oral
Wed Jul 24 08:15 AM -- 08:30 AM (PDT) @ Hall A2 None
Low-Cost High-Power Membership Inference Attacks
Sajjad Zarifzadeh · Philippe Liu · Reza Shokri