2nd Workshop on Generative AI and Law (GenLaw ’24)

Workshop

2nd Workshop on Generative AI and Law (GenLaw ’24)

Katherine Lee · A. Feder Cooper · Niloofar Mireshghallah · James Grimmelmann · Matthew Jagielski · Milad Nasr · Fernando Delgado · Lydia Belkadi

Lehar 2

Sat 27 Jul, midnight PDT

[ Abstract ] Workshop Website

Excitement about the capabilities of generative-AI systems has touched nearly every corner of ML research and public life. Amid such exhilarating potential, there is also intensifying unease around the development and deployment of generative-AI systems. By now, it is well-known that generative models ingest vast quantities of intellectual property (IP) [8–10], which they can regurgitate verbatim [1–3, 11, 12]. Such memorization has been the continued focus of copyright-focused lawsuits [4], but memorization and copyright just scratch the surface of potential legal issues at play. In the report from our ICML workshop last year, we produced a taxonomy of emerging issues that touch on intent, privacy, misinformation and disinformation, and IP (more broadly) [5]. Indeed, based on the events of the past year alone — executive orders [13], lawsuits [4], new and amended laws [7], and labor strikes [6] — it has only become clearer that there are significant “technical, doctrinal, and policy challenges presented by law for Generative AI, and by Generative AI for law” [5]. Within this challenging and fast-moving landscape, GenLaw has played an important clarifying and cross-educational role. The first GenLaw workshop at ICML 2023 hosted over 400 attendees in person, and our workshop recording has been watched over 1k times. Collectively, our blog and workshop report have been viewed over 25k times. GenLaw has helped pose novel questions (and refine existing ones) that push the frontier of generative-AI system capabilities in ways that attend to important legal considerations. We have been told repeatedly that the keynotes, panels, and conversations at last year’s workshop have even changed the trajectories of numerous Ph.D. students’ research, and have sparked entire new lines of inquiry in law and policy.Building on our past success, our workshop will continue to develop a comprehensive and precise synthesis of the legal issues at play, and of the associated ML research questions that these issues raise. We will leverage ICML’s location in Vienna to widen the scope of our legal engagement to the UK and EU, centering keynotes and panel participation from UK and EU researchers and scholars. Drawing from the research program developed in last year’s workshop report [5], we will concentrate our program on issues of IP, mis-/dis-information, and privacy. Based on (1) enthusiasm from the community to hold another GenLaw workshop at ICML, (2) interest in response to soliciting speakers and PC members, and (3) the continued explosion of general public interest in generative AI, we expect around 300 attendees in person, and at least another 300 virtually.

Chat is not available.

Timezone: America/Los_Angeles

Schedule

Sat 12:00 a.m. - 12:15 a.m.	Opening Remarks ( Talk ) > SlidesLive Video	A. Feder Cooper · Katherine Lee 🔗
Sat 12:15 a.m. - 12:30 a.m.	Kyle Lo (Allen Institute for AI), Training Data Curation for OLMo ( Invited Talk ) > SlidesLive Video	🔗
Sat 12:30 a.m. - 12:45 a.m.	Gabriele Mazzini, Introduction to the AI Act and Generative AI ( Invited Talk ) > SlidesLive Video	🔗
Sat 12:45 a.m. - 1:00 a.m.	Martin SenftLeben (U. Amsterdam), Copyright and GenAI Development: Regulatory Approaches and Challenges in the EU and Beyond ( Invited Talk ) > SlidesLive Video	🔗
Sat 1:00 a.m. - 1:30 a.m.	Coffee	🔗
Sat 1:30 a.m. - 2:30 a.m.	Panel: Data Curation and IP ( Panel ) > SlidesLive Video	🔗
Sat 2:30 a.m. - 2:45 a.m.	Connor Dunlop (Ada Lovelace), GPAI governance and oversight in the EU – and how you might be able to contribute ( Invite Talk ) > SlidesLive Video	🔗
Sat 2:45 a.m. - 3:00 a.m.	Sabrina Küspert (EU AI Commission), “Implementing the AI Act” ( Invited Talk ) > SlidesLive Video	🔗
Sat 3:00 a.m. - 4:30 a.m.	Lunch	🔗
Sat 4:30 a.m. - 5:00 a.m.	Spotlight Paper Presentations ( Spotlights ) > SlidesLive Video	🔗
Sat 5:00 a.m. - 5:30 a.m.	Poster Session ( Poster Session ) >	🔗
Sat 6:00 a.m. - 6:30 a.m.	Coffee	🔗
Sat 6:30 a.m. - 6:45 a.m.	Matthew Jagielski and Katja Filippova (Google Deepmind), Machine Unlearning ( Invited Talk ) > SlidesLive Video	Matthew Jagielski · Katja Filippova 🔗
Sat 6:45 a.m. - 7:00 a.m.	Kimberly Mai (U.K. ICO), Data Protection in the Era of Generative AI ( Invited Talk ) > SlidesLive Video	🔗
Sat 7:00 a.m. - 7:15 a.m.	Herbie Bradley (UK AISI), Digital Services Act ( Invited Talk ) > SlidesLive Video	🔗
Sat 7:15 a.m. - 8:00 a.m.	Panel: Privacy and Data Policy ( Panel ) > SlidesLive Video	🔗
-	Disguised Copyright Infringement of Latent Diffusion Models ( Poster ) >	Yiwei Lu · Matthew Yang · Zuoqiu Liu · Gautam Kamath · Yaoliang Yu 🔗
-	Artificial Inventorship ( Poster ) >	Lital Helman 🔗
-	Robustness in the EU Artificial Intelligence Act ( Poster ) >	Henrik Nolte · Miriam Rateike · Michèle Finck 🔗
-	The Files are in the Computer: Copyright, Memorization, and Generative AI ( Poster ) >	A. Feder Cooper · James Grimmelmann 🔗
-	Building a Long-Text Privacy Policy Corpus with Multi-Class Labels ( Poster ) >	David Stein · Florencia Marotta-Wurgler 🔗
-	Bias in Legal Data for Generative AI ( Poster ) >	Holli Sargeant · Måns Magnusson 🔗
-	Laypeople’s Egocentric Perceptions of Copyright For AI-Generated Art ( Poster ) >	Gabriel Lima · Nina Grgić-Hlača · Elissa Redmiles 🔗
-	The Defamation Machine ( Poster ) >	James Grimmelmann 🔗
-	Moral and Legal Responsibility for General-Purpose Technologies ( Poster ) >	James Grimmelmann · David Gray Widder 🔗
-	Federated Learning and AI Regulation in the European Union: Who is liable? – An Interdisciplinary Analysis ( Poster ) >	Herbert Woisetschlaeger · Simon Mertel · Ruben Mayer · Christoph Krönke · Hans-Arno Jacobsen 🔗
-	Protecting Text IP in the Era of LLMs with Robust and Scalable Watermarking ( Poster ) >	Gregory Kang Ruey Lau · Xinyuan Niu · Hieu Dao · Jiangwei Chen · Chuan Sheng Foo · Bryan Kian Hsiang Low 🔗
-	Infinite Exclusivity: Generative AI’s Endless Challenges to the “Exclusive Right” in European Copyright Frameworks ( Poster ) >	Zachary Cooper 🔗
-	The Dilemma of Uncertainty Estimation and Systemic Risk in the EU AI Act ( Poster ) >	Matias Valdenegro-Toro · Radina Stoykova 🔗
-	Care for Chatbots ( Poster ) >	Peter Wills 🔗
-	Learning to Copy ( Poster ) >	Peter Wills 🔗
-	Navigating Risks and Rewards of Generative Model-based Synthetic Datasets: A Regulatory Perspective ( Poster ) >	Debalina Renishkumar Padariya · Isabel Wagner 🔗
-	Giving yourself away for training? The problem of ‘Pay or Okay’ for Generative AI ( Poster ) >	Margarita Amaxopoulou 🔗
-	What Lies Ahead for Generative AI Watermarking ( Poster ) >	Pierre Fernandez · Anthony Level · Teddy Furon 🔗
-	CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images ( Poster ) >	Aaron Gokaslan · A. Feder Cooper · Jasmine Collins · Landan Seguin · Austin Jacobson · Mihir Patel · Jonathan Frankle · Cory Stephenson · Volodymyr Kuleshov 🔗
-	GROG: Reducing LLM Hallucinations for Improved Legal Reasoning ( Poster ) >	Daniel McNeela 🔗
-	Strong Copyright Protection for Language Models via Adaptive Model Fusion ( Poster ) >	Javier Abad · Konstantin Donhauser · Francesco Pinto 🔗
-	The Data Minimization Principle in Machine Learning ( Poster ) >	Ferdinando Fioretto 🔗
-	Capacity Control is an Effective Memorization Mitigation Mechanism ( Poster ) >	Raman Dutt · Pedro Sanchez · Ondrej Bohdal · Sotirios Tsaftaris · Timothy Hospedales 🔗
-	Diffusion Unlearning Optimization for Robust and Safe Text-to-Image Models ( Poster ) >	Yong-Hyun Park · Sangdoo Yun · Jin-Hwa Kim · Junho Kim · Geonhui Jang · Yonghyun Jeong · Junghyo Jo · Gayoung Lee 🔗
-	Bias as a Feature ( Poster ) >	Uri Hacohen · Niva Elkin-Koren 🔗
-	Not Every Image is Worth a Thousand Words: Quantifying Originality in Stable Diffusion ( Poster ) >	Adi Haviv · Shahar Sarfaty · Uri Hacohen · Niva Elkin-Koren · Roi Livni · Amit Bermano 🔗
-	Federated Learning Priorities Under the European Union Artificial Intelligence Act ( Poster ) >	Herbert Woisetschlaeger · Alexander Erben · Bill Marino · Shiqiang Wang · Nic Lane · Ruben Mayer · Hans-Arno Jacobsen 🔗
-	Relearning what success means for machine unlearning ( Poster ) >	18 presenters A. Feder Cooper · Christopher Choquette Choo · Ken Ziyu Liu · Amy Cyphert · Mark Lemley · Miranda Bogen · Matthew Jagielski · Seth Neel · Niloofar Mireshghallah · Lama Ahmad · James Grimmelmann · David Bau · Vitaly Shmatikov · Fernando Delgado · Chris De Sa · Katja Filippova · Nicolas Papernot · Katherine Lee 🔗
-	Ageism unrestrained? The unchallenged bias against older people in AI ( Poster ) >	Arna Woemmel · Aileen Nielsen 🔗
-	Rethinking LLM Memorization through the Lens of\\Adversarial Compression ( Poster ) >	Avi Schwarzschild · Zhili Feng · Pratyush Maini · Ethan Herenstein · Zachary Lipton 🔗
-	Memorization is Localized within a Small Subspace in Diffusion Models ( Poster ) >	Ruchika Chavhan · Yongshuo Zong · Ondrej Bohdal · Da Li · Timothy Hospedales 🔗
-	Attention: Your Conversational Data is What They Need ( Poster ) >	Jakob Merane 🔗
-	Quantifying Likeness: A Computer Vision Approach to Identifying Style and Copyright Infringement in AI-Generated Artwork ( Poster ) >	Michaela Drouillard · Ryan Spencer · Nikée Allen · Tegan Maharaj 🔗
-	The Ground Truth about Legal Hallucinations ( Poster ) >	Eliza Mik 🔗
-	Dallma: Semi-Structured Legal Reasoning and Drafting with Large Language Models ( Poster ) >	Hannes Westermann 🔗
-	Evaluating Copyright Takedown Methods for Language Models ( Poster ) >	Boyi Wei · Weijia Shi · Yangsibo Huang · Noah Smith · Chiyuan Zhang · Luke Zettlemoyer · Kai Li · Peter Henderson 🔗
-	Opt-out or consent? The AI industry’s approach to leveraging Terms of Service to harvest user data ( Poster ) >	Giulia Olivato 🔗
-	Unlocking Fair Use in the Generative AI Supply Chain: A Systematized Literature Review ( Poster ) >	Amruta Mahuli · Asia Biega 🔗
-	Randomization Techniques to Mitigate the Risk of Copyright Infringement ( Poster ) >	Wei-Ning Chen · Peter Kairouz · Sewoong Oh · Zheng Xu 🔗
-	Machine Unlearning via Simulated Oracle Matching ( Poster ) >	Shivam Garg · Kristian Georgiev · Andrew Ilyas · Sung Min (Sam) Park · Roy Rinberg · Aleksander Madry · Seth Neel 🔗
-	Synthetic Data, Similarity-based Privacy Metrics, and Regulatory (Non-)Compliance ( Poster ) >	Georgi Ganev 🔗
-	Experimenting with Legal AI Solutions: The Case of Question-Answering for Access to Justice ( Poster ) >	Jonathan Li · Rohan Bhambhoria · Dahan Samuel · Xiaodan Zhu 🔗
-	The Revealed Preferences of Pre-authorized Licenses and Their Ethical Implications for Generative Models ( Poster ) >	Vinith Suriyakumar · Peter Menell · Dylan Hadfield-Menell · Ashia Wilson 🔗
-	Privacy, Transformed? Lessons from Generative Artificial Intelligence ( Poster ) >	Alicia Solow-Niederman 🔗
-	Computational Copyright: Towards A Royalty Model for Music Generative AI ( Poster ) >	Junwei Deng · Shiyuan Zhang · Jiaqi Ma 🔗
-	Chilling autonomy: Policy enforcement for human oversight of AI agents ( Poster ) >	Peter Cihon 🔗
-	Community Norms as Self-Regulation of Generative AI in Creative Industries ( Poster ) >	Falaah Arif Khan · Peter Hall · Betty L Hou 🔗
-	“Heart on My Sleeve”: From Memorization to Duty ( Poster ) >	Nathan Reitinger 🔗
-	MUSE: Machine Unlearning Six-Way Evaluation for Language Models ( Poster ) >	Weijia Shi · Jaechan Lee · Yangsibo Huang · Sadhika Malladi · Jieyu Zhao · Ari Holtzman · Daogao Liu · Luke Zettlemoyer · Noah Smith · Chiyuan Zhang 🔗
-	Liability and Insurance for Catastrophic Losses: the Nuclear Power Precedent and Lessons for AI ( Poster ) >	Cristian Trout 🔗
-	Insuring Uninsurable Risks from AI: Government as Insurer of Last Resort ( Poster ) >	Cristian Trout 🔗
-	Evaluations of Machine Learning Privacy Defenses are Misleading ( Poster ) >	Michael Aerni · Jie Zhang · Florian Tramer 🔗
-	Tracing datasets usage in the wild with data taggants ( Poster ) >	Wassim Bouaziz · El-Mahdi El-Mhamdi · Nicolas Usunier 🔗
-	Examining Data Compartmentalization for AI Governance ( Poster ) >	Nicole Mitchell · Peter Kairouz · Sébastien Krier · Eleni Triantafillou 🔗
-	Standardization of Behavioral Use Clauses is Necessary for the Adoption of Responsible Licensing of A ( Poster ) >	12 presenters Daniel McDuff · Tim Korjakow · Scott Cambo · Jesse Benjamin · Jenny Lee · Yacine Jernite · Aaron Gokaslan · Carlos Muñoz Ferrandis · Alek Tarkowski · Joseph Lindley · A. Feder Cooper · Danish Contractor 🔗
-	LLM Dataset Inference: Did you train on my dataset? ( Poster ) >	Pratyush Maini · Hengrui Jia · Nicolas Papernot · Adam Dziedzic 🔗
-	If You Give an LLM a Legal Practice Guide ( Poster ) >	Aaron D. Tucker · Colin Doyle 🔗
-	Real Risks for Fake Data: Synthetic Data, Diversity-Washing and Consent Circumvention ( Poster ) >	Cedric Whitney 🔗
-	Generative AI Risk Categorization Decoded: Comparing Public and Private Sector Policies ( Poster ) >	Yi Zeng · Kevin Klyman · Andy Zhou · Yu Yang · Minzhou Pan · Ruoxi Jia · Dawn Song · Percy Liang · Bo Li 🔗
-	L-FRESco: Factual Recall Evaluation Score for Legal Analysis Generation ( Poster ) >	Abe Hou · Zhengping Jiang · Guanghui Qin · Orion Weller · Andrew Blair-Stanek · Benjamin Van Durme 🔗
-	Ignore Safety Directions. Violate the CFAA? ( Spotlight ) >	Ram Shankar Siva Kumar · Kendra Albert · Jonathon Penney 🔗
-	Adversarial Perturbations Cannot Reliably Protect Artists From Generative AI ( Spotlight ) >	Robert Hönig · Javier Rando · Nicholas Carlini · Florian Tramer 🔗
-	Ordering Model Deletion ( Spotlight ) >	Daniel Wilf-Townsend 🔗
-	Fantastic Copyrighted Beasts and How (Not) to Generate Them ( Spotlight ) >	Luxi He · Yangsibo Huang · Weijia Shi · Tinghao Xie · Haotian Liu · Yue Wang · Luke Zettlemoyer · Chiyuan Zhang · Danqi Chen · Peter Henderson 🔗
-	Training Foundation Models as Data Compression: On Information, Model Weights and Copyright Law ( Spotlight ) >	Giorgio Franceschelli · Claudia Cevenini · Mirco Musolesi 🔗
-	Machine Unlearning Fails to Remove Data Poisoning Attacks ( Spotlight ) >	Martin Pawelczyk · Ayush Sekhari · Jimmy Di · Yiwei Lu · Gautam Kamath · Seth Neel 🔗