ICML 2024

Oral

Tue 1:45

Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
Collin Burns · Pavel Izmailov · Jan Kirchner · Bowen Baker · Leo Gao · Leopold Aschenbrenner · Yining Chen · Adrien Ecoffet · Manas Joglekar · Jan Leike · Ilya Sutskever · Jeffrey K Wu

Poster

Tue 4:30

WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?
Alexandre Drouin · Maxime Gasse · Massimo Caccia · Issam Laradji · Manuel Del Verme · Tom Marty · David Vazquez · Nicolas Chapados · Alexandre Lacoste

Poster

Wed 2:30

Compositional Capabilities of Autoregressive Transformers: A Study on Synthetic, Interpretable Tasks
Rahul Ramesh · Ekdeep Singh Lubana · Mikail Khona · Robert Dick · Hidenori Tanaka

Poster

Wed 4:30

MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities
Weihao Yu · Zhengyuan Yang · Linjie Li · Jianfeng Wang · Kevin Lin · Zicheng Liu · Xinchao Wang · Lijuan Wang

Poster

Tue 4:30

Sparsest Models Elude Pruning: An Exposé of Pruning’s Current Capabilities
Stephen Zhang · Vardan Papyan

Poster

Tue 2:30

Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
Collin Burns · Pavel Izmailov · Jan Kirchner · Bowen Baker · Leo Gao · Leopold Aschenbrenner · Yining Chen · Adrien Ecoffet · Manas Joglekar · Jan Leike · Ilya Sutskever · Jeffrey K Wu

Poster

Wed 4:30

Characterizing ResNet's Universal Approximation Capability
Chenghao Liu · Enming Liang · Minghua Chen

Workshop

Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?
Rylan Schaeffer · Hailey Schoelkopf · Brando Miranda · Gabriel Mukobi · Varun Madan · Adam Ibrahim · Herbie Bradley · Stella Biderman · Sanmi Koyejo

Workshop

Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?
Rylan Schaeffer · Hailey Schoelkopf · Brando Miranda · Gabriel Mukobi · Varun Madan · Adam Ibrahim · Herbie Bradley · Stella Biderman · Sanmi Koyejo

Workshop

Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?
Rylan Schaeffer · Hailey Schoelkopf · Brando Miranda · Gabriel Mukobi · Varun Madan · Adam Ibrahim · Herbie Bradley · Stella Biderman · Sanmi Koyejo

Workshop

Sat 3:00

Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?

Workshop

Sat 1:00

Assessing the Zero-Shot Capabilities of LLMs for Action Evaluation in RL
Eduardo Pignatelli · Johan Ferret · Davide Paglieri · Samuel Coward · Tim Rocktäschel · Edward Grefenstette · Laura Toni

Main Navigation

15 Results