firstbacksecondback
15 Results
Oral
|
Tue 1:45 |
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision Collin Burns · Pavel Izmailov · Jan Kirchner · Bowen Baker · Leo Gao · Leopold Aschenbrenner · Yining Chen · Adrien Ecoffet · Manas Joglekar · Jan Leike · Ilya Sutskever · Jeffrey K Wu |
|
Poster
|
Tue 4:30 |
WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks? Alexandre Drouin · Maxime Gasse · Massimo Caccia · Issam Laradji · Manuel Del Verme · Tom Marty · David Vazquez · Nicolas Chapados · Alexandre Lacoste |
|
Poster
|
Wed 2:30 |
Compositional Capabilities of Autoregressive Transformers: A Study on Synthetic, Interpretable Tasks Rahul Ramesh · Ekdeep Singh Lubana · Mikail Khona · Robert Dick · Hidenori Tanaka |
|
Poster
|
Wed 4:30 |
MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities Weihao Yu · Zhengyuan Yang · Linjie Li · Jianfeng Wang · Kevin Lin · Zicheng Liu · Xinchao Wang · Lijuan Wang |
|
Poster
|
Tue 4:30 |
Sparsest Models Elude Pruning: An Exposé of Pruning’s Current Capabilities Stephen Zhang · Vardan Papyan |
|
Poster
|
Tue 2:30 |
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision Collin Burns · Pavel Izmailov · Jan Kirchner · Bowen Baker · Leo Gao · Leopold Aschenbrenner · Yining Chen · Adrien Ecoffet · Manas Joglekar · Jan Leike · Ilya Sutskever · Jeffrey K Wu |
|
Poster
|
Wed 4:30 |
Characterizing ResNet's Universal Approximation Capability Chenghao Liu · Enming Liang · Minghua Chen |
|
Workshop
|
Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive? Rylan Schaeffer · Hailey Schoelkopf · Brando Miranda · Gabriel Mukobi · Varun Madan · Adam Ibrahim · Herbie Bradley · Stella Biderman · Sanmi Koyejo |
||
Workshop
|
Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive? Rylan Schaeffer · Hailey Schoelkopf · Brando Miranda · Gabriel Mukobi · Varun Madan · Adam Ibrahim · Herbie Bradley · Stella Biderman · Sanmi Koyejo |
||
Workshop
|
Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive? Rylan Schaeffer · Hailey Schoelkopf · Brando Miranda · Gabriel Mukobi · Varun Madan · Adam Ibrahim · Herbie Bradley · Stella Biderman · Sanmi Koyejo |
||
Workshop
|
Sat 3:00 |
Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive? |
|
Workshop
|
Sat 1:00 |
Assessing the Zero-Shot Capabilities of LLMs for Action Evaluation in RL Eduardo Pignatelli · Johan Ferret · Davide Paglieri · Samuel Coward · Tim Rocktäschel · Edward Grefenstette · Laura Toni |