firstbacksecondback
109 Results
Poster
|
Tue 4:30 |
Finding NEM-U: Explaining unsupervised representation learning through neural network generated explanation masks Bjørn Leth Møller · Christian Igel · Kristoffer Wickstrøm · Jon Sporring · Robert Jenssen · Bulat Ibragimov |
|
Poster
|
Tue 2:30 |
Linear Explanations for Individual Neurons Tuomas Oikarinen · Lily Weng |
|
Poster
|
Tue 2:30 |
The Linear Representation Hypothesis and the Geometry of Large Language Models Kiho Park · Yo Joong Choe · Victor Veitch |
|
Poster
|
Wed 2:30 |
Interpreting and Improving Diffusion Models from an Optimization Perspective Frank Permenter · Chenyang Yuan |
|
Poster
|
Wed 2:30 |
Compositional Capabilities of Autoregressive Transformers: A Study on Synthetic, Interpretable Tasks Rahul Ramesh · Ekdeep Singh Lubana · Mikail Khona · Robert Dick · Hidenori Tanaka |
|
Poster
|
Wed 4:30 |
Dissecting Multimodality in VideoQA Transformer Models by Impairing Modality Fusion Ishaan Rawal · Alexander Matyasko · Shantanu Jaiswal · Basura Fernando · Cheston Tan |
|
Poster
|
Tue 4:30 |
Position: Amazing Things Come From Having Many Good Models Cynthia Rudin · Chudi Zhong · Lesia Semenova · Margo Seltzer · Ron Parr · Jiachang Liu · Srikar Katta · Jon Donnelly · Harry Chen · Zachery Boner |
|
Poster
|
Thu 4:30 |
Removing Spurious Concepts from Neural Network Representations via Joint Subspace Estimation Floris Holstege · Bram Wouters · Noud van Giersbergen · Cees Diks |
|
Poster
|
Wed 4:30 |
A Multimodal Automated Interpretability Agent Tamar Rott Shaham · Sarah Schwettmann · Franklin Wang · Achyuta Rajaram · Evan Hernandez · Jacob Andreas · Antonio Torralba |
|
Poster
|
Wed 2:30 |
InterpreTabNet: Distilling Predictive Signals from Tabular Data by Salient Feature Interpretation Jacob Si · Wendy Yusi Cheng · Michael Cooper · Rahul G. Krishnan |
|
Poster
|
Tue 2:30 |
Learning to Intervene on Concept Bottlenecks David Steinmann · Wolfgang Stammer · Felix Friedrich · Kristian Kersting |
|
Poster
|
Wed 4:30 |
Feature Attribution with Necessity and Sufficiency via Dual-stage Perturbation Test for Causal Explanation Xuexin Chen · Ruichu Cai · Zhengting Huang · Yuxuan Zhu · Julien Horwood · Zhifeng Hao · Zijian Li · Jose Miguel Hernandez-Lobato |