Skip to yearly menu bar Skip to main content


Search All 2024 Events
 

109 Results

<<   <   Page 3 of 10   >   >>
Poster
Tue 4:30 Finding NEM-U: Explaining unsupervised representation learning through neural network generated explanation masks
Bjørn Leth Møller · Christian Igel · Kristoffer Wickstrøm · Jon Sporring · Robert Jenssen · Bulat Ibragimov
Poster
Tue 2:30 Linear Explanations for Individual Neurons
Tuomas Oikarinen · Lily Weng
Poster
Tue 2:30 The Linear Representation Hypothesis and the Geometry of Large Language Models
Kiho Park · Yo Joong Choe · Victor Veitch
Poster
Wed 2:30 Interpreting and Improving Diffusion Models from an Optimization Perspective
Frank Permenter · Chenyang Yuan
Poster
Wed 2:30 Compositional Capabilities of Autoregressive Transformers: A Study on Synthetic, Interpretable Tasks
Rahul Ramesh · Ekdeep Singh Lubana · Mikail Khona · Robert Dick · Hidenori Tanaka
Poster
Wed 4:30 Dissecting Multimodality in VideoQA Transformer Models by Impairing Modality Fusion
Ishaan Rawal · Alexander Matyasko · Shantanu Jaiswal · Basura Fernando · Cheston Tan
Poster
Tue 4:30 Position: Amazing Things Come From Having Many Good Models
Cynthia Rudin · Chudi Zhong · Lesia Semenova · Margo Seltzer · Ron Parr · Jiachang Liu · Srikar Katta · Jon Donnelly · Harry Chen · Zachery Boner
Poster
Thu 4:30 Removing Spurious Concepts from Neural Network Representations via Joint Subspace Estimation
Floris Holstege · Bram Wouters · Noud van Giersbergen · Cees Diks
Poster
Wed 4:30 A Multimodal Automated Interpretability Agent
Tamar Rott Shaham · Sarah Schwettmann · Franklin Wang · Achyuta Rajaram · Evan Hernandez · Jacob Andreas · Antonio Torralba
Poster
Wed 2:30 InterpreTabNet: Distilling Predictive Signals from Tabular Data by Salient Feature Interpretation
Jacob Si · Wendy Yusi Cheng · Michael Cooper · Rahul G. Krishnan
Poster
Tue 2:30 Learning to Intervene on Concept Bottlenecks
David Steinmann · Wolfgang Stammer · Felix Friedrich · Kristian Kersting
Poster
Wed 4:30 Feature Attribution with Necessity and Sufficiency via Dual-stage Perturbation Test for Causal Explanation
Xuexin Chen · Ruichu Cai · Zhengting Huang · Yuxuan Zhu · Julien Horwood · Zhifeng Hao · Zijian Li · Jose Miguel Hernandez-Lobato