Spotlight Poster
Shirley Wu · Michel Galley · Baolin Peng · Hao Cheng · Gavin Li · Yao Dou · Weixin Cai · James Zou · Jure Leskovec · Jianfeng Gao
[ East Exhibition Hall A-B ]
Abstract
Large Language Models are typically trained with next-turn rewards, limiting their ability to optimize for long-term interaction. As a result, they often respond passively to ambiguous or open-ended user requests, failing to help users reach their ultimate intents and leading to inefficient conversations. To address these limitations, we introduce CollabLLM, a novel and general training framework that enhances multiturn human-LLM collaboration. Its key innovation is a collaborative simulation that estimates the long-term contribution of responses using Multiturn-aware Rewards. By reinforcement fine-tuning on these rewards, CollabLLM goes beyond responding to user requests and actively uncovers user intent and offers insightful suggestions—a key step towards more human-centered AI. We also devise a multiturn interaction benchmark comprising three challenging tasks, such as document creation. CollabLLM significantly outperforms our baselines, with an average of 18.5% higher task performance and 46.3% improved interactivity as rated by LLM judges. Finally, we conduct a large user study with 201 judges, where CollabLLM increases user satisfaction by 17.6% and reduces the time users spend by 10.4%.
Poster
Kajetan Schweighofer · Adrián Arnaiz-Rodríguez · Sepp Hochreiter · Nuria Oliver
[ East Exhibition Hall A-B ]
Abstract
Ensembles of Deep Neural Networks, Deep Ensembles, are widely used as a simple way to boost predictive performance. However, their impact on algorithmic fairness is not yet well understood. Algorithmic fairness examines how a model's performance varies across socially relevant groups defined by protected attributes such as age, gender, or race. In this work, we explore the interplay between the performance gains from Deep Ensembles and fairness. Our analysis reveals that they unevenly favor different groups, a phenomenon that we term the disparate benefits effect. We empirically investigate this effect using popular facial analysis and medical imaging datasets with protected group attributes and find that it affects multiple established group fairness metrics, including statistical parity and equal opportunity. Furthermore, we identify that the per-group differences in predictive diversity of ensemble members can explain this effect. Finally, we demonstrate that the classical Hardt post-processing method is particularly effective at mitigating the disparate benefits effect of Deep Ensembles by leveraging their better-calibrated predictive distributions.
Poster
Vladimir Braverman · Prathamesh Dharangutte · Shaofeng Jiang · Hoai-An Nguyen · Chen Wang · Yubo Zhang · Samson Zhou
[ East Exhibition Hall A-B ]
Abstract
We study fair clustering problems in a setting where distance information is obtained from two sources: a strong oracle providing exact distances, but at a high cost, and a weak oracle providing potentially inaccurate distance estimates at a low cost. The goal is to produce a near-optimal fair clustering on $n$ input points with a minimum number of strong oracle queries. This models the increasingly common trade-off between accurate but expensive similarity measures (e.g., large-scale embeddings) and cheaper but inaccurate alternatives. The study of fair clustering in this model is motivated by the important quest of achieving fairness in the presence of inaccurate information. We achieve the first $(1+\varepsilon)$-coresets for fair $k$-median clustering using $\text{poly}\left(\frac{k}{\varepsilon}\cdot\log n\right)$ queries to the strong oracle. Furthermore, our results imply coresets for the standard setting (without fairness constraints), and we in fact obtain $(1+\varepsilon)$-coresets for $(k,z)$-clustering for general $z=O(1)$ with a similar number of strong oracle queries. In contrast, previous results achieved a constant-factor $(>10)$ approximation for the standard $k$-clustering problems, and no previous work considered the fair $k$-median clustering problem.
Poster
Zichong Wang · Wenbin Zhang
[ East Exhibition Hall A-B ]
Abstract
Graph generation models have shown significant potential across various domains. However, despite their success, these models often inherit societal biases, limiting their adoption in real-world applications. Existing research on fairness in graph generation primarily addresses structural bias, overlooking the critical issue of feature bias. To address this gap, we propose FDGen, a novel approach that defines and mitigates both feature and structural biases in graph generation models. Furthermore, we provide a theoretical analysis of how bias sources in graph data contribute to disparities in graph generation tasks. Experimental results on four real-world datasets demonstrate that FDGen outperforms state-of-the-art methods, achieving notable improvements in fairness while maintaining competitive generation performance.
Poster
Jiaru Qian · Guancheng Wan · Wenke Huang · Guibin Zhang · Yuxin Wu · Bo Du · Mang Ye
[ East Exhibition Hall A-B ]
Abstract
Federated Graph Learning (FGL) offers an effective approach to collaboratively training Graph Neural Networks (GNNs) while maintaining privacy. Nevertheless, communication efficiency becomes a critical bottleneck in environments with limited resources. In this context, one-shot FGL emerges as a promising solution by restricting communication to a single round. However, prevailing FGL methods face two key challenges in the one-shot setting: 1) They heavily rely on gradual personalized optimization over multiple rounds, undermining the capability of the global model to efficiently generalize across diverse graph structures. 2) They are prone to overfitting to local data distributions due to extreme structural bias, leading to catastrophic forgetting. To address these issues, we introduce **GHOST**, an innovative one-shot FGL framework. In GHOST, we establish a proxy model for each client to leverage diverse local knowledge and integrate it to train the global model. During training, we identify and consolidate parameters essential for capturing topological knowledge, thereby mitigating catastrophic forgetting. Extensive experiments on real-world tasks demonstrate the superiority and generalization capability of GHOST. The code is available at https://github.com/JiaruQian/GHOST.
Poster
Xuankun Rong · Jianshu Zhang · Kun He · Mang Ye
[ East Exhibition Hall A-B ]
Abstract
Generative replay (GR) has been extensively validated in continual learning as a mechanism to synthesize data and replay past knowledge to mitigate forgetting. By leveraging synthetic rather than real data for the replay, GR has been adopted in some federated continual learning (FCL) approaches to ensure the privacy of client-side data. While existing GR-based FCL approaches have introduced improvements, none of their enhancements specifically take into account the unique characteristics of federated learning settings. Beyond privacy constraints, what other fundamental aspects of federated learning should be explored in the context of FCL? In this work, we explore the potential benefits that come from emphasizing the role of clients throughout the process. We begin by highlighting two key observations: (a) Client Expertise Superiority, where clients, rather than the server, act as domain experts, and (b) Client Forgetting Variance, where heterogeneous data distributions across clients lead to varying levels of forgetting. Building on these insights, we propose CAN (Clients As Navigators), highlighting the pivotal role of clients in both data synthesis and data replay. Extensive evaluations demonstrate that this client-centric approach achieves state-of-the-art performance. Notably, it requires a smaller buffer size, reducing storage overhead and enhancing computational efficiency.
Poster
Haoqi Wu · Wei Dai · Wang Li · Qiang Yan
[ East Exhibition Hall A-B ]
Abstract
Large Language Models (LLMs) have gained significant popularity due to their remarkable capabilities in text understanding and generation. However, despite their widespread deployment in inference services such as ChatGPT, concerns about the potential leakage of sensitive user data have arisen. Existing solutions primarily rely on privacy-enhancing technologies to mitigate such risks, facing a trade-off among efficiency, privacy, and utility. To narrow this gap, we propose Cape, a context-aware prompt perturbation mechanism based on differential privacy, to enable efficient inference with an improved privacy-utility trade-off. Concretely, we introduce a hybrid utility function that better captures token similarity. Additionally, we propose a bucketized sampling mechanism to handle the large sampling space, which might otherwise lead to long-tail phenomena. Extensive experiments across multiple datasets, along with ablation studies, demonstrate that Cape achieves a better privacy-utility trade-off compared to prior state-of-the-art works.
Spotlight Poster
Jan Schuchardt · Mina Dalirrooyfard · Jed Guzelkabaagac · Anderson Schneider · Yuriy Nevmyvaka · Stephan Günnemann
[ East Exhibition Hall A-B ]
Abstract
Many forms of sensitive data, such as web traffic, mobility data, or hospital occupancy, are inherently sequential. The standard method for training machine learning models while ensuring privacy for units of sensitive information, such as individual hospital visits, is differentially private stochastic gradient descent (DP-SGD). However, we observe in this work that the formal guarantees of DP-SGD are incompatible with time series specific tasks like forecasting, since they rely on the *privacy amplification* attained by training on small, unstructured batches sampled from an unstructured dataset. In contrast, batches for forecasting are generated by (1) sampling sequentially structured time series from a dataset, (2) sampling contiguous subsequences from these series, and (3) partitioning them into context and ground-truth forecast windows. We theoretically analyze the privacy amplification attained by this *structured subsampling* to enable the training of forecasting models with sound and tight event- and user-level privacy guarantees. Towards more private models, we additionally prove how data augmentation amplifies privacy in self-supervised training of sequence models. Our empirical evaluation demonstrates that amplification by structured subsampling enables the training of forecasting models with strong formal privacy guarantees.
Poster
Hilal Asi · Vinod Raman · Aadirupa Saha
[ East Exhibition Hall A-B ]
Abstract
We design differentially private algorithms for the problem of prediction with expert advice under dynamic regret, also known as tracking the best expert. Our work addresses three natural types of adversaries: stochastic with shifting distributions, oblivious, and adaptive, and designs algorithms with sub-linear regret for all three cases. In particular, under a shifting stochastic adversary where the distribution may shift $S$ times, we provide an $\epsilon$-differentially private algorithm whose expected dynamic regret is at most $O\left( \sqrt{S T \log (NT)} + \frac{S \log (NT)}{\epsilon}\right)$, where $T$ and $N$ are the time horizon and number of experts, respectively. For oblivious adversaries, we give a reduction from dynamic regret minimization to static regret minimization, resulting in an upper bound of $O\left(\sqrt{S T \log(NT)} + \frac{S T^{1/3}\log(T/\delta) \log(NT)}{\epsilon ^{2/3}}\right)$ on the expected dynamic regret, where $S$ now denotes the allowable number of switches of the best expert. Finally, similar to static regret, we establish a fundamental separation between oblivious and adaptive adversaries for the dynamic setting: while our algorithms show that sub-linear regret is achievable for oblivious adversaries in the high-privacy regime $\epsilon \le \sqrt{S/T}$, we show that any $(\epsilon, \delta)$-differentially private algorithm must suffer linear dynamic regret under adaptive adversaries for $\epsilon …
Poster
Clément Pierquin · Aurélien Bellet · Marc Tommasi · Matthieu Boussard
[ East Exhibition Hall A-B ]
Abstract
Synthetic data inherits the differential privacy guarantees of the model used to generate it. Additionally, synthetic data may benefit from privacy amplification when the generative model is kept hidden. While empirical studies suggest this phenomenon, a rigorous theoretical understanding is still lacking. In this paper, we investigate this question through the well-understood framework of linear regression. First, we establish negative results showing that if an adversary controls the seed of the generative model, a single synthetic data point can leak as much information as releasing the model itself. Conversely, we show that when synthetic data is generated from random inputs, releasing a limited number of synthetic data points amplifies privacy beyond the model's inherent guarantees. We believe our findings in linear regression can serve as a foundation for deriving more general bounds in the future.
Spotlight Poster
Saketh Bachu · Erfan Shayegani · Rohit Lal · Trishna Chakraborty · Arindam Dutta · Chengyu Song · Yue Dong · Nael Abu-Ghazaleh · Amit Roy-Chowdhury
[ East Exhibition Hall A-B ]
Abstract
Vision-language models (VLMs) have improved significantly in their capabilities, but their complex architecture makes their safety alignment challenging. In this paper, we reveal an uneven distribution of harmful information across the intermediate layers of the image encoder and show that skipping a certain set of layers and exiting early can increase the chance of the VLM generating harmful responses. We call this the “Image enCoder Early-exiT” (ICET) vulnerability. Our experiments across three VLMs: LLaVA-1.5, LLaVA-NeXT, and Llama 3.2 show that performing early exits from the image encoder significantly increases the likelihood of generating harmful outputs. To tackle this, we propose a simple yet effective modification of the Clipped-Proximal Policy Optimization (Clip-PPO) algorithm for performing layer-wise multi-modal RLHF for VLMs. We term this Layer-Wise PPO (L-PPO). We evaluate our L-PPO algorithm across three multi-modal datasets and show that it consistently reduces the harmfulness caused by early exits.
Poster
Lily Zhang · Rajesh Ranganath
[ East Exhibition Hall A-B ]
Abstract
Preference learning, or the task of aligning generative models to preference comparison data, has yet to reach the conceptual maturity of classification, density estimation, etc. To close this gap, this work presents a framework to understand preference learning starting from the sampling distribution of pairwise preference data. First, we prove that the only evaluation of a generative model that respects both preferences and prevalences in the data distribution is a form of win rate, justifying win rate as the focal point to understand preference learning. We then analyze preference learning methods as win rate optimization (WRO) or non-WRO. We present novel instances of WRO beyond existing examples (RLHF, NLHF) and identify two key theoretical benefits of all such methods. We prove that common non-WRO methods like DPO and SFT on preferred samples lack these properties and suggest ways to mitigate such theoretical limitations. We also show that WRO underperforms in practice due to optimization difficulties and that optimization success predicts performance better than choices which affect the objective's solution. Our analysis highlights best practices for existing methods and provides recommendations for future research, guided by the principle that one should either align non-WRO methods more closely with WRO or improve the …
Poster
Grace Luo · Trevor Darrell · Amir Bar
[ East Exhibition Hall A-B ]
Abstract
Autoregressive vision-language models (VLMs) can handle many tasks within a single model, yet the representations that enable this capability remain opaque. We find that VLMs align conceptually equivalent inputs into a shared task vector, which is invariant to modality (text, image) and format (examples, instruction), and may simplify VLM processing. We measure this alignment via cross-modal transfer--the ability of a task vector derived in one modality to trigger the correct generation in another--on a range of tasks and model architectures. Although the task vector is highly compressed, we find that this single vector outperforms prompting the model with the full task information, unique to this cross-modal case. Furthermore, we show that task vectors can be transferred from a base language model to its fine-tuned vision-language counterpart, and that they can be derived solely from instructions without the need for examples. Taken together, our findings shed light on how VLMs internally process task information, and how they map different modalities into common semantic representations.
Poster
Nicholas Goldowsky-Dill · Bilal Chughtai · Stefan Heimersheim · Marius Hobbhahn
[ East Exhibition Hall A-B ]
Abstract
AI models might use deceptive strategies as part of scheming or misaligned behaviour. Monitoring outputs alone is insufficient, since the AI might produce seemingly benign outputs while its internal reasoning is misaligned. We thus evaluate if linear probes can robustly detect deception by monitoring model activations. We test two probe-training datasets, one with contrasting instructions to be honest or deceptive (following Zou et al. (2023)) and one of responses to simple roleplaying scenarios. We test whether these probes generalize to realistic settings where Llama-3.3-70B-Instruct behaves deceptively, such as concealing insider trading Scheurer et al. (2023) and purposely underperforming on safety evaluations Benton et al. (2024). We find that our probe distinguishes honest and deceptive responses with AUROCs between 0.96 and 0.999 on our evaluation datasets. If we set the decision threshold to have a 1\% false positive rate on chat data not related to deception, our probe catches 95-99\% of the deceptive responses. Overall we think white-box probes are promising for future monitoring systems, but current performance is insufficient as a robust defence against deception. Our probes' outputs can be viewed at https://data.apolloresearch.ai/dd/ and our code at https://github.com/ApolloResearch/deception-detection.
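A minimal sketch of the probing setup described above, with synthetic stand-ins for the model activations (the real probes are trained on Llama-3.3-70B-Instruct activations; the dimensions, mean shift, and data here are illustrative only):

```python
# Minimal probe sketch: logistic regression on (synthetic) activations,
# evaluated by AUROC, mirroring the honest-vs-deceptive probing setup.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(0)
d = 512  # placeholder hidden size, far smaller than a 70B model's

honest = rng.normal(0.0, 1.0, size=(500, d))     # stand-in activations
deceptive = rng.normal(0.2, 1.0, size=(500, d))  # small synthetic mean shift
X = np.vstack([honest, deceptive])
y = np.array([0] * 500 + [1] * 500)

idx = rng.permutation(len(y))
train, test = idx[:800], idx[800:]
probe = LogisticRegression(max_iter=1000).fit(X[train], y[train])
scores = probe.predict_proba(X[test])[:, 1]
print("AUROC:", roc_auc_score(y[test], scores))
```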
Poster
Kayo Yin · Jacob Steinhardt
[ East Exhibition Hall A-B ]
Abstract
Large language models (LLMs) exhibit impressive in-context learning (ICL) capability, enabling them to generate relevant responses from a handful of task demonstrations in the prompt. Prior studies have suggested two different explanations for the mechanisms behind ICL: induction heads that find and copy relevant tokens, and function vector (FV) heads whose activations compute a latent encoding of the ICL task. To better understand which of the two distinct mechanisms drives ICL, we study and compare induction heads and FV heads in 12 language models. Through detailed ablations, we find that few-shot ICL is driven primarily by FV heads, especially in larger models. We also find that FV and induction heads are connected: many FV heads start as induction heads during training before transitioning to the FV mechanism. This leads us to speculate that induction facilitates learning the more complex FV mechanism for ICL.
Poster
Seokhun Park · Insung Kong · yongchan Choi · Chanmoo Park · Yongdai Kim
[ East Exhibition Hall A-B ]
Abstract
Interpretability for machine learning models is becoming more and more important as machine learning models become more complex. The functional ANOVA model, which decomposes a high-dimensional function into a sum of lower-dimensional functions (commonly referred to as components), is one of the most popular tools for interpretable AI, and recently, various neural networks have been developed for estimating each component in the functional ANOVA model. However, such neural networks are highly unstable when estimating each component since the components themselves are not uniquely defined. That is, there are multiple functional ANOVA decompositions for a given function. In this paper, we propose a novel neural network which guarantees a unique functional ANOVA decomposition and thus is able to estimate each component stably. We call our proposed neural network ANOVA Tensor Product Neural Network (ANOVA-TPNN) since it is motivated by the tensor product basis expansion. Theoretically, we prove that ANOVA-TPNN can approximate any smooth function well. Empirically, we show that ANOVA-TPNN provides much more stable estimation of each component, and thus much more stable interpretation when training data and initial values of the model parameters vary, than existing neural networks do. Our source code is released at https://github.com/ParkSeokhun/ANOVA-TPNN
Poster
Moritz Vandenhirtz · Julia Vogt
[ East Exhibition Hall A-B ]
Abstract
Understanding the decision-making process of machine learning models provides valuable insights into the task, the data, and the reasons behind a model's failures. In this work, we propose a method that performs inherently interpretable predictions through the instance-wise sparsification of input images. To align the sparsification with human perception, we learn the masking in the space of semantically meaningful pixel regions rather than on pixel-level. Additionally, we introduce an explicit way to dynamically determine the required level of sparsity for each instance. We show empirically on semi-synthetic and natural image datasets that our inherently interpretable classifier produces more meaningful, human-understandable predictions than state-of-the-art benchmarks.
Poster
Jinyang Liu · Tessa Steensgaard · Marvin N. Wright · Niklas Pfister · Munir Hiabu
[ East Exhibition Hall A-B ]
Abstract
Many existing interpretation methods are based on Partial Dependence (PD) functions that, for a pre-trained machine learning model, capture how a subset of the features affects the predictions by averaging over the remaining features. Notable methods include Shapley additive explanations (SHAP) which computes feature contributions based on a game theoretical interpretation and PD plots (i.e., 1-dim PD functions) that capture average marginal main effects. Recent work has connected these approaches using a functional decomposition and argues that SHAP values can be misleading since they merge main and interaction effects into a single local effect. However, a major advantage of SHAP compared to other PD-based interpretations has been the availability of fast estimation techniques, such as `TreeSHAP`. In this paper, we propose a new tree-based estimator, `FastPD`, which efficiently estimates arbitrary PD functions. We show that `FastPD` consistently estimates the desired population quantity -- in contrast to path-dependent `TreeSHAP` which is inconsistent when features are correlated. For moderately deep trees, `FastPD` improves the complexity of existing methods from quadratic to linear in the number of observations. By estimating PD functions for arbitrary feature subsets, `FastPD` can be used to extract PD-based interpretations such as SHAP, PD plots and higher-order interaction effects.
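For intuition, a brute-force reference estimator of a 1-dim PD function is easy to write down; `FastPD` computes the same population quantity far more efficiently for tree ensembles. A sketch with an illustrative model and synthetic data:

```python
# Brute-force PD estimator: PD_S(v) = mean_i f(x_i with features S set to v).
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)
X = rng.normal(size=(300, 4))
y = X[:, 0] + X[:, 0] * X[:, 1] + rng.normal(scale=0.1, size=300)
model = RandomForestRegressor(n_estimators=50, random_state=0).fit(X, y)

def partial_dependence(model, X, feature, grid):
    out = []
    for v in grid:
        Xv = X.copy()
        Xv[:, feature] = v                    # intervene on the chosen feature
        out.append(model.predict(Xv).mean())  # average over remaining features
    return np.array(out)

print(partial_dependence(model, X, feature=0, grid=np.linspace(-2, 2, 9)))
```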
Spotlight Poster
Zhengxuan Wu · Aryaman Arora · Atticus Geiger · Zheng Wang · Jing Huang · Dan Jurafsky · Christopher Manning · Christopher Potts
[ East Exhibition Hall A-B ]
Abstract
Fine-grained steering of language model outputs is essential for safety and reliability. Prompting and finetuning are widely used to achieve these goals, but interpretability researchers have proposed a variety of representation-based techniques as well, including sparse autoencoders (SAEs), linear artificial tomography, supervised steering vectors, linear probes, and representation finetuning. At present, there is no benchmark for making direct comparisons between these proposals. Therefore, we introduce AxBench, a large-scale benchmark for steering and concept detection, and report experiments on Gemma-2-2B and 9B. For steering, we find that prompting outperforms all existing methods, followed by finetuning. For concept detection, representation-based methods, such as difference-in-means, perform best. On both evaluations, SAEs are not competitive. We introduce a novel weakly-supervised representational method (Rank-1 Representation Finetuning; ReFT-r1), which is competitive on both tasks while providing the interpretability advantages that prompting lacks. Along with AxBench, we train and publicly release SAE-scale feature dictionaries for ReFT-r1 and DiffMean.
Poster
Junwei Su · Chuan Wu
[ East Exhibition Hall A-B ]
Abstract
This paper studies the interplay between learning algorithms and graph structure for graph neural networks (GNNs). Existing theoretical studies on the learning dynamics of GNNs primarily focus on the convergence rates of learning algorithms under the interpolation regime (noise-free) and offer only a crude connection between these dynamics and the actual graph structure (e.g., maximum degree). This paper aims to bridge this gap by investigating the excess risk (generalization performance) of learning algorithms in GNNs within the generalization regime (with noise). Specifically, we extend the conventional settings from the learning theory literature to the context of GNNs and examine how graph structure influences the performance of learning algorithms such as stochastic gradient descent (SGD) and Ridge regression. Our study makes several key contributions toward understanding the interplay between graph structure and learning in GNNs. First, we derive the excess risk profiles of SGD and Ridge regression in GNNs and connect these profiles to the graph structure through spectral graph theory. With this established framework, we further explore how different graph structures (regular vs. power-law) impact the performance of these algorithms through comparative analysis. Additionally, we extend our analysis to multi-layer linear GNNs, revealing an increasing non-isotropic effect on the excess …
Poster
Santiago Cortes-Gomez · Naveen Raman · Aarti Singh · Bryan Wilder
[ East Exhibition Hall A-B ]
Abstract
Randomized controlled trials (RCTs) generate guarantees for treatment effects. However, RCTs often spend unnecessary resources exploring sub-optimal treatments, which can reduce the power of treatment guarantees. To address this, we propose a two-stage RCT design. In the first stage, a data-driven screening procedure prunes low-impact treatments, while the second stage focuses on developing high-probability lower bounds for the best-performing treatment effect. Unlike existing adaptive RCT frameworks, our method is simple enough to be implemented in scenarios with limited adaptivity. We derive optimal designs for two-stage RCTs and demonstrate how such designs can be implemented through sample splitting. Empirically, we demonstrate that two-stage designs improve upon single-stage approaches, especially in scenarios where domain knowledge is available through a prior. Our work thus provides a simple yet effective design for RCTs, optimizing for the ability to certify with high probability the largest possible treatment effect for at least one of the arms studied.
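A toy sketch of the two-stage idea under assumed Gaussian outcomes; the arm means, stage sizes, and normal-approximation bound are all illustrative, not the paper's optimal design:

```python
# Two-stage sketch: stage 1 screens arms on one split; stage 2 spends the
# remaining budget on the survivor and reports a one-sided lower confidence
# bound (normal approximation). All numbers are illustrative.
import numpy as np

rng = np.random.default_rng(0)
true_effects = np.array([0.05, 0.10, 0.40])  # unknown in practice
n1, n2, z = 100, 400, 1.645                  # stage sizes, 95% one-sided

# Stage 1: equal allocation, keep only the best-looking arm.
stage1 = [rng.normal(mu, 1.0, n1) for mu in true_effects]
best = int(np.argmax([s.mean() for s in stage1]))

# Stage 2: fresh samples keep the bound valid despite data-driven screening.
s = rng.normal(true_effects[best], 1.0, n2)
lcb = s.mean() - z * s.std(ddof=1) / np.sqrt(n2)
print(f"arm {best}: 95% lower bound on effect = {lcb:.3f}")
```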
Poster
Adam Karvonen · Can Rager · Johnny Lin · Curt Tigges · Joseph Bloom · David Chanin · Yeu-Tong Lau · Eoin Farrell · Callum McDougall · Kola Ayonrinde · Demian Till · Matthew Wearden · Arthur Conmy · Samuel Marks · Neel Nanda
[ East Exhibition Hall A-B ]
Abstract
Sparse autoencoders (SAEs) are a popular technique for interpreting language model activations, and there is extensive recent work on improving SAE effectiveness. However, most prior work evaluates progress using unsupervised proxy metrics with unclear practical relevance. We introduce SAEBench, a comprehensive evaluation suite that measures SAE performance across eight diverse metrics, spanning interpretability, feature disentanglement and practical applications like unlearning. To enable systematic comparison, we open-source a suite of over 200 SAEs across seven recently proposed SAE architectures and training algorithms. Our evaluation reveals that gains on proxy metrics do not reliably translate to better practical performance. For instance, while Matryoshka SAEs slightly underperform on existing proxy metrics, they substantially outperform other architectures on feature disentanglement metrics; moreover, this advantage grows with SAE scale. By providing a standardized framework for measuring progress in SAE development, SAEBench enables researchers to study scaling trends and make nuanced comparisons between different SAE architectures and training methodologies. Our interactive interface enables researchers to flexibly visualize relationships between metrics across hundreds of open-source SAEs at www.neuronpedia.org/sae-bench
Poster
Bart Bussmann · Noa Nabeshima · Adam Karvonen · Neel Nanda
[ East Exhibition Hall A-B ]
Abstract
Sparse autoencoders (SAEs) have emerged as a powerful tool for interpreting neural networks by extracting the concepts represented in their activations. However, choosing the size of the SAE dictionary (i.e. number of learned concepts) creates a tension: as dictionary size increases to capture more relevant concepts, sparsity incentivizes features to be split or absorbed into more specific features, leaving high-level features missing or warped. We introduce Matryoshka SAEs, a novel variant that addresses these issues by simultaneously training multiple nested dictionaries of increasing size, forcing the smaller dictionaries to independently reconstruct the inputs without using the larger dictionaries. This organizes features hierarchically - the smaller dictionaries learn general concepts, while the larger dictionaries learn more specific concepts, without incentive to absorb the high-level features. We train Matryoshka SAEs on Gemma-2-2B and TinyStories and find superior performance on sparse probing and targeted concept erasure tasks, more disentangled concept representations, and reduced feature absorption. While there is a minor tradeoff with reconstruction performance, we believe Matryoshka SAEs are a superior alternative for practical tasks, as they enable training arbitrarily large SAEs while retaining interpretable features at different levels of abstraction.
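A minimal PyTorch sketch of the nested-reconstruction objective; the dictionary sizes, ReLU encoder, and L1 penalty are illustrative simplifications of the paper's training setup:

```python
# Nested-reconstruction sketch: every prefix of the dictionary must
# reconstruct the input on its own, pushing small prefixes toward
# general, high-level features.
import torch

d_model, sizes = 64, (128, 512, 2048)  # nested dictionary sizes
enc = torch.nn.Linear(d_model, sizes[-1])
dec = torch.nn.Linear(sizes[-1], d_model, bias=False)

def matryoshka_loss(x, l1=1e-3):
    z = torch.relu(enc(x))  # sparse codes over the full dictionary
    loss = 0.0
    for m in sizes:  # each prefix reconstructs independently
        x_hat = z[:, :m] @ dec.weight.T[:m, :]
        loss = loss + ((x - x_hat) ** 2).mean() + l1 * z[:, :m].abs().mean()
    return loss

x = torch.randn(32, d_model)
print(matryoshka_loss(x))
```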
Poster
Xinhao Zheng · Huiqi Deng · Quanshi Zhang
[ East Exhibition Hall A-B ]
Abstract
This paper focuses on the fundamental challenge of partitioning input variables in attribution methods for Explainable AI, particularly in Shapley value-based approaches. Previous methods always compute attributions given a predefined partition but lack theoretical guidance on how to form meaningful variable partitions. We identify that attribution conflicts arise when the attribution of a coalition differs from the sum of its individual variables' attributions. To address this, we analyze the numerical effects of AND-OR interactions in AI models and extend the Shapley value to a new attribution metric for variable coalitions. Our theoretical findings reveal that specific interactions cause attribution conflicts, and we propose three metrics to evaluate coalition faithfulness. Experiments on synthetic data, NLP, image classification, and the game of Go validate our approach, demonstrating consistency with human intuition and practical applicability.
Poster
Chenchen Gu · Xiang Li · Rohith Kuditipudi · Percy Liang · Tatsunori Hashimoto
[ East Exhibition Hall A-B ]
Abstract
Prompt caching in large language models (LLMs) results in data-dependent timing variations: cached prompts are processed faster than non-cached prompts. These timing differences introduce the risk of side-channel timing attacks. For example, if the cache is shared across users, an attacker could identify cached prompts from fast API response times to learn information about other users' prompts. Because prompt caching may cause privacy leakage, transparency around the caching policies of API providers is important. To this end, we develop and conduct statistical audits to detect prompt caching in real-world LLM API providers. We detect global cache sharing across users in seven API providers, including OpenAI, resulting in potential privacy leakage about users' prompts. Timing variations due to prompt caching can also result in leakage of information about model architecture. Namely, we find evidence that OpenAI's embedding model is a decoder-only Transformer, which was previously not publicly known.
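A sketch of such a timing audit, assuming a hypothetical `send_prompt` client stub (replace it with a real provider call); the nonparametric comparison below is a standard choice, not necessarily the authors' exact statistical procedure:

```python
# Timing audit sketch: compare latencies of (expected-)cached vs fresh
# prompts with a one-sided Mann-Whitney U test.
import time
import secrets
from scipy.stats import mannwhitneyu

def send_prompt(prompt: str) -> None:
    """Hypothetical stand-in for a provider API call; replace before use."""
    time.sleep(0.01)  # simulated network + inference latency

def latency(prompt: str) -> float:
    t0 = time.perf_counter()
    send_prompt(prompt)
    return time.perf_counter() - t0

base = "Summarize the following text: " + "lorem ipsum " * 200
send_prompt(base)  # warm any prefix cache
cached = [latency(base) for _ in range(25)]
# Random prefixes prevent prefix-cache hits for the comparison group.
fresh = [latency(secrets.token_hex(8) + " " + base) for _ in range(25)]

# Under cache sharing, cached prompts should be significantly faster.
print(mannwhitneyu(cached, fresh, alternative="less"))
```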
Poster
Yinhan He · Wendy Zheng · Yushun Dong · Yaochen Zhu · Chen Chen · Jundong Li
[ East Exhibition Hall A-B ]
Abstract
Mechanistic interpretability (MI) research aims to understand large language models (LLMs) by identifying computational circuits, subgraphs of model components with associated functional interpretations, that explain specific behaviors. Current MI approaches focus on discovering task-specific circuits, which has two key limitations: (1) poor generalizability across different language tasks, and (2) high costs associated with requiring human or advanced LLM interpretation of each computational node. To address these challenges, we propose developing a ``modular circuit (MC) vocabulary'' consisting of task-agnostic functional units. Each unit consists of a small computational subgraph with its interpretation. This approach enables global interpretability by allowing different language tasks to share common MCs, while reducing costs by reusing established interpretations for new tasks. We establish five criteria for characterizing the MC vocabulary and present ModCirc, a novel global-level mechanistic interpretability framework for discovering MC vocabularies in LLMs. We demonstrate ModCirc's effectiveness by showing that it can identify modular circuits that perform well on various metrics.
Poster
Tao Tao · Darshil Doshi · Dayal Singh Kalra · Tianyu He · Maissam Barkeshli
[ East Exhibition Hall A-B ]
Abstract
Transformers excel at discovering patterns in sequential data, yet their fundamental limitations and learning mechanisms remain crucial topics of investigation. In this paper, we study the ability of Transformers to learn pseudo-random number sequences from linear congruential generators (LCGs), defined by the recurrence relation $x_{t+1} = a x_t + c \;\mathrm{mod}\; m$. We find that with sufficient architectural capacity and training data variety, Transformers can perform in-context prediction of LCG sequences with unseen moduli ($m$) and parameters ($a,c$). By analyzing the embedding layers and attention patterns, we uncover how Transformers develop algorithmic structures to learn these sequences in two scenarios of increasing complexity. First, we investigate how Transformers learn LCG sequences with unseen ($a, c$) but fixed modulus; and demonstrate successful learning up to $m = 2^{32}$. We find that models learn to factorize $m$ and utilize digit-wise number representations to make sequential predictions. In the second, more challenging scenario of unseen moduli, we show that Transformers can generalize to unseen moduli up to $m_{\text{test}} = 2^{16}$. In this case, the model employs a two-step strategy: first estimating the unknown modulus from the context, then utilizing prime factorizations to generate predictions. For this task, we observe a sharp transition in …
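For reference, generating an LCG training sequence is a one-line recurrence; a minimal sketch with illustrative parameters:

```python
# LCG sequence generation: x_{t+1} = (a * x_t + c) mod m.
import numpy as np

def lcg_sequence(a, c, m, x0, length):
    xs = [x0]
    for _ in range(length - 1):
        xs.append((a * xs[-1] + c) % m)
    return np.array(xs)

# e.g. one in-context example at a fixed modulus m = 2**16
print(lcg_sequence(a=75, c=74, m=2**16, x0=12345, length=10))
```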
Poster
Amr Alkhatib · Roman Bresson · Henrik Boström · Michalis Vazirgiannis
[ East Exhibition Hall A-B ]
Abstract
Shapley values have several desirable, theoretically well-supported, properties for explaining black-box model predictions. Traditionally, Shapley values are computed post-hoc, leading to additional computational cost at inference time. To overcome this, a novel method, called ViaSHAP, is proposed, that learns a function to compute Shapley values, from which the predictions can be derived directly by summation. Two approaches to implement the proposed method are explored; one based on the universal approximation theorem and the other on the Kolmogorov-Arnold representation theorem. Results from a large-scale empirical investigation are presented, showing that ViaSHAP using Kolmogorov-Arnold Networks performs on par with state-of-the-art algorithms for tabular data. It is also shown that the explanations of ViaSHAP are significantly more accurate than the popular approximator FastSHAP on both tabular data and images.
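A sketch of the "predict by summing Shapley values" interface, using a plain MLP in place of the paper's universal-approximation and Kolmogorov-Arnold instantiations; all names here are illustrative:

```python
# Interface sketch: a network outputs per-feature Shapley-style values
# phi(x); the prediction is base + sum(phi), so explanation and prediction
# come from the same forward pass.
import torch

class ViaSHAPLike(torch.nn.Module):
    def __init__(self, d_in, hidden=64):
        super().__init__()
        self.phi = torch.nn.Sequential(
            torch.nn.Linear(d_in, hidden), torch.nn.ReLU(),
            torch.nn.Linear(hidden, d_in))  # one value per feature
        self.base = torch.nn.Parameter(torch.zeros(1))

    def forward(self, x):
        contributions = self.phi(x)  # doubles as the explanation
        return self.base + contributions.sum(-1), contributions

model = ViaSHAPLike(d_in=8)
pred, phi = model(torch.randn(4, 8))
print(pred.shape, phi.shape)
```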
Poster
Gouki Minegishi · Hiroki Furuta · Shohei Taniguchi · Yusuke Iwasawa · Yutaka Matsuo
[ East Exhibition Hall A-B ]
Abstract
Transformer-based language models exhibit In-Context Learning (ICL), where predictions are made adaptively based on context. While prior work links induction heads to ICL through a sudden jump in accuracy, this can only account for ICL when the answer is included within the context. However, an important property of practical ICL in large language models is the ability to meta-learn how to solve tasks from context, rather than just copying answers from context; how such an ability is obtained during training is largely unexplored. In this paper, we experimentally clarify how such meta-learning ability is acquired by analyzing the dynamics of the model's circuit during training. Specifically, we extend the copy task from previous research into an In-Context Meta Learning setting, where models must infer a task from examples to answer queries. Interestingly, in this setting, we find that there are multiple phases in the process of acquiring such abilities, and that a unique circuit emerges in each phase, contrasting with the single-phase change in induction heads. The emergence of such circuits can be related to several phenomena known in large language models, and our analysis leads to a deeper understanding of the source of the transformer's ICL ability.
Poster
Mozhi Zhang · Howe Tissue · Lu Wang · Xipeng Qiu
[ East Exhibition Hall A-B ]
Abstract
We introduce *Domain2Vec*, a novel approach that decomposes any dataset into a linear combination of several *meta-domains*, a new concept designed to capture the key underlying features of datasets. *Domain2Vec* maintains a vocabulary of meta-domains and uses a classifier to decompose any given dataset into a domain vector that corresponds to a distribution over this vocabulary. These domain vectors enable the identification of the optimal data mixture for language model (LM) pretraining in a training-free manner under the ***D**istribution **A**lignment **A**ssumption* (DA$^{2}$), which suggests that when the data distribution of the training set and the validation set is more aligned, a lower validation loss is achieved. Moreover, *Domain2Vec* can be seamlessly integrated into previous works to model the relationship between domain vectors and LM performance, greatly enhancing the efficiency and scalability of previous methods. Extensive experiments demonstrate that *Domain2Vec* helps find the data mixture that enhances downstream task performance with minimal computational overhead. Specifically, *Domain2Vec* achieves the same validation loss on Pile-CC using only $51.5$\% of the compute required when training on the original mixture of The Pile dataset. Under an equivalent compute budget, *Domain2Vec* improves downstream performance by an average of $2.83$\%.
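A sketch of the domain-vector computation under the stated setup, assuming some pretrained meta-domain classifier; the 5-way Dirichlet classifier below is a random stand-in, not the trained model:

```python
# Domain-vector sketch: average a meta-domain classifier's predicted
# distribution over all documents in a dataset.
import numpy as np

def domain_vector(docs, classifier):
    probs = np.stack([classifier(d) for d in docs])
    return probs.mean(axis=0)  # dataset as a mixture of meta-domains

rng = np.random.default_rng(0)
fake_classifier = lambda doc: rng.dirichlet(np.ones(5))  # random stand-in
print(domain_vector(["doc a", "doc b", "doc c"], fake_classifier))
```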
Poster
Nikita Tsoy · Ivan Kirev · Negin Rahimiyazdi · Nikola Konstantinov
[ East Exhibition Hall A-B ]
Abstract
Performativity, the phenomenon where outcomes are influenced by predictions, is particularly prevalent in social contexts where individuals strategically respond to a deployed model. In order to preserve the high accuracy of machine learning models under distribution shifts caused by performativity, Perdomo et al. (2020) introduced the concept of performative risk minimization (PRM). While this framework ensures model accuracy, it overlooks the impact of the PRM on the underlying distributions and the predictions of the model. In this paper, we initiate the analysis of the impact of PRM, by studying performativity for a sequential performative risk minimization problem with binary random variables and linear performative shifts. We formulate two natural measures of impact. In the case of full information, where the distribution dynamics are known, we derive explicit formulas for the PRM solution and our impact measures. In the case of partial information, we provide performative-aware statistical estimators, as well as simulations. Our analysis contrasts PRM to alternatives that do not model data shift and indicates that PRM can have amplified side effects compared to such methods.
Poster
L. Elisa Celis · Lingxiao Huang · Nisheeth K. Vishnoi
[ East Exhibition Hall A-B ]
Abstract
The rapid rise of Generative AI (GenAI) tools has sparked debate over their role in complementing or replacing human workers across job contexts. We present a mathematical framework that models jobs, workers, and worker-job fit, introducing a novel decomposition of skills into decision-level and action-level subskills to reflect the complementary strengths of humans and GenAI. We analyze how changes in subskill abilities affect job success, identifying conditions for sharp transitions in success probability. We also establish sufficient conditions under which combining workers with complementary subskills significantly outperforms relying on a single worker. This explains phenomena such as *productivity compression*, where GenAI assistance yields larger gains for lower-skilled workers. We demonstrate the framework's practicality using data from O*NET and Big-Bench Lite, aligning real-world data with our model via subskill-division methods. Our results highlight when and how GenAI complements human skills, rather than replacing them.
Poster
Gefan Yang · Frank van der Meulen · Stefan Sommer
[ East Exhibition Hall A-B ]
Abstract
We propose a novel method for simulating conditioned diffusion processes (diffusion bridges) in Euclidean spaces. By training a neural network to approximate bridge dynamics, our approach eliminates the need for computationally intensive Markov Chain Monte Carlo (MCMC) methods or reverse-process modeling. Compared to existing methods, it offers greater robustness across various diffusion specifications and conditioning scenarios. This applies in particular to rare events and multimodal distributions, which pose challenges for score-learning- and MCMC-based approaches. We propose a flexible variational family for approximating the diffusion bridge path measure which is partially specified by a neural network. Once trained, it enables efficient independent sampling at a cost comparable to sampling the unconditioned (forward) process.
Poster
Zhe Wang · Jiaxin Shi · Nicolas Heess · Arthur Gretton · Michalis Titsias
[ East Exhibition Hall A-B ]
Abstract
Autoregressive models (ARMs) have become the workhorse for sequence generation tasks, since many problems can be modeled as next-token prediction. While there appears to be a natural ordering for text (i.e., left-to-right), for many data types, such as graphs, the canonical ordering is less obvious. To address this problem, we introduce a variant of ARM that generates high-dimensional data using a probabilistic ordering that is sequentially inferred from data. This model incorporates a trainable probability distribution, referred to as an order-policy, that dynamically decides the autoregressive order in a state-dependent manner. To train the model, we introduce a variational lower bound on the exact log-likelihood, which we optimize with stochastic gradient estimation. We demonstrate experimentally that our method can learn meaningful autoregressive orderings in image and graph generation. On the challenging domain of molecular graph generation, we achieve state-of-the-art results on the QM9 and ZINC250k benchmarks, evaluated using the Fréchet ChemNet Distance (FCD), Synthetic Accessibility Score (SAS), and Quantitative Estimate of Drug-likeness (QED).
Poster
Tobias Pielok · Bernd Bischl · David Rügamer
[ East Exhibition Hall A-B ]
Abstract
Recent years have witnessed growing interest in semi-implicit variational inference (SIVI) methods due to their ability to rapidly generate samples from complex distributions. However, since the likelihood of these samples is non-trivial to estimate in high dimensions, current research focuses on finding effective SIVI training routines. Although unbiased implicit variational inference (UIVI) has largely been dismissed as imprecise and computationally prohibitive because of its inner MCMC loop, we revisit this method and show that UIVI's MCMC loop can be effectively replaced via importance sampling and the optimal proposal distribution can be learned stably by minimizing an expected forward Kullback–Leibler divergence without bias. Our refined approach demonstrates superior performance or parity with state-of-the-art methods on established SIVI benchmarks.
Spotlight Poster
Terje Mildner · Oliver Hamelijnck · Paris Giampouras · Theodoros Damoulas
[ East Exhibition Hall A-B ]
Abstract
We introduce FedGVI, a probabilistic Federated Learning (FL) framework that is robust to both prior and likelihood misspecification. FedGVI addresses limitations in both frequentist and Bayesian FL by providing unbiased predictions under model misspecification, with calibrated uncertainty quantification. Our approach generalises previous FL approaches, specifically Partitioned Variational Inference (Ashman et al., 2022), by allowing robust and conjugate updates, decreasing computational complexity at the clients. We offer theoretical analysis in terms of fixed-point convergence, optimality of the cavity distribution, and provable robustness to likelihood misspecification. Further, we empirically demonstrate the effectiveness of FedGVI in terms of improved robustness and predictive performance on multiple synthetic and real world classification data sets.
Poster
Rogelio A. Mancisidor · Robert Jenssen · Shujian Yu · Michael Kampffmeyer
[ East Exhibition Hall A-B ]
Abstract
Multimodal learning with variational autoencoders (VAEs) requires estimating joint distributions to evaluate the evidence lower bound (ELBO). Current methods, the product and mixture of experts, aggregate single-modality distributions assuming independence for simplicity, which is an overoptimistic assumption. This research introduces a novel methodology for aggregating single-modality distributions by exploiting the principle of *consensus of dependent experts* (CoDE), which circumvents the aforementioned assumption. Utilizing the CoDE method, we propose a novel ELBO that approximates the joint likelihood of the multimodal data by learning the contribution of each subset of modalities. The resulting CoDE-VAE model demonstrates better performance in terms of balancing the trade-off between generative coherence and generative quality, as well as generating more precise log-likelihood estimations. CoDE-VAE further minimizes the generative quality gap as the number of modalities increases. In certain cases, it reaches a generative quality similar to that of unimodal VAEs, which is a desirable property that is lacking in most current methods. Finally, the classification accuracy achieved by CoDE-VAE is comparable to that of state-of-the-art multimodal VAE models.
Poster
Walid Bendada · Guillaume Salha-Galvan · Romain Hennequin · Théo Bontempelli · Thomas Bouabca · Tristan Cazenave
[ East Exhibition Hall A-B ]
Abstract
This paper introduces von Mises-Fisher exploration (vMF-exp), a scalable method for exploring large action sets in reinforcement learning problems where hyperspherical embedding vectors represent these actions. vMF-exp involves initially sampling a state embedding representation using a von Mises-Fisher distribution, then exploring this representation's nearest neighbors, which scales to virtually unlimited numbers of candidate actions. We show that, under theoretical assumptions, vMF-exp asymptotically maintains the same probability of exploring each action as Boltzmann Exploration (B-exp), a popular alternative that, nonetheless, suffers from scalability issues as it requires computing softmax values for each action. Consequently, vMF-exp serves as a scalable alternative to B-exp for exploring large action sets with hyperspherical embeddings. Experiments on simulated data, real-world public data, and the successful large-scale deployment of vMF-exp on the recommender system of a global music streaming service empirically validate the key properties of the proposed method.
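A minimal sketch of the vMF-exp sampling loop using SciPy's `vonmises_fisher` (available in SciPy >= 1.11); the embedding dimension and concentration parameter are illustrative:

```python
# vMF-exp sketch: perturb the unit state embedding with a von Mises-Fisher
# draw (concentration kappa), then pick the nearest action embedding.
import numpy as np
from scipy.stats import vonmises_fisher  # requires SciPy >= 1.11

rng = np.random.default_rng(0)
d, n_actions, kappa = 16, 10000, 50.0
actions = rng.normal(size=(n_actions, d))
actions /= np.linalg.norm(actions, axis=1, keepdims=True)

state = rng.normal(size=d)
state /= np.linalg.norm(state)

sample = np.ravel(vonmises_fisher(mu=state, kappa=kappa).rvs(random_state=rng))
chosen = int(np.argmax(actions @ sample))  # nearest neighbor by cosine
print("explored action:", chosen)
```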
Poster
Raghav Singhal · Zachary Horvitz · Ryan Teehan · Mengye Ren · Zhou Yu · Kathleen McKeown · Rajesh Ranganath
[ East Exhibition Hall A-B ]
Abstract
Diffusion models have demonstrated remarkable performance in generative modeling, but generating samples with specific desiderata remains challenging. Existing solutions --- such as fine-tuning, best-of-n sampling, and gradient-based guidance --- are expensive, inefficient, or limited in applicability. In this work, we introduce FK steering, a framework that applies Feynman-Kac interacting particle systems to the inference-time steering of diffusion models with arbitrary reward functions. FK steering works by generating multiple trajectories, called particles, and resampling particles at intermediate steps based on scores computed using functions called potentials. Potentials are defined using rewards for intermediate states and are chosen such that a high score indicates the particle will yield a high-reward sample. We explore various choices of potentials, rewards, and samplers. Steering text-to-image models with a human preference reward, we find that FK steering outperforms fine-tuned models with just 2 particles. Moreover, FK steering a 0.8B parameter model outperforms a 2.6B model, achieving state-of-the-art performance on prompt fidelity. We also steer text diffusion models with rewards for text quality and rare attributes such as toxicity, and find that FK steering generates lower perplexity text and enables gradient-free control. …
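A generic sketch of the particle-resampling step in a Feynman-Kac scheme; the exponential potential and toy reward below are illustrative choices, not the paper's exact potentials:

```python
# Resampling step sketch: at an intermediate diffusion step, particles are
# reweighted by a potential computed from an intermediate reward, then
# resampled in proportion to those weights.
import numpy as np

def resample(particles, rewards, temperature=1.0, rng=None):
    rng = rng or np.random.default_rng()
    logw = np.asarray(rewards) / temperature  # potential = exp(r / T)
    w = np.exp(logw - logw.max())             # stabilized softmax weights
    w /= w.sum()
    idx = rng.choice(len(particles), size=len(particles), p=w)
    return particles[idx]

rng = np.random.default_rng(0)
particles = rng.normal(size=(8, 4))  # 8 intermediate latent states
rewards = particles.sum(axis=1)      # toy intermediate reward
print(resample(particles, rewards, rng=rng))
```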
Poster
Kyurae Kim · Zuheng Xu · Jacob Gardner · Trevor Campbell
[ East Exhibition Hall A-B ]
Abstract
The performance of sequential Monte Carlo (SMC) samplers heavily depends on the tuning of the Markov kernels used in the path proposal. For SMC samplers with unadjusted Markov kernels, standard tuning objectives, such as the Metropolis-Hastings acceptance rate or the expected-squared jump distance, are no longer applicable. While stochastic gradient-based end-to-end optimization algorithms have been explored for tuning SMC samplers, they often incur excessive training costs, even for tuning just the kernel step sizes. In this work, we propose a general adaptation framework for tuning the Markov kernels in SMC samplers by minimizing the incremental Kullback-Leibler (KL) divergence between the proposal and target paths. For step size tuning, we provide a gradient- and tuning-free algorithm that is generally applicable for kernels such as Langevin Monte Carlo (LMC). We further demonstrate the utility of our approach by providing a tailored scheme for tuning kinetic LMC used in SMC samplers. Our implementations are able to obtain a full schedule of tuned parameters at the cost of a few vanilla SMC runs, which is a fraction of gradient-based approaches.
Poster
Ngoc Bui · Menglin Yang · Runjin Chen · Leonardo Neves · Mingxuan Ju · ZHITAO YING · Neil Shah · Tong Zhao
[ East Exhibition Hall A-B ]
Abstract
Backward compatible representation learning enables updated models to integrate seamlessly with existing ones, avoiding the need to reprocess stored data. Despite recent advances, existing compatibility approaches in Euclidean space neglect the uncertainty in the old embedding models and force the new model to replicate outdated representations regardless of their quality, thereby hindering the learning process. In this paper, we switch perspectives to hyperbolic geometry, where we treat time as a natural axis for capturing a model’s confidence and evolution. By lifting embeddings into hyperbolic space and constraining updated embeddings to lie within the entailment cone of the old ones, we maintain generational consistency across models while accounting for uncertainties in the representations. To further enhance compatibility, we introduce a robust contrastive alignment loss that dynamically adjusts alignment weights based on the uncertainty of the old embeddings. Experiments validate the superiority of the proposed method in achieving compatibility, paving the way for more resilient and adaptable machine learning systems.
Poster
Jian-Feng Cai · Zhuozhi XIAN · Jiaxi Ying
[ East Exhibition Hall A-B ]
Abstract
We explore the single-spiked covariance model within the context of sparse principal component analysis (PCA), which aims to recover a sparse unit vector from noisy samples. From an information-theoretic perspective, $O(k \log p)$ observations are sufficient to recover a $k$-sparse $p$-dimensional vector $\mathbf{v}$. However, existing polynomial-time methods require at least $O(k^2)$ samples for successful recovery, highlighting a significant gap in sample efficiency. To bridge this gap, we introduce a novel thresholding-based algorithm that requires only $\Omega(k \log p)$ samples, provided the signal strength $\lambda = \Omega(||\mathbf{v}||_\infty^{-1})$. We also propose a two-stage nonconvex algorithm that further enhances estimation performance. This approach integrates our thresholding algorithm with truncated power iteration, achieving the minimax optimal rate of statistical error under the desired sample complexity. Numerical experiments validate the superior performance of our algorithms in terms of estimation accuracy and computational efficiency.
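For intuition, a classic diagonal-thresholding recipe for the single-spike model can be sketched in a few lines; the paper's algorithm and its $\Omega(k \log p)$ guarantee are more refined than this toy version:

```python
# Diagonal-thresholding sketch for the single-spike model: keep the k
# coordinates with the largest sample variances, then take the top
# eigenvector of the covariance restricted to that support.
import numpy as np

rng = np.random.default_rng(0)
p, n, k, lam = 200, 150, 5, 3.0
v = np.zeros(p)
v[:k] = 1 / np.sqrt(k)  # k-sparse spike direction
X = rng.normal(size=(n, p)) + np.sqrt(lam) * rng.normal(size=(n, 1)) * v

S = X.T @ X / n
support = np.argsort(np.diag(S))[-k:]  # top-k variances (threshold step)
_, V = np.linalg.eigh(S[np.ix_(support, support)])
v_hat = np.zeros(p)
v_hat[support] = V[:, -1]  # eigenvector of the largest eigenvalue
print("|<v_hat, v>| =", abs(v_hat @ v))
```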
Poster
Thibault de Surrel · Fabien Lotte · Sylvain Chevallier · Florian Yger
[ East Exhibition Hall A-B ]
Abstract
Circular and non-flat data distributions are prevalent across diverse domains of data science, yet their specific geometric structures often remain underutilized in machine learning frameworks. A principled approach to accounting for the underlying geometry of such data is pivotal, particularly when extending statistical models like the pervasive Gaussian distribution. In this work, we tackle this issue by focusing on the manifold of symmetric positive definite (SPD) matrices, a key focus in information geometry. We introduce a non-isotropic wrapped Gaussian by leveraging the exponential map, derive theoretical properties of this distribution, and propose a maximum likelihood framework for parameter estimation. Furthermore, we reinterpret established classifiers on SPD matrices through a probabilistic lens and introduce new classifiers based on the wrapped Gaussian model. Experiments on synthetic and real-world datasets demonstrate the robustness and flexibility of this geometry-aware distribution, underscoring its potential to advance manifold-based data analysis. This work lays the groundwork for extending classical machine learning and statistical methods to more complex and structured data.
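A sketch of sampling from a wrapped Gaussian on the SPD manifold via the affine-invariant exponential map; the tangent noise here is isotropic for brevity, whereas the paper's construction is non-isotropic:

```python
# Wrapped-Gaussian sketch on SPD matrices: draw a symmetric tangent vector
# and push it through the affine-invariant exponential map at base point P.
import numpy as np
from scipy.linalg import expm, sqrtm

rng = np.random.default_rng(0)
d, sigma = 3, 0.3
P = np.eye(d)  # base SPD point
P_half = np.real(sqrtm(P))
P_half_inv = np.linalg.inv(P_half)

A = rng.normal(scale=sigma, size=(d, d))
V = (A + A.T) / 2  # symmetric tangent vector (isotropic Gaussian)
sample = P_half @ expm(P_half_inv @ V @ P_half_inv) @ P_half
print(np.linalg.eigvalsh(sample))  # all positive: the sample is SPD
```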
Spotlight Poster
Matteo Sesia · vladimir svetnik
[ East Exhibition Hall A-B ]
Abstract
We present a conformal inference method for constructing lower prediction bounds for survival times from right-censored data, extending recent approaches designed for more restrictive type-I censoring scenarios. The proposed method imputes unobserved censoring times using a machine learning model, and then analyzes the imputed data using a survival model calibrated via weighted conformal inference. This approach is theoretically supported by an asymptotic double robustness property. Empirical studies on simulated and real data demonstrate that our method leads to relatively informative predictive inferences and is especially robust in challenging settings where the survival model may be inaccurate.
Poster
Gábor Pituk · Vik Shirvaikar · Tom Rainforth
[ East Exhibition Hall A-B ]
Abstract
We empirically investigate how well popular approximate inference algorithms for Bayesian Neural Networks (BNNs) respect the theoretical properties of Bayesian belief updating. We find strong evidence on synthetic regression and real-world image classification tasks that common BNN algorithms such as variational inference, Laplace approximation, SWAG, and SGLD fail to update in a consistent manner, forget about old data under sequential updates, and violate the predictive coherence properties that would be expected of Bayesian methods. These observed behaviors imply that care should be taken when treating BNNs as true Bayesian models, particularly when using them beyond static prediction settings, such as for active, continual, or transfer learning.
Poster
Ruth Wan Theng Chew · Quoc Phong Nguyen · Bryan Kian Hsiang Low
[ East Exhibition Hall A-B ]
Abstract
Bilevel optimization is characterized by a two-level optimization structure, where the upper-level problem is constrained by optimal lower-level solutions, and such structures are prevalent in real-world problems. The constraint by optimal lower-level solutions poses significant challenges, especially in noisy, constrained, and derivative-free settings, as repeating lower-level optimizations is sample inefficient and predicted lower-level solutions may be suboptimal. We present BILevel Bayesian Optimization (BILBO), a novel Bayesian optimization algorithm for general bilevel problems with blackbox functions, which optimizes both upper- and lower-level problems simultaneously, without the repeated lower-level optimization required by existing methods. BILBO samples from trusted sets based on confidence bounds, which bound the suboptimality on the lower level. Moreover, BILBO selects only one function query per iteration, where the function query selection strategy incorporates the uncertainty of estimated lower-level solutions and includes a conditional reassignment of the query to encourage exploration of the lower-level objective. The performance of BILBO is theoretically guaranteed with a sublinear regret bound for commonly used kernels and is empirically evaluated on several synthetic and real-world problems.
Poster
Tim Steinert · David Ginsbourger · August Lykke-Møller · Ove Christiansen · Henry Moss
[ East Exhibition Hall A-B ]
Abstract
We study the incorporation of equivariances into vector-valued GPs and more general classes of random field models. While kernels guaranteeing equivariances have been investigated previously, their evaluation is often computationally prohibitive due to required integrations over the involved groups. In this work, we provide a kernel characterization of stochastic equivariance for centred second-order vector-valued random fields and we construct integration-free equivariant kernels based on the notion of fundamental regions of group actions. We establish data-efficient and computationally lightweight GP models for velocity fields and molecular electric dipole moments and demonstrate that proposed integration-free kernels may also be leveraged to extract equivariant components from data.
Poster
William Laplante · Matias Altamirano · Andrew Duncan · Jeremias Knoblauch · Francois-Xavier Briol
[ East Exhibition Hall A-B ]
Abstract
State-space formulations allow for Gaussian process (GP) regression with linear-in-time computational cost in spatio-temporal settings, but performance typically suffers in the presence of outliers. In this paper, we adapt and specialise the *robust and conjugate GP (RCGP)* framework of Altamirano et al. (2024) to the spatio-temporal setting. In doing so, we obtain an outlier-robust spatio-temporal GP with a computational cost comparable to classical spatio-temporal GPs. We also overcome the three main drawbacks of RCGPs: their unreliable performance when the prior mean is chosen poorly, their lack of reliable uncertainty quantification, and the need to carefully select a hyperparameter by hand. We study our method extensively in finance and weather forecasting applications, demonstrating that it provides a reliable approach to spatio-temporal modelling in the presence of outliers.
Poster
Masanori Ishikura · Masayuki Karasuyama
[ East Exhibition Hall A-B ]
Abstract
This study considers multi-objective Bayesian optimization (MOBO) through the information gain of the Pareto-frontier. To calculate the information gain, a predictive distribution conditioned on the Pareto-frontier plays a key role, which is defined as a distribution truncated by the Pareto-frontier. However, it is usually impossible to obtain the entire Pareto-frontier in a continuous domain, and therefore, the complete truncation cannot be known. We consider an approximation of the truncated distribution by using a mixture distribution consisting of two possible approximate truncations obtainable from a subset of the Pareto-frontier, which we call over- and under-truncation. Since the optimal balance of the mixture is unknown beforehand, we propose optimizing the balancing coefficient through the variational lower bound maximization framework, by which the approximation error of the information gain can be minimized. Our empirical evaluation demonstrates the effectiveness of the proposed method particularly when the number of objective functions is large.
Poster
Nikita Morozov · Ian Maksimov · Daniil Tiapkin · Sergey Samsonov
[ East Exhibition Hall A-B ]
Abstract
Generative Flow Networks (GFlowNets) are a family of generative models that learn to sample objects from a given probability distribution, potentially known up to a normalizing constant. Instead of working in the object space, GFlowNets proceed by sampling trajectories in an appropriately constructed directed acyclic graph environment, greatly relying on the acyclicity of the graph. In our paper, we revisit the theory that relaxes the acyclicity assumption and present a simpler theoretical framework for non-acyclic GFlowNets in discrete environments. Moreover, we provide various novel theoretical insights related to training with fixed backward policies, the nature of flow functions, and connections between entropy-regularized RL and non-acyclic GFlowNets, which naturally generalize the respective concepts and theoretical results from the acyclic setting. In addition, we experimentally re-examine the concept of loss stability in non-acyclic GFlowNet training, as well as validate our own theoretical findings.
Poster
Florence Regol · Leo Schwinn · Kyle Sprague · Mark Coates · Thomas L Markovich
[ East Exhibition Hall A-B ]
Abstract
A significant challenge in maintaining real-world machine learning models is responding to the continuous and unpredictable evolution of data. Most practitioners are faced with the difficult question: when should I retrain or update my machine learning model? This seemingly straightforward problem is particularly challenging for three reasons: 1) decisions must be made based on very limited information - we usually have access to only a few examples, 2) the nature, extent, and impact of the distribution shift are unknown, and 3) it involves specifying a cost ratio between retraining and poor performance, which can be hard to characterize. Existing works address certain aspects of this problem, but none offer a comprehensive solution. Distribution shift detection falls short as it cannot account for the cost trade-off; the scarcity of the data, paired with its unusual structure, makes it a poor fit for existing offline reinforcement learning methods, and the online learning formulation overlooks key practical considerations. To address this, we present a principled formulation of the retraining problem and propose an uncertainty-based method that makes decisions by continually forecasting the evolution of model performance evaluated with a bounded metric. Our experiments, addressing classification tasks, show that the method consistently outperforms existing …
Poster
Maria Despoina Siampou · Jialiang Li · John Krumm · Cyrus Shahabi · Hua Lu
[ East Exhibition Hall A-B ]
Abstract
Encoding geospatial objects is fundamental for geospatial artificial intelligence (GeoAI) applications, which leverage machine learning (ML) models to analyze spatial information. Common approaches transform each object into known formats, like image and text, for compatibility with ML models. However, this process often discards crucial spatial information, such as the object's position relative to the entire space, reducing downstream task effectiveness. Alternative encoding methods that preserve some spatial properties are often devised for specific data objects (e.g., point encoders), making them unsuitable for tasks that involve different data types (i.e., points, polylines, and polygons). To this end, we propose Poly2Vec, a polymorphic Fourier-based encoding approach that unifies the representation of geospatial objects, while preserving the essential spatial properties. Poly2Vec incorporates a learned fusion module that adaptively integrates the magnitude and phase of the Fourier transform for different tasks and geometries. We evaluate Poly2Vec on five diverse tasks, organized into two categories. The first empirically demonstrates that Poly2Vec consistently outperforms object-specific baselines in preserving three key spatial relationships: topology, direction, and distance. The second shows that integrating Poly2Vec into a state-of-the-art GeoAI workflow improves the performance in two popular tasks: population prediction and land use inference.
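A toy numpy sketch of the Fourier-transform idea (our own illustrative encoding, not Poly2Vec's implementation): a geometry is represented by evaluating its Fourier transform at a fixed set of frequencies, with magnitude and phase kept as separate features. For a single point the magnitude is trivial, while for sampled polylines it begins to carry shape information.

import numpy as np

def fourier_encode(points, freqs):
    # Average the Fourier transform of point masses sampled from the geometry,
    # then split it into magnitude and phase features.
    z = np.exp(-2j * np.pi * (points @ freqs.T)).mean(axis=0)
    return np.abs(z), np.angle(z)

freqs = np.stack(np.meshgrid(np.arange(1, 4), np.arange(1, 4)), -1).reshape(-1, 2).astype(float)
point = np.array([[0.3, 0.7]])
line = np.linspace([0.1, 0.1], [0.9, 0.5], 50)      # polyline as sampled points
print(fourier_encode(point, freqs)[0][:3])          # magnitudes all 1 for a point
print(fourier_encode(line, freqs)[0][:3])           # below 1: shape enters magnitude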
Poster
Li Ding · Hao Zhang · Wenrui Dai · Chenglin Li · Weijia Lu · ZHIFEI YANG · xiaodong Zhang · Xiaofeng Ma · Junni Zou · Hongkai Xiong
[ East Exhibition Hall A-B ]
Abstract
Federated learning (FL) is greatly challenged by the communication bottleneck and computation limitation on clients. Existing methods based on quantization for FL cannot simultaneously reduce the uplink and downlink communication cost and mitigate the computation burden on clients. To address this problem, in this paper, we propose the first low-bit integerized federated learning (LBI-FL) framework that quantizes the weights, activations, and gradients to lower than INT8 precision to markedly reduce the communication and computational costs. Specifically, we achieve dynamic temporal bit-width allocation for weights, activations, and gradients along the training trajectory via reinforcement learning. An agent is trained to determine the bit-width allocation by comprehensively considering the current bit-width, training stage, and quantization loss as the state. An agent efficiently trained on small-scale datasets generalizes well to training varying network architectures on non-independent and identically distributed datasets. Furthermore, we demonstrate in theory that federated learning with gradient quantization achieves a convergence rate equivalent to FedAvg. The proposed LBI-FL can reduce the communication costs by 8 times compared to full-precision FL. Extensive experiments show that the proposed LBI-FL achieves a reduction of more than 50\% BitOPs per client on average for FL with less than 2\% accuracy loss …
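The basic primitive behind such low-bit schemes is uniform integer quantization. The sketch below is our own hedged illustration (the paper's RL agent that allocates bit-widths per tensor and round is not reproduced); it shows how reconstruction error grows as the bit-width shrinks.

import numpy as np

def quantize_int(x, bits):
    # Symmetric uniform quantization to a signed `bits`-bit integer grid.
    qmax = 2 ** (bits - 1) - 1
    scale = np.max(np.abs(x)) / qmax
    q = np.clip(np.round(x / scale), -qmax - 1, qmax).astype(np.int8)
    return q, scale

w = np.random.default_rng(0).standard_normal(1000).astype(np.float32)
for bits in (8, 4, 2):  # an RL agent would choose `bits` per tensor and round
    q, s = quantize_int(w, bits)
    mse = float(np.mean((q.astype(np.float32) * s - w) ** 2))
    print(bits, "bits -> MSE", round(mse, 5))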
Poster
Jueqing Lu · Wray Buntine · Yuanyuan Qi · Joanna Dipnall · Belinda Gabbe · Lan Du
[ East Exhibition Hall A-B ]
Abstract
Resolving conflicts is critical for improving the reliability of multi-view classification. While prior work focuses on learning consistent and informative representations across views, it often assumes perfect alignment and equal importance of all views, an assumption rarely met in real-world scenarios, as some views may express distinct information. To address this, we develop a computational trust-based discounting method that enhances the Evidential Multi-view framework by accounting for the instance-wise reliability of each view through a probability-sensitive trust mechanism. We evaluate our method on six real-world datasets using Top-1 Accuracy, Fleiss’ Kappa, and a new metric, Multi-View Agreement with Ground Truth, to assess prediction reliability. We also assess the effectiveness of uncertainty in indicating prediction correctness via AUROC. Additionally, we test the scalability of our method through end-to-end training on a large-scale dataset. The experimental results show that computational trust can effectively resolve conflicts, paving the way for more reliable multi-view classification models in real-world applications. Code is available at: https://github.com/OverfitFlow/Trust4Conflict
Spotlight Poster
Chen Zhang · Weixin Bu · Zeyi Ren · Zhengwu Liu · Yik-Chung WU · Ngai Wong
[ East Exhibition Hall A-B ]
Abstract
Inferring properties of graph-structured data, *e.g.*, the solubility of molecules, essentially involves learning the implicit mapping from graphs to their properties. This learning process is often costly for graph property learners like Graph Convolutional Networks (GCNs). To address this, we propose a paradigm called Graph Nonparametric Teaching (GraNT) that reinterprets the learning process through a novel nonparametric teaching perspective. Specifically, the latter offers a theoretical framework for teaching implicitly defined (*i.e.*, nonparametric) mappings via example selection. Such an implicit mapping is realized by a dense set of graph-property pairs, with the GraNT teacher selecting a subset of them to promote faster convergence in GCN training. By analytically examining the impact of graph structure on parameter-based gradient descent during training, and recasting the evolution of GCNs—shaped by parameter updates—through functional gradient descent in nonparametric teaching, we show *for the first time* that teaching graph property learners (*i.e.*, GCNs) is consistent with teaching structure-aware nonparametric learners. These new findings enable GraNT to enhance the learning efficiency of the graph property learner, showing significant reductions in training time for graph-level regression (-36.62\%), graph-level classification (-38.19\%), node-level regression (-30.97\%) and node-level classification (-47.30\%), all while maintaining its generalization performance.
Poster
Yuanchao Dai · Ximing Li · Changchun Li
[ East Exhibition Hall A-B ]
Abstract
Training a precise binary classifier with limited supervision in weakly supervised learning scenarios holds considerable research significance in practical settings. Leveraging pairwise unlabeled data with confidence differences has been demonstrated to outperform learning from pointwise unlabeled data. We theoretically analyze the various supervisory signals reflected by confidence differences in confidence difference (ConfDiff) classification and identify challenges arising from noisy signals when confidence differences are small. To address this, we partition the dataset into two subsets with distinct supervisory signals and propose a consistency regularization-based risk estimator to encourage similar outputs for similar instances, mitigating the impact of noisy supervision. We further derive and analyze its estimation error bounds theoretically. Extensive experiments on benchmark and UCI datasets demonstrate the effectiveness of our method. Additionally, to effectively capture the influence of real-world noise on the confidence difference, we artificially perturb the confidence difference distribution and demonstrate the robustness of our method under noisy conditions through comprehensive experiments.
Poster
Mustapha Bounoua · Giulio Franzese · Pietro Michiardi
[ East Exhibition Hall A-B ]
Abstract
Multimodal data is a precious asset enabling a variety of downstream tasks in machine learning. However, real-world data collected across different modalities is often not paired, which poses a significant challenge to learning a joint distribution. A prominent approach to address the modality coupling problem is Minimum Entropy Coupling (MEC), which seeks to minimize the joint entropy while satisfying constraints on the marginals. Existing approaches to the MEC problem focus on finite, discrete distributions, limiting their application for cases involving continuous data. In this work, we propose a novel method to solve the continuous MEC problem, using well-known generative diffusion models that learn to approximate and minimize the joint entropy through a cooperative scheme, while satisfying a relaxed version of the marginal constraints. We empirically demonstrate that our method, DDMEC, is general and can be easily used to address challenging tasks, including unsupervised single-cell multi-omics data alignment and unpaired image translation, outperforming specialized methods.
Poster
Jonathan Geuter · Gregor Kornhardt · Ingimar Tomasson · Vaios Laschos
[ East Exhibition Hall A-B ]
Abstract
Optimal Transport (OT) problems are a cornerstone of many applications, but solving them is computationally expensive. To address this problem, we propose UNOT (Universal Neural Optimal Transport), a novel framework capable of accurately predicting (entropic) OT distances and plans between discrete measures of variable resolution for a given cost function. UNOT builds on Fourier Neural Operators, a universal class of neural networks that map between function spaces and that are discretization-invariant, which enables our network to process measures of varying sizes. The network is trained adversarially using a second, generating network and a self-supervised bootstrapping loss. We theoretically justify the use of FNOs, prove that our generator is universal, and that minimizing the bootstrapping loss provably minimizes the ground truth loss. Through extensive experiments, we show that our network not only accurately predicts optimal transport distances and plans across a wide range of datasets, but also captures the geometry of the Wasserstein space correctly. Furthermore, we show that our network can be used as a state-of-the-art initialization for the Sinkhorn algorithm, significantly outperforming existing approaches.
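To make the initialization claim concrete, here is a standard log-domain Sinkhorn loop in numpy; a learned model such as UNOT would supply the dual potential passed as f_init (the argument name is ours), so fewer iterations are needed. This is a generic sketch, not the paper's code.

import numpy as np
from scipy.special import logsumexp

def sinkhorn(a, b, C, eps=0.05, iters=200, f_init=None):
    # Log-domain Sinkhorn iterations for entropic OT between histograms a, b.
    f = np.zeros(len(a)) if f_init is None else f_init.copy()
    for _ in range(iters):
        g = eps * (np.log(b) - logsumexp((f[:, None] - C) / eps, axis=0))
        f = eps * (np.log(a) - logsumexp((g[None, :] - C) / eps, axis=1))
    P = np.exp((f[:, None] + g[None, :] - C) / eps)   # transport plan
    return P, float(np.sum(P * C))

rng = np.random.default_rng(0)
x, y = rng.random((8, 2)), rng.random((10, 2))
C = ((x[:, None, :] - y[None, :, :]) ** 2).sum(-1)    # squared-Euclidean cost
a, b = np.full(8, 1 / 8), np.full(10, 1 / 10)
P, cost = sinkhorn(a, b, C)                           # f_init would come from UNOT
print(round(cost, 4), round(float(P.sum()), 4))       # plan mass sums to 1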
Poster
Shuai Yi · Yixiong Zou · Yuhua Li · Ruixuan Li
[ East Exhibition Hall A-B ]
Abstract
Cross-domain few-shot learning (CDFSL) aims to transfer knowledge from a data-sufficient source domain to data-scarce target domains. Although Vision Transformer (ViT) has shown superior capability in many vision tasks, its transferability against huge domain gaps in CDFSL is still under-explored. In this paper, we find an intriguing phenomenon: during source-domain training, prompt tuning, a common way to train ViT, can harm ViT's generalization to target domains, while setting the prompts to random noises (i.e., random registers) consistently improves target-domain performance. We then delve into this phenomenon for an interpretation. We find that learnable prompts capture domain information during training on the source dataset, leading the model to treat irrelevant visual patterns as vital cues for recognition. This can be viewed as a kind of overfitting and increases the sharpness of the loss landscapes. In contrast, random registers are essentially a novel way of perturbing attention for sharpness-aware minimization, which helps the model find a flattened minimum in loss landscapes, increasing the transferability. Based on this phenomenon and interpretation, we further propose a simple but effective approach for CDFSL to enhance the perturbation on attention maps by adding random registers on the semantic regions of image …
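A minimal torch sketch of the random-register idea as we read it (token counts and noise scale are our assumptions): instead of learnable prompts, fixed random-noise tokens are prepended to the ViT token sequence, perturbing attention.

import torch

def add_random_registers(tokens, num_registers=4, std=1.0):
    # Prepend random-noise tokens ("random registers") instead of learnable
    # prompts; attending over them perturbs the attention maps.
    b, n, d = tokens.shape
    registers = std * torch.randn(b, num_registers, d, device=tokens.device)
    return torch.cat([registers, tokens], dim=1)

tokens = torch.randn(8, 197, 768)          # [CLS] + 14x14 patch tokens
print(add_random_registers(tokens).shape)  # torch.Size([8, 201, 768])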
Poster
MingCai Chen · Baoming Zhang · Zongbo Han · Wenyu Jiang · Yanmeng Wang · Shuai Feng · Yuntao Du · Bingkun BAO
[ East Exhibition Hall A-B ]
Abstract
Modern machine learning applications are characterized by the increasing size of deep models and the growing diversity of data modalities. This trend underscores the importance of efficiently adapting pre-trained multi-modal models to the test distribution in real time, i.e., multi-modal test-time adaptation. In practice, the magnitudes of multi-modal shifts vary because multiple data sources interact with the impact factor in diverse manners. In this research, we investigate the under-explored practical scenario of *uni-modal distribution shift*, where the distribution shift influences only one modality, leaving the others unchanged. Through theoretical and empirical analyses, we demonstrate that the presence of such a shift impedes multi-modal fusion and leads to the negative transfer phenomenon in existing test-time adaptation techniques. To flexibly combat this unique shift, we propose a selective adaptation schema that incorporates multiple modality-specific adapters to accommodate potential shifts and a ``router'' module that determines which modality requires adaptation. Finally, we validate the effectiveness of our proposed method through extensive experimental evaluations. Code is available at https://github.com/chenmc1996/Uni-Modal-Distribution-Shift.
Spotlight Poster
Chengyuan Li · Liangxiao Jiang · Wenjun Zhang · Liangjun Yu · Huan Zhang
[ East Exhibition Hall A-B ]
Abstract
Due to its simplicity, effectiveness and robustness, naive Bayes (NB) has continued to be one of the top 10 data mining algorithms. To improve its performance, a large number of improved algorithms have been proposed in the last few decades. However, beyond Gaussian naive Bayes (GNB), there is little work on numerical attributes, and none of the existing work takes into account the correlations among instances. To fill this gap, we propose a novel algorithm called instance correlation graph-based naive Bayes (ICGNB). Specifically, it first uses original attributes to construct an instance correlation graph (ICG) to represent the correlations among instances. Then, it employs a variational graph auto-encoder (VGAE) to generate new attributes from the constructed ICG and uses them to augment the original attributes. Finally, it weights each augmented attribute to alleviate attribute redundancy and builds GNB on the weighted attributes. The experimental results on tens of datasets show that ICGNB significantly outperforms its competitors. Our codes and datasets are available at https://github.com/jiangliangxiao/ICGNB.
Poster
Shengbin Ye · Meng Li
[ East Exhibition Hall A-B ]
Abstract
Symbolic regression (SR) is a powerful technique for discovering symbolic expressions that characterize nonlinear relationships in data, gaining increasing attention for its interpretability, compactness, and robustness. However, existing SR methods do not scale to datasets with a large number of input variables (referred to as extreme-scale SR), which is common in modern scientific applications. This "large $p$" setting, often accompanied by measurement error, leads to slow performance of SR methods and overly complex expressions that are difficult to interpret. To address this scalability challenge, we propose a method called PAN+SR, which combines a key idea of ab initio nonparametric variable selection with SR to efficiently pre-screen large input spaces and reduce search complexity while maintaining accuracy. The use of nonparametric methods eliminates model misspecification, supporting a strategy called parametric-assisted nonparametric (PAN). We also extend SRBench, an open-source benchmarking platform, by incorporating high-dimensional regression problems with various signal-to-noise ratios. Our results demonstrate that PAN+SR consistently enhances the performance of 19 contemporary SR methods, enabling several to achieve state-of-the-art performance on these challenging datasets.
Poster
Zitao Wang · Ziyuan Wang · Molei Liu · Nian Si
[ East Exhibition Hall A-B ]
Abstract
Wasserstein Distributionally Robust Optimization (WDRO) is a principled framework for robust estimation under distributional uncertainty. However, its standard formulation can be overly conservative, particularly in small-sample regimes. We propose a novel knowledge-guided WDRO (KG-WDRO) framework for transfer learning, which adaptively incorporates multiple sources of external knowledge to improve generalization accuracy. Our method constructs smaller Wasserstein ambiguity sets by controlling the transportation along directions informed by the source knowledge. This strategy can alleviate perturbations on the predictive projection of the covariates and protect against information loss. Theoretically, we establish the equivalence between our WDRO formulation and the knowledge-guided shrinkage estimation based on collinear similarity, ensuring tractability and geometrizing the feasible set. This also reveals a novel and general interpretation for recent shrinkage-based transfer learning approaches from the perspective of distributional robustness. In addition, our framework can adjust for scaling differences in the regression models between the source and target and accommodates general types of regularization such as lasso and ridge. Extensive simulations demonstrate the superior performance and adaptivity of KG-WDRO in enhancing small-sample transfer learning.
Poster
Sarthak Mittal · Eric Elmoznino · Léo Gagnon · Sangnie Bhardwaj · Guillaume Lajoie · Dhanya Sridhar
[ East Exhibition Hall A-B ]
Abstract
Large autoregressive models like Transformers can solve tasks through in-context learning (ICL) without learning new weights, suggesting avenues for efficiently solving new tasks. For many tasks, e.g., linear regression, the data factorizes: examples are independent given a task latent that generates the data, e.g., linear coefficients. While an optimal predictor leverages this factorization by inferring task latents, it is unclear if Transformers implicitly do so or instead exploit heuristics and statistical shortcuts through attention layers. In this paper, we systematically investigate the effect of explicitly inferring task latents by minimally modifying the Transformer architecture with a bottleneck to prevent shortcuts and incentivize structured solutions. We compare it against standard Transformers across various ICL tasks and find that contrary to intuition and recent works, there is little discernible difference between the two; biasing towards task-relevant latent variables does not lead to better out-of-distribution performance, in general. Curiously, we find that while the bottleneck effectively learns to extract latent task variables from context, downstream processing struggles to utilize them for robust prediction. Our study highlights the intrinsic limitations of Transformers in achieving structured ICL solutions that generalize, and shows that while inferring the right latents aids interpretability, it is not sufficient to …
Poster
Filippo Rinaldi · Giacomo Capitani · Lorenzo Bonicelli · Donato Crisostomi · Federico Bolelli · ELISA FICARRA · Emanuele Rodola · Simone Calderara · Angelo Porrello
[ East Exhibition Hall A-B ]
Abstract
Foundation models serve as the backbone for numerous specialized models developed through fine-tuning. However, when the underlying pretrained model is updated or retrained (e.g., on larger and more curated datasets), the fine-tuned model becomes obsolete, losing its utility and requiring retraining. This raises the question: is it possible to transfer fine-tuning to a new release of the model? In this work, we investigate how to transfer fine-tuning to a new checkpoint without having to re-train, in a data-free manner. To do so, we draw principles from model re-basin and provide a recipe based on weight permutations to re-base the modifications made to the original base model, often called task vector. In particular, our approach tailors model re-basin for Transformer models, taking into account the challenges of residual connections and multi-head attention layers. Specifically, we propose a two-level method rooted in spectral theory, initially permuting the attention heads and subsequently adjusting parameters within select pairs of heads. Through extensive experiments on visual and textual tasks, we achieve the seamless transfer of fine-tuned knowledge to new pre-trained backbones without relying on a single training step or datapoint. Code is available at https://github.com/aimagelab/TransFusion.
Poster
Zhongyang Li · Ziyue Li · Tianyi Zhou
[ East Exhibition Hall A-B ]
Abstract
In large multimodal models (LMMs), the perception of non-language modalities (e.g., visual representations) is usually not on par with the large language models (LLMs)' powerful reasoning capabilities, limiting LMMs' performance on challenging downstream tasks. This weakness has recently been mitigated by replacing the vision encoder with a mixture-of-experts (MoE), which provides rich, multi-granularity, and diverse representations required by different downstream tasks. The performance of multimodal MoE largely depends on its router, which reweights and mixes the representations of different experts for each input. However, we find that the end-to-end trained router does not always produce the optimal routing weights for every test sample. To bridge the gap, we propose a novel and efficient method ''**R**e-**R**outing in **T**est-**T**ime (R2-T2)'' that locally optimizes the vector of routing weights at test time by moving it toward the vectors of the correctly predicted samples in a neighborhood of the test sample. We propose three R2-T2 strategies with different optimization objectives and neighbor-search spaces. R2-T2 consistently and significantly improves state-of-the-art LMMs' performance on challenging multimodal benchmarks of diverse tasks, without training any parameters in the base model. Our code can be accessed here.
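A hedged numpy sketch of the neighborhood intuition behind R2-T2 (our simplification; the paper proposes three strategies with different objectives and search spaces): pull a test sample's routing-weight vector toward the routing weights of its nearest correctly predicted reference samples.

import numpy as np

def reroute_weights(r_test, ref_feats, ref_routes, test_feat, k=5, step=0.5):
    # Kernel-weight the k nearest correctly predicted references and move the
    # test routing vector toward their average routing vector.
    d = np.linalg.norm(ref_feats - test_feat, axis=1)
    nn = np.argsort(d)[:k]
    w = np.exp(-d[nn]); w /= w.sum()
    target = w @ ref_routes[nn]
    r_new = (1 - step) * r_test + step * target
    return r_new / r_new.sum()              # keep a valid mixture over experts

rng = np.random.default_rng(0)
ref_feats = rng.standard_normal((100, 32))   # features of correct references
ref_routes = rng.dirichlet(np.ones(8), 100)  # their expert routing weights
r = rng.dirichlet(np.ones(8))
print(reroute_weights(r, ref_feats, ref_routes, rng.standard_normal(32)).round(3))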
Poster
Huahui Yi · Wei Xu · Ziyuan Qin · Xi Chen · Xiaohu Wu · Kang Li · Qicheng Lao
[ East Exhibition Hall A-B ]
Abstract
Existing prompt-based approaches have demonstrated impressive performance in continual learning, leveraging pre-trained large-scale models for classification tasks; however, the tight coupling between foreground-background information and the coupled attention between prompts and image-text tokens present significant challenges in incremental medical object detection tasks, due to the conceptual gap between medical and natural domains. To overcome these challenges, we introduce the iDPA framework, which comprises two main components: 1) Instance-level Prompt Generation (IPG), which decouples fine-grained instance-level knowledge from images and generates prompts that focus on dense predictions, and 2) Decoupled Prompt Attention (DPA), which decouples the original prompt attention, enabling a more direct and efficient transfer of prompt information while reducing memory usage and mitigating catastrophic forgetting. We collect 13 clinical, cross-modal, multi-organ, and multi-category datasets, referred to as ODinM-13, and experiments demonstrate that iDPA outperforms existing SOTA methods, with FAP improvements of 5.44%, 4.83%, 12.88%, and 4.59% in full data, 1-shot, 10-shot, and 50-shot settings, respectively.
Poster
Wenke Huang · Jian Liang · Guancheng Wan · Didi Zhu · He Li · Jiawei Shao · Mang Ye · Bo Du · Dacheng Tao
[ East Exhibition Hall A-B ]
Abstract
Fine-tuning Multimodal Large Language Models (MLLMs) in multi-task learning scenarios has emerged as an effective strategy for achieving cross-domain specialization. However, multi-task fine-tuning frequently induces performance degradation on open-response datasets. We posit that free-form answer generation primarily depends on language priors, and strengthening the integration of visual behavioral cues is critical for enhancing prediction robustness. In this work, we propose Noise Resilient Confidence Alignment to address the challenge of open-response overfitting during multi-task fine-tuning. Our approach prioritizes maintaining consistent prediction patterns in MLLMs across varying visual input qualities. To achieve this, we employ Gaussian perturbations to synthesize distorted visual inputs and enforce token prediction confidence alignment towards the normal visual branch. By explicitly linking confidence calibration to visual robustness, this method reduces over-reliance on language priors. We conduct extensive empirical evaluations across diverse multi-task downstream settings via popular MLLM architectures. The comprehensive experiments demonstrate the effectiveness of our method, showcasing its ability to alleviate open-response overfitting while maintaining satisfactory multi-task fine-tuning performance.
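The confidence-alignment idea can be sketched as a KL term between token distributions under clean and Gaussian-perturbed visual inputs. The torch snippet below is our own minimal rendering (the paper's exact loss, perturbation scale, and stop-gradient choices may differ; the tiny linear model only stands in for an MLLM).

import torch
import torch.nn.functional as F

def confidence_alignment_loss(logits_clean, logits_noisy):
    # Align token-prediction confidence under perturbed visual inputs to the
    # clean branch (treated as the target, hence the detach).
    p_clean = F.softmax(logits_clean.detach(), dim=-1)
    return F.kl_div(F.log_softmax(logits_noisy, dim=-1), p_clean,
                    reduction="batchmean")

model = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(3 * 32 * 32, 100))
images = torch.randn(2, 3, 32, 32)
noisy = images + 0.1 * torch.randn_like(images)   # Gaussian perturbation
print(confidence_alignment_loss(model(images), model(noisy)).item())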
Poster
Ruiyi Fang · Bingheng Li · Jingyu Zhao · Ruizhi Pu · QIUHAO Zeng · Gezheng Xu · Charles X. Ling · Boyu Wang
[ East Exhibition Hall A-B ]
Abstract
Graph Domain Adaptation (GDA) transfers knowledge from labeled source graphs to unlabeled target graphs, addressing the challenge of label scarcity. In this paper, we highlight the significance of graph homophily, a pivotal factor for graph domain alignment, which, however, has long been overlooked in existing approaches. Specifically, our analysis first reveals that homophily discrepancies exist in benchmarks. Moreover, we also show that homophily discrepancies degrade GDA performance from both empirical and theoretical aspects, which further underscores the importance of homophily alignment in GDA. Inspired by this finding, we propose a novel homophily alignment algorithm that employs mixed filters to smooth graph signals, thereby effectively capturing and mitigating homophily discrepancies between graphs. Experimental results on a variety of benchmarks verify the effectiveness of our method.
Spotlight Poster
Shuai Yi · Yixiong Zou · Yuhua Li · Ruixuan Li
[ East Exhibition Hall A-B ]
Abstract
Vision Transformer (ViT) has achieved remarkable success due to its large-scale pretraining on general domains, but it still faces challenges when applied to distant downstream domains with only scarce training data, which gives rise to the Cross-Domain Few-Shot Learning (CDFSL) task. Inspired by Self-Attention's insensitivity to token orders, we find an interesting phenomenon neglected in current works: disrupting the continuity of image tokens (i.e., making pixels not transition smoothly across patches) in ViT leads to a noticeable performance decline in the general (source) domain but only a marginal decrease in downstream target domains. This questions the role of image tokens' continuity in ViT's generalization under large domain gaps. In this paper, we delve into this phenomenon for an interpretation. We find continuity aids ViT in learning larger spatial patterns, which are harder to transfer than smaller ones, enlarging domain distances. Meanwhile, it implies that only smaller patterns within each patch could be transferred under extreme domain gaps. Based on this interpretation, we further propose a simple yet effective method for CDFSL that better disrupts the continuity of image tokens, encouraging the model to rely less on large patterns and more on smaller ones. Extensive experiments show the effectiveness …
Poster
Haitao Wu · Weiwei Li · Xiuyi Jia
[ East Exhibition Hall A-B ]
Abstract
Label distribution learning (LDL) is a novel learning paradigm that emulates label polysemy by assigning label distributions over the label space. However, recent LDL work seems to exhibit a notable contradiction: 1) existing LDL methods employ auxiliary tasks to enhance performance, which narrows their focus to specific applications, thereby lacking generalizability; 2) conversely, LDL methods without auxiliary tasks rely on losses tailored solely to the primary task, lacking beneficial data to guide the learning process. In this paper, we propose S-LDL, a novel and minimalist solution that generates subtask label distributions, i.e., a form of extra supervised information, to reconcile the above contradiction. S-LDL encompasses two key aspects: 1) an algorithm capable of generating subtasks without any prior/expert knowledge; and 2) a plug-and-play framework seamlessly compatible with existing LDL methods, and even adaptable to derivative tasks of LDL. Our analysis and experiments demonstrate that S-LDL is effective and efficient. To the best of our knowledge, this paper represents the first endeavor to address LDL via subtasks.
Poster
Samidha Verma · Arushi Goyal · Ananya Mathur · Ankit Anand · Sayan Ranu
[ East Exhibition Hall A-B ]
Abstract
Graph Edit Distance (GED) is a widely used metric for measuring similarity between two graphs. Computing the optimal GED is NP-hard, leading to the development of various neural and non-neural heuristics. While neural methods have achieved improved approximation quality compared to non-neural approaches, they face significant challenges: (1) They require large amounts of ground truth data, which is itself NP-hard to compute. (2) They operate as black boxes, offering limited interpretability. (3) They lack cross-domain generalization, necessitating expensive retraining for each new dataset. We address these limitations with GRAIL, introducing a paradigm shift in this domain. Instead of training a neural model to predict GED, GRAIL employs a novel combination of large language models (LLMs) and automated prompt tuning to generate a *program* that is used to compute GED. This shift from predicting GED to generating programs imparts various advantages, including end-to-end interpretability and an autonomous self-evolutionary learning mechanism without ground-truth supervision. Extensive experiments on seven datasets confirm that GRAIL not only surpasses state-of-the-art GED approximation methods in prediction quality but also achieves robust cross-domain generalization across diverse graph distributions.
Poster
Pierre Ablin · Angelos Katharopoulos · Skyler Seto · David Grangier
[ East Exhibition Hall A-B ]
Abstract
Machine learning models are routinely trained on a mixture of different data domains. Different domain weights yield very different downstream performances. We propose the Soup-of-Experts, a novel architecture that can instantiate a model at test time for any domain weights with minimal computational cost and without re-training the model. Our architecture consists of a bank of expert parameters, which are linearly combined to instantiate one model. We learn the linear combination coefficients as a function of the input domain weights. To train this architecture, we sample random domain weights, instantiate the corresponding model, and backpropagate through one batch of data sampled with these domain weights. We demonstrate how our approach quickly obtains small specialized models on several language modeling tasks. Soup-of-Experts are particularly appealing when one needs to ship many different specialist models quickly under a size constraint.
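A minimal numpy sketch of instantiating a Soup-of-Experts model (coeff_fn stands in for the learned coefficient map; all names and sizes are ours): the expert parameter bank is linearly combined using coefficients computed from the requested domain weights, so no retraining is needed.

import numpy as np

def instantiate(expert_bank, domain_weights, coeff_fn):
    # Linearly combine the expert parameter bank with input-dependent
    # coefficients to produce one specialist parameter vector.
    alphas = coeff_fn(domain_weights)
    return np.tensordot(alphas, expert_bank, axes=1)

rng = np.random.default_rng(0)
bank = rng.standard_normal((4, 1000))       # 4 experts, 1000 parameters each
W = rng.standard_normal((4, 3))             # stand-in for the learned map

def coeff_fn(dw):
    z = W @ dw
    return np.exp(z) / np.exp(z).sum()      # simplex coefficients: a "soup"

params = instantiate(bank, np.array([0.7, 0.2, 0.1]), coeff_fn)
print(params.shape)                          # one specialist model, no retraining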
Poster
Jie Gao · Rajesh Jayaram · Benedikt Kolbe · Shay Sapir · Chris Schwiegelshohn · Sandeep Silwal · Erik Waingarten
[ East Exhibition Hall A-B ]
Abstract
Randomized dimensionality reduction is a widely-used algorithmic technique for speeding up large-scale Euclidean optimization problems. In this paper, we study dimension reduction for a variety of maximization problems, including max-matching, max-spanning tree, as well as various measures for dataset diversity. For these problems, we show that the effect of dimension reduction is intimately tied to the *doubling dimension* $\lambda_X$ of the underlying dataset $X$---a quantity measuring intrinsic dimensionality of point sets. Specifically, the dimension required is $O(\lambda_X)$, which we also show is necessary for some of these problems. This is in contrast to classical dimension reduction results, whose dependence grows with the dataset size $|X|$. We also provide empirical results validating the quality of solutions found in the projected space, as well as speedups due to dimensionality reduction.
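The underlying primitive is the Gaussian random projection; the paper's contribution is that a target dimension of $O(\lambda_X)$ suffices for these maximization problems, which this generic sketch does not compute.

import numpy as np

def jl_project(X, target_dim, rng):
    # Gaussian random projection to target_dim dimensions.
    G = rng.standard_normal((X.shape[1], target_dim)) / np.sqrt(target_dim)
    return X @ G

rng = np.random.default_rng(0)
X = rng.standard_normal((100, 512))
Y = jl_project(X, 16, rng)
# Pairwise distances are roughly preserved, so maximization objectives such as
# max-matching or max-spanning-tree change little in the projected space.
i, j = 3, 42
print(round(float(np.linalg.norm(X[i] - X[j])), 2),
      round(float(np.linalg.norm(Y[i] - Y[j])), 2))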
Poster
Qilong Wu · Yiyang Shao · Jun Wang · Xiaobo Sun
[ East Exhibition Hall A-B ]
Abstract
Leveraging high-quality joint representations from multimodal data can greatly enhance model performance in various machine-learning based applications. Recent multimodal learning methods, based on the multimodal information bottleneck (MIB) principle, aim to generate optimal MIB with maximal task-relevant information and minimal superfluous information via regularization. However, these methods often set regularization weights in an *ad hoc* manner and overlook imbalanced task-relevant information across modalities, limiting their ability to achieve optimal MIB. To address this gap, we propose a novel multimodal learning framework, Optimal Multimodal Information Bottleneck (OMIB), whose optimization objective guarantees the achievability of optimal MIB by setting the regularization weight within a theoretically derived bound. OMIB further addresses imbalanced task-relevant information by dynamically adjusting regularization weights per modality, ensuring the inclusion of all task-relevant information. Moreover, we establish a solid information-theoretical foundation for OMIB's optimization and implement it under the variational approximation framework for computational efficiency. Finally, we empirically validate the OMIB’s theoretical properties on synthetic data and demonstrate its superiority over the state-of-the-art benchmark methods in various downstream tasks.
Poster
Julius Von Rohrscheidt · Bastian Rieck
[ East Exhibition Hall A-B ]
Abstract
The Euler Characteristic Transform (ECT) is an efficiently computable geometrical-topological invariant that characterizes the global shape of data. In this paper, we introduce the local Euler Characteristic Transform ($\ell$-ECT), a novel extension of the ECT designed to enhance expressivity and interpretability in graph representation learning. Unlike traditional Graph Neural Networks (GNNs), which may lose critical local details through aggregation, the $\ell$-ECT provides a lossless representation of local neighborhoods. This approach addresses key limitations in GNNs by preserving nuanced local structures while maintaining global interpretability. Moreover, we construct a rotation-invariant metric based on $\ell$-ECTs for spatial alignment of data spaces. Our method demonstrates superior performance compared to standard GNNs on various benchmarking node classification tasks, while also offering theoretical guarantees of its effectiveness.
Poster
Yunze Tong · Fengda Zhang · Zihao Tang · Kaifeng Gao · Kai Huang · Pengfei Lyu · Jun Xiao · Kun Kuang
[ East Exhibition Hall A-B ]
Abstract
Machine learning models often perform well on tabular data by optimizing average prediction accuracy. However, they may underperform on specific subsets due to inherent biases and spurious correlations in the training data, such as associations with non-causal features like demographic information. These biases lead to critical robustness issues as models may inherit or amplify them, resulting in poor performance where such misleading correlations do not hold. Existing mitigation methods have significant limitations: some require prior group labels, which are often unavailable, while others focus solely on the conditional distribution \(P(Y|X)\), upweighting misclassified samples without effectively balancing the overall data distribution \(P(X)\). To address these shortcomings, we propose a latent score-based reweighting framework. It leverages score-based models to capture the joint data distribution \(P(X, Y)\) without relying on additional prior information. By estimating sample density through the similarity of score vectors with neighboring data points, our method identifies underrepresented regions and upweights samples accordingly. This approach directly tackles inherent data imbalances, enhancing robustness by ensuring a more uniform dataset representation. Experiments on various tabular datasets under distribution shifts demonstrate that our method effectively improves performance on imbalanced data.
Poster
Qihong Song · Xiting Liu · Hongyuan Zhu · Joey Tianyi Zhou · Xi Peng · Peng Hu
[ East Exhibition Hall A-B ]
Abstract
Recently, deep unsupervised hashing has gained considerable attention in image retrieval due to its advantages in cost-free data labeling, computational efficiency, and storage savings. Although existing methods achieve promising performance by leveraging inherent visual structures within the data, they primarily focus on learning discriminative features from unlabeled images through limited internal knowledge, resulting in an intrinsic upper bound on their performance. To break through this intrinsic limitation, we propose a novel method, called Deep Unsupervised Hashing with External Guidance (DUH-EG), which incorporates external textual knowledge as semantic guidance to enhance discrete representation learning. Specifically, our DUH-EG: i) selects representative semantic nouns from an external textual database by minimizing their redundancy, then matches images with them to extract more discriminative external features; and ii) presents a novel bidirectional contrastive learning mechanism to maximize agreement between hash codes in internal and external spaces, thereby capturing discrimination from both external and intrinsic structures in Hamming space. Extensive experiments on four benchmark datasets demonstrate that our DUH-EG remarkably outperforms existing state-of-the-art hashing methods.
Spotlight Poster
Shentong Mo · Sukmin Yun
[ East Exhibition Hall A-B ]
Abstract
Generative models have made it possible to synthesize highly realistic images, potentially providing an abundant data source for training machine learning models. Despite the advantages of these synthesizable data sources, the indiscriminate use of generated images as real images for training can even cause mode collapse due to modality discrepancies between real and synthetic domains. In this paper, we propose a novel framework for discriminative use of generated images, coined \textit{GMAIL}, that explicitly treats generated images as a separate modality from real images. Instead of indiscriminately replacing real images with generated ones in the pixel space, our approach bridges the two distinct modalities in the same latent space through a multi-modal learning approach. To be specific, we first fine-tune a model exclusively on generated images using a cross-modality alignment loss and then employ this aligned model to further train various vision-language models with generated images. By aligning the two modalities, our approach effectively leverages the benefits of recent advances in generative models, thereby boosting the effectiveness of generated image learning across a range of vision-language tasks. Our framework can be easily incorporated with various vision-language models, and we demonstrate its efficacy throughout extensive experiments. For example, our framework significantly improves …
Poster
Antonio Almudévar · Jose Miguel Hernandez-Lobato · Sameer Khurana · Ricard Marxer · Alfonso Ortega
[ East Exhibition Hall A-B ]
Abstract
Contrastive losses have been extensively used as a tool for multimodal representation learning. However, it has been empirically observed that their use is not effective for learning an aligned representation space. In this paper, we argue that this phenomenon is caused by the presence of modality-specific information in the representation space. Although some of the most widely used contrastive losses maximize the mutual information between representations of both modalities, they are not designed to remove the modality-specific information. We give a theoretical description of this problem through the lens of the Information Bottleneck Principle. We also empirically analyze how different hyperparameters affect the emergence of this phenomenon in a controlled experimental setup. Finally, we propose a regularization term in the loss function that is derived by means of a variational approximation and aims to increase the representational alignment. We analyze, in a set of controlled experiments and real-world applications, the advantages of including this regularization term.
Poster
Chengpiao Huang · Yuhang Wu · Kaizheng Wang
[ East Exhibition Hall A-B ]
Abstract
We investigate the use of large language models (LLMs) to simulate human responses to survey questions, and perform uncertainty quantification to gain reliable insights. Our approach converts imperfect, LLM-simulated responses into confidence sets for population parameters of human responses, addressing the distribution shift between the simulated and real populations. A key innovation lies in determining the optimal number of simulated responses: too many produce overly narrow confidence sets with poor coverage, while too few yield excessively loose estimates. To resolve this, our method adaptively selects the simulation sample size, ensuring valid average-case coverage guarantees. It is broadly applicable to any LLM, irrespective of its fidelity, and any procedure for constructing confidence sets. Additionally, the selected sample size quantifies the degree of misalignment between the LLM and the target human population. We illustrate our method on real datasets and LLMs.
Poster
Jiayu Liu · Zhenya Huang · Wei Dai · Cheng Cheng · Jinze Wu · Jing Sha · Song Li · Qi Liu · Shijin Wang · Enhong Chen
[ East Exhibition Hall A-B ]
Abstract
Although large language models (LLMs) show promise in solving complex mathematical tasks, existing evaluation paradigms rely solely on a coarse measure of overall answer accuracy, which is insufficient for assessing their authentic capabilities. In this paper, we propose \textbf{CogMath}, which comprehensively assesses LLMs' mathematical abilities through the lens of human cognition. Specifically, inspired by psychological theories, CogMath formalizes the human reasoning process into 3 stages: \emph{problem comprehension}, \emph{problem solving}, and \emph{solution summarization}. Within these stages, we investigate perspectives such as numerical calculation, knowledge, and counterfactuals, and design a total of 9 fine-grained evaluation dimensions. In each dimension, we develop an ``\emph{Inquiry}-\emph{Judge}-\emph{Reference}'' multi-agent system to generate inquiries that assess LLMs' mastery from this dimension. An LLM is considered to truly master a problem only when excelling in all inquiries from the 9 dimensions. By applying CogMath on three benchmarks, we reveal that the mathematical capabilities of 7 mainstream LLMs are overestimated by 30\%-40\%. Moreover, we locate their strengths and weaknesses across specific stages/dimensions, offering in-depth insights to further enhance their reasoning abilities.
Poster
Yuchang Zhu · Huizhe Zhang · Bingzhe Wu · Jintang Li · Zibin Zheng · Peilin Zhao · Liang Chen · Yatao Bian
[ East Exhibition Hall A-B ]
Abstract
Large language models (LLMs) are widely adopted to generate synthetic datasets for various natural language processing (NLP) tasks, such as text classification and summarization. However, accurately measuring the diversity of these synthetic datasets—an aspect crucial for robust model performance—remains a significant challenge. In this paper, we introduce DCScore, a novel method for measuring synthetic dataset diversity from a classification perspective. Specifically, DCScore formulates diversity evaluation as a sample classification task, leveraging mutual relationships among samples. We further provide theoretical verification of the diversity-related axioms satisfied by DCScore, highlighting its role as a principled diversity evaluation method. Experimental results on synthetic datasets reveal that DCScore enjoys a stronger correlation with multiple diversity pseudo-truths of evaluated datasets, underscoring its effectiveness. Moreover, both empirical and theoretical evidence demonstrate that DCScore substantially reduces computational costs compared to existing methods. Code is available at: https://github.com/bluewhalelab/dcscore.
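Our reading of the classification view suggests a sketch like the following (hedged; the paper's exact formulation may differ): softmax each sample's similarity row and sum the probabilities that samples are "classified" as themselves, so redundant datasets score lower.

import numpy as np

def classification_diversity(embeddings, tau=1.0):
    # Softmax each sample's similarity row; sum each sample's probability of
    # being "classified" as itself. Duplicates split this mass, lowering the score.
    Z = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    S = Z @ Z.T / tau
    P = np.exp(S - S.max(axis=1, keepdims=True))
    P /= P.sum(axis=1, keepdims=True)
    return float(np.trace(P))               # in (0, n]; n means all distinct

rng = np.random.default_rng(0)
diverse = rng.standard_normal((50, 16))
redundant = np.repeat(rng.standard_normal((5, 16)), 10, axis=0)
print(classification_diversity(diverse), classification_diversity(redundant))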
Poster
Wayne Chi · Valerie Chen · Anastasios Angelopoulos · Wei-Lin Chiang · Aditya Mittal · Naman Jain · Tianjun Zhang · Ion Stoica · Chris Donahue · Ameet Talwalkar
[ East Exhibition Hall A-B ]
Abstract
Evaluating in-the-wild coding capabilities of large language models (LLMs) is a challenging endeavor with no existing solution. We introduce Copilot Arena, a platform to collect user preferences through native integration into a developer's working environment. Copilot Arena comprises a novel interface for comparing pairs of model outputs, a sampling strategy to reduce experienced latency, and a prompting scheme to enable code completion functionality. Copilot Arena has served over 4.5 million suggestions from 10 models and collected over 11k pairwise judgements. Our results highlight the importance of model evaluations in integrated settings. We find that model rankings from Copilot Arena differ from those of existing evaluations, which we attribute to the unique distribution of data and tasks contained in Copilot Arena. We also identify novel insights into human preferences on code such as an observed consistency in user preference across programming languages yet significant variation in preference due to task category. We open-source Copilot Arena and release data to enable human-centric evaluations and improve understanding of coding assistants.
Poster
Anian Ruoss · Fabio Pardo · Harris Chan · Bonnie Li · Vlad Mnih · Tim Genewein
[ East Exhibition Hall A-B ]
Abstract
In this paper, we present a benchmark to pressure-test today’s frontier models’ multimodal decision-making capabilities in the very long-context regime (up to one million tokens) and investigate whether these models can learn from large numbers of expert demonstrations in their context. We evaluate the performance of Claude 3.5 Sonnet, Gemini 1.5 Flash, Gemini 1.5 Pro, Gemini 2.0 Flash Experimental, GPT-4o, o1-mini, o1-preview, and o1 as policies across a battery of simple interactive decision-making tasks: playing tic-tac-toe, chess, and Atari, navigating grid worlds, solving crosswords, and controlling a simulated cheetah. We study increasing amounts of expert demonstrations in the context — from no demonstrations to 512 full episodes. Across our tasks, models rarely manage to fully reach expert performance, and often, presenting more demonstrations has little effect. Some models steadily improve with more demonstrations on a few tasks. We investigate the effect of encoding observations as text or images and the impact of chain-of-thought prompting. To help quantify the impact of other approaches and future innovations, we open source our benchmark that covers the zero-, few-, and many-shot regimes in a unified evaluation.
Poster
Zhitong Xu · Da Long · Yiming Xu · Guang Yang · Shandian Zhe · Houman Owhadi
[ East Exhibition Hall A-B ]
Abstract
We introduce a novel kernel learning framework toward efficiently solving nonlinear partial differential equations (PDEs). In contrast to the state-of-the-art kernel solver that embeds differential operators within kernels, posing challenges with a large number of collocation points, our approach eliminates these operators from the kernel. We model the solution using a standard kernel interpolation form and differentiate the interpolant to compute the derivatives. Our framework obviates the need for complex Gram matrix construction between solutions and their derivatives, allowing for a straightforward implementation and scalable computation. As an instance, we allocate the collocation points on a grid and adopt a product kernel, which yields a Kronecker product structure in the interpolation. This structure enables us to avoid computing the full Gram matrix, reducing costs and scaling efficiently to a large number of collocation points. We provide a proof of the convergence and rate analysis of our method under appropriate regularity assumptions. In numerical experiments, we demonstrate the advantages of our method in solving several benchmark PDEs.
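The Kronecker structure the abstract mentions can be illustrated directly: with a product kernel on a grid, the Gram matrix factors as $K_1 \otimes K_2$, so a regularized interpolation system can be solved with per-dimension eigendecompositions and never materialized. A generic numpy sketch (the paper's solver additionally handles PDE residual terms):

import numpy as np

def rbf(x, y, ell=0.2):
    return np.exp(-(x[:, None] - y[None, :]) ** 2 / (2 * ell ** 2))

# Grid collocation points with a product kernel: the Gram matrix is K1 (x) K2.
x1, x2 = np.linspace(0, 1, 30), np.linspace(0, 1, 40)
K1, K2 = rbf(x1, x1), rbf(x2, x2)
y = np.sin(2 * np.pi * x1)[:, None] * np.cos(2 * np.pi * x2)[None, :]

# Solve (K1 (x) K2 + nugget I) vec(alpha) = vec(y) via per-dimension
# eigendecompositions, never forming the 1200 x 1200 matrix.
w1, Q1 = np.linalg.eigh(K1)
w2, Q2 = np.linalg.eigh(K2)
t = (Q1.T @ y @ Q2) / (w1[:, None] * w2[None, :] + 1e-8)
alpha = Q1 @ t @ Q2.T
print(np.abs(K1 @ alpha @ K2 - y).max())   # small: the interpolant fits the data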
Poster
Jie Wang · March Boedihardjo · Yao Xie
[ East Exhibition Hall A-B ]
Abstract
Optimal transport has been very successful for various machine learning tasks; however, it is known to suffer from the curse of dimensionality. Hence, dimensionality reduction is desirable when applied to high-dimensional data with low-dimensional structures. The kernel max-sliced (KMS) Wasserstein distance is developed for this purpose by finding an optimal nonlinear mapping that reduces data into $1$ dimension before computing the Wasserstein distance. However, its theoretical properties have not yet been fully developed. In this paper, we provide sharp finite-sample guarantees under milder technical assumptions compared with state-of-the-art for the KMS $p$-Wasserstein distance between two empirical distributions with $n$ samples for general $p\in[1,\infty)$. Algorithm-wise, we show that computing the KMS $2$-Wasserstein distance is NP-hard, and then we further propose a semidefinite relaxation (SDR) formulation (which can be solved efficiently in polynomial time) and provide a relaxation gap for the obtained solution. We provide numerical examples to demonstrate the good performance of our scheme for high-dimensional two-sample testing.
Poster
Soumya Basu
[ East Exhibition Hall A-B ]
Abstract
We study bandit learning in matching markets with two-sided reward uncertainty, extending prior research primarily focused on single-sided uncertainty. Leveraging the concept of `super-stability' from Irving (1994), we demonstrate the advantage of the Extended Gale-Shapley (GS) algorithm over the standard GS algorithm in achieving true stable matchings under incomplete information. By employing the Extended GS algorithm, our centralized algorithm attains a logarithmic pessimal stable regret dependent on an instance-dependent admissible gap parameter. This algorithm is further adapted to a decentralized setting with a constant regret increase. Finally, we establish a novel centralized instance-dependent lower bound for binary stable regret, elucidating the roles of the admissible gap and super-stable matching in characterizing the complexity of stable matching with bandit feedback.
Poster
Kiran Thekumparampil · Gaurush Hiranandani · Kousha Kalantari · Shoham Sabach · Branislav Kveton
[ East Exhibition Hall A-B ]
Abstract
We study learning human preferences from limited comparison feedback, a core machine learning problem that is at the center of reinforcement learning from human feedback (RLHF). We formulate the problem as learning a Plackett-Luce (PL) model from a limited number of $K$-subset comparisons over a universe of $N$ items, where typically $K \ll N$. Our objective is to select the $K$-subsets such that all items can be ranked with minimal mistakes within the budget. We solve the problem using the D-optimal design, which minimizes the worst-case ranking loss under the estimated PL model. All known algorithms for this problem are computationally infeasible in our setting because the number of candidate $K$-subsets is exponential in $K$. To address this challenge, we propose a randomized Frank-Wolfe algorithm with memoization and sparse updates that has a low $O(N^2 + K^2)$ per-iteration complexity. We analyze it and demonstrate its empirical superiority on synthetic and open-source NLP datasets.
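To fix ideas, here is vanilla Frank-Wolfe for the classical D-optimal design objective over single items, i.e., maximizing $\log\det\big(\sum_i w_i x_i x_i^{\top}\big)$ over the simplex. This is a deliberate simplification: the paper's randomized variant operates over $K$-subsets with memoization and sparse updates, which this sketch does not reproduce.

```python
import numpy as np

def d_optimal_fw(X, iters=200):
    """Vanilla Frank-Wolfe for D-optimal design over single items:
    max_w log det(sum_i w_i x_i x_i^T), with w in the probability simplex."""
    n, d = X.shape
    w = np.full(n, 1.0 / n)                           # uniform start
    for t in range(iters):
        M = (X * w[:, None]).T @ X                    # information matrix
        Minv = np.linalg.inv(M + 1e-9 * np.eye(d))
        grads = np.einsum("ij,jk,ik->i", X, Minv, X)  # x_i^T M^{-1} x_i
        i_star = int(np.argmax(grads))                # linear maximization step
        gamma = 2.0 / (t + 2.0)                       # standard FW step size
        w *= 1.0 - gamma
        w[i_star] += gamma
    return w

# Toy usage: 100 random feature vectors in R^5.
rng = np.random.default_rng(0)
w = d_optimal_fw(rng.normal(size=(100, 5)))
print("support size:", int((w > 1e-3).sum()))
```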
Poster
Yu-Jie Zhang · Peng Zhao · Masashi Sugiyama
[ East Exhibition Hall A-B ]
Abstract
Non-stationary online learning has drawn much attention in recent years. Despite considerable progress, dynamic regret minimization has primarily focused on convex functions, leaving the functions with stronger curvature (e.g., squared or logistic loss) underexplored. In this work, we address this gap by showing that the regret can be substantially improved by leveraging the concept of mixability, a property that generalizes exp-concavity to effectively capture loss curvature. Let $d$ denote the dimensionality and $P_T$ the path length of comparators that reflects the environmental non-stationarity. We demonstrate that an exponential-weight method with fixed-share updates achieves an $\mathcal{O}(d T^{1/3} P_T^{2/3} \log T)$ dynamic regret for mixable losses, improving upon the best-known $\mathcal{O}(d^{10/3} T^{1/3} P_T^{2/3} \log T)$ result (Baby & Wang, 2021) in $d$. More importantly, this improvement arises from a simple yet powerful analytical framework that exploits the mixability, which avoids the Karush–Kuhn–Tucker-based analysis required by existing work.
Poster
Osman Berke Guney · Ketan Saichandran · Karim Elzokm · Ziming Zhang · Vijaya Kolachalama
[ East Exhibition Hall A-B ]
Abstract
In many practical applications, including medicine, acquiring all relevant data for machine learning models is often infeasible due to constraints on time, cost, and resources. This makes it important to selectively acquire only the most informative features, yet traditional static feature selection methods fall short in scenarios where feature importance varies across instances. Here, we propose an active feature acquisition (AFA) framework, which dynamically selects features based on their importance to each individual case. Our method leverages local explanation techniques to generate instance-specific feature importance rankings. We then reframe the AFA problem as a feature prediction task, introducing a policy network grounded in a decision transformer architecture. This policy network is trained to select the next most informative feature by learning from the feature importance rankings. As a result, features are acquired sequentially, ordered by their predictive significance, leading to more efficient feature selection and acquisition. Extensive experiments on multiple datasets demonstrate that our approach outperforms current state-of-the-art AFA methods in both predictive accuracy and feature acquisition efficiency. These findings highlight the promise of an explainability-driven AFA strategy in scenarios where the cost of feature acquisition is a key concern.
Poster
Maximilian Graf · Victor Thuot · Nicolas Verzelen
[ East Exhibition Hall A-B ]
Abstract
We study the problem of clustering a set of items based on bandit feedback. Each of the $n$ items is characterized by a feature vector, with a possibly large dimension $d$. The items are partitioned into two unknown groups, such that items within the same group share the same feature vector. We consider a sequential and adaptive setting in which, at each round, the learner selects one item and one feature, then observes a noisy evaluation of the item's feature. The learner's objective is to recover the correct partition of the items, while keeping the number of observations as small as possible. We provide an algorithm which relies on finding a relevant feature for the clustering task, leveraging the Sequential Halving algorithm. With probability at least $1-\delta$, we obtain an accurate recovery of the partition and derive an upper bound on the required budget. Furthermore, we derive an instance-dependent lower bound, which is tight in some relevant cases.
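For reference, a minimal implementation of the Sequential Halving subroutine (Karnin et al., 2013) that the algorithm leverages; the clustering-specific logic for choosing items and features is not reproduced here, and `pull` is a hypothetical sampling callback of our own design.

```python
import math
import numpy as np

def sequential_halving(pull, n_arms, budget):
    """Best-arm identification by repeatedly sampling all surviving arms
    and discarding the worse half, until one arm remains."""
    arms = list(range(n_arms))
    rounds = max(1, math.ceil(math.log2(n_arms)))
    for _ in range(rounds):
        if len(arms) == 1:
            break
        per_arm = max(1, budget // (len(arms) * rounds))  # split budget evenly
        means = [np.mean([pull(i) for _ in range(per_arm)]) for i in arms]
        order = np.argsort(means)[::-1]                   # best arms first
        arms = [arms[j] for j in order[: max(1, len(arms) // 2)]]
    return arms[0]

# Toy usage: arm 7 has the highest mean reward.
rng = np.random.default_rng(1)
best = sequential_halving(lambda i: rng.normal(0.1 * i, 1.0), n_arms=8, budget=4000)
print("identified best arm:", best)
```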
Poster
Yuanwei Zhang · Fengmiao Bian · Xiaoqun Zhang · Jian-Feng Cai
[ East Exhibition Hall A-B ]
Abstract
Tensors play a crucial role in numerous scientific and engineering fields. This paper addresses the low-multilinear-rank tensor completion problem, a fundamental task in tensor-related applications. By exploiting the manifold structure inherent to the fixed-multilinear-rank tensor set, we introduce a simple yet highly effective preconditioned Riemannian metric and propose the Preconditioned Riemannian Gradient Descent (PRGD) algorithm. Compared to the standard Riemannian Gradient Descent (RGD), PRGD achieves faster convergence while maintaining the same order of per-iteration computational complexity. Theoretically, we provide the recovery guarantee for PRGD under near-optimal sampling complexity. Numerical results highlight the efficiency of PRGD, outperforming state-of-the-art methods on both synthetic data and real-world video inpainting tasks.
Poster
Honglin Yuan · Xingfeng Li · Jian Dai · Xiaojian You · Yuan Sun · Zhenwen Ren
[ East Exhibition Hall A-B ]
Abstract
Existing deep multi-view clustering methods have demonstrated excellent performance in addressing issues such as missing views and view noise. However, almost all existing methods operate within a static framework, which assumes that all views have already been collected. In practical scenarios, new views are continuously collected over time, forming a stream of views. Additionally, there are imbalances in quality and distribution between different view streams, i.e., the concept drift problem. To this end, we propose a novel Deep Streaming View Clustering (DSVC) method, which mitigates the impact of concept drift on streaming view clustering. Specifically, DSVC consists of a knowledge base and three core modules. Through the knowledge aggregation learning module, DSVC extracts representative features and prototype knowledge from the new view. Subsequently, the distribution consistency learning module aligns the prototype knowledge from the current view with the historical knowledge distribution to mitigate the impact of concept drift. Then, the knowledge guidance learning module leverages the prototype knowledge to guide the data distribution and enhance the clustering structure. Finally, the prototype knowledge from the current view is updated in the knowledge base to guide the learning of subsequent views. Extensive experiments demonstrate that, even in dynamic environments, …
Spotlight Poster
John Stewart Fabila-Carrasco · He Sun
[ East Exhibition Hall A-B ]
Abstract
Given two weighted graphs $G = (V, E, w_G)$ and $H = (V, F, w_H)$ defined on the same vertex set, the constrained clustering problem seeks to find a subset $S \subset V$ that minimises the cut ratio between $w_G(S, V \setminus S)$ and $w_H(S, V \setminus S)$. In this work, we establish a Cheeger-type inequality that relates the solution of the constrained clustering problem to the spectral properties of $ G$ and $H$. To reduce computational complexity, we utilise the signed Laplacian of $H$, streamlining calculations while maintaining accuracy. By solving a generalised eigenvalue problem, our proposed algorithm achieves notable performance improvements, particularly in challenging scenarios where traditional spectral clustering methods struggle. We demonstrate its practical effectiveness through experiments on both synthetic and real-world datasets.
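A schematic of the spectral relaxation suggested by the abstract: solve the generalized eigenvalue problem $L_G v = \lambda L_H v$ and threshold an eigenvector. For simplicity this sketch uses the ordinary (regularized) Laplacian of $H$ rather than the signed Laplacian the paper employs.

```python
import numpy as np
from scipy.linalg import eigh
from scipy.sparse.csgraph import laplacian

def constrained_spectral_cut(W_G, W_H, reg=1e-8):
    """Relax the constrained cut-ratio problem to L_G v = lam * L_H v,
    then threshold an eigenvector to produce the set S."""
    L_G = laplacian(W_G)
    L_H = laplacian(W_H) + reg * np.eye(len(W_H))  # regularize so B > 0 for eigh
    vals, vecs = eigh(L_G, L_H)                    # generalized eigenproblem
    v = vecs[:, 1]                                 # skip the trivial eigenvector
    return v > np.median(v)                        # indicator of the set S

# Toy usage on two random weighted graphs over 10 shared vertices.
rng = np.random.default_rng(2)
A = rng.random((10, 10)); W_G = (A + A.T) / 2; np.fill_diagonal(W_G, 0)
B = rng.random((10, 10)); W_H = (B + B.T) / 2; np.fill_diagonal(W_H, 0)
print(constrained_spectral_cut(W_G, W_H).astype(int))
```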
Poster
Kaito Ariu · Alexandre Proutiere · Se-Young Yun
[ East Exhibition Hall A-B ]
Abstract
In this paper, we investigate the problem of recovering hidden communities in the Labeled Stochastic Block Model (LSBM) with a finite number of clusters whose sizes grow linearly with the total number of nodes. We derive the necessary and sufficient conditions under which the expected number of misclassified nodes is less than $s$, for any number $s = o(n)$. To achieve this, we propose IAC (Instance-Adaptive Clustering), the first algorithm whose performance matches the instance-specific lower bounds both in expectation and with high probability. IAC is a novel two-phase algorithm that consists of a one-shot spectral clustering step followed by iterative likelihood-based cluster assignment improvements. This approach is based on the instance-specific lower bound and notably does not require any knowledge of the model parameters, including the number of clusters. By performing the spectral clustering only once, IAC maintains an overall computational complexity of $\mathcal{O}(n\,\text{polylog}(n))$, making it scalable and practical for large-scale problems.
Poster
Yuanyuan Yao · Yuan Dong · Lu Chen · Kun Kuang · Ziquan Fang · Cheng Long · Yunjun Gao · TIANYI LI
[ East Exhibition Hall A-B ]
Abstract
Current causal discovery methods for time series data can effectively address a variety of scenarios; however, they remain constrained by inefficiencies. These arise primarily from the high computational costs associated with binning, the uncertainty in selecting appropriate time lags, and the extensive sets of candidate variables. To achieve both high efficiency and effectiveness of causal discovery, we introduce an accelerator termed ARROW. It incorporates an innovative concept termed “Time Weaving” that efficiently encodes time series to capture dynamic trends, thereby mitigating computational complexity. We also propose a novel time-lag discovery strategy based on XOR operations, deriving a theorem that yields the optimal time lag and significantly enhances efficiency. To optimize the search space for causal relationships, we design an efficient pruning strategy that intelligently identifies the most relevant candidate variables, enhancing the efficiency and accuracy of causal discovery. We applied ARROW to four different types of time series causal discovery algorithms and evaluated it on 25 synthetic and real-world datasets. The results demonstrate that, compared to the original algorithms, ARROW achieves up to 153x speedup while achieving higher accuracy in most cases.
Poster
Ahmed Boughdiri · julie Josse · Erwan Scornet
[ East Exhibition Hall A-B ]
Abstract
The Risk Difference (RD), an absolute measure of effect, is widely used and well-studied in both randomized controlled trials (RCTs) and observational studies. Complementary to the RD, the Risk Ratio (RR), as a relative measure, is critical for a comprehensive understanding of intervention effects: RD can downplay small absolute changes, while RR can highlight them. Despite its significance, the theoretical study of RR has received less attention, particularly in observational settings. This paper addresses this gap by tackling the estimation of RR in observational data. We propose several RR estimators and establish their theoretical properties, including asymptotic normality and confidence intervals. Through analyses on simulated and real-world datasets, we evaluate the performance of these estimators in terms of bias, efficiency, and robustness to generative data models. We also examine the coverage and length of the associated confidence intervals. Due to the non-linear nature of RR, influence function theory yields two distinct efficient estimators with different convergence assumptions. Based on theoretical and empirical insights, we recommend, among all estimators, one of the two doubly-robust estimators, which, intriguingly, challenges conventional expectations.
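For orientation, the estimand and its simplest inverse-propensity-weighted (IPW) estimator, in standard potential-outcome notation (ours, not necessarily the paper's; the paper also develops doubly-robust alternatives):

$$\mathrm{RR} = \frac{\mathbb{E}[Y(1)]}{\mathbb{E}[Y(0)]}, \qquad \widehat{\mathrm{RR}}_{\mathrm{IPW}} = \frac{\sum_{i=1}^{n} T_i Y_i / \hat{e}(X_i)}{\sum_{i=1}^{n} (1-T_i) Y_i / \big(1-\hat{e}(X_i)\big)},$$

where $T_i$ is the treatment indicator, $Y_i$ the observed outcome, and $\hat{e}(X_i)$ the estimated propensity score.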
Poster
Yujia Yin · Tianyi Qu · Zihao Wang · Yifan Chen
[ East Exhibition Hall A-B ]
Abstract
Through recognizing causal subgraphs, causal graph learning (CGL) has risen to be a promising approach for improving the generalizability of graph neural networks under out-of-distribution (OOD) scenarios. However, the empirical successes of CGL techniques are mostly exemplified in classification settings, while regression tasks, a more challenging setting in graph learning, are overlooked. We thus devote this work to tackling causal graph regression (CGR); to this end we reshape the processing of confounding effects in existing CGL studies, which mainly deal with classification. Specifically, we reflect on the predictive power of confounders in graph-level regression, and generalize classification-specific causal intervention techniques to regression through a lens of contrastive learning. Extensive experiments on graph OOD benchmarks validate the efficacy of our proposals for CGR. The model implementation and the code are provided on https://github.com/causal-graph/CGR.
Poster
Victoria Lin · Louis-Philippe Morency · Eli Ben-Michael
[ East Exhibition Hall A-B ]
Abstract
As language technologies become widespread, it is important to understand how changes in language affect reader perceptions and behaviors. These relationships may be formalized as the *isolated causal effect* of some *focal* language-encoded intervention (e.g., factual inaccuracies) on an external outcome (e.g., readers' beliefs). In this paper, we introduce a formal estimation framework for isolated causal effects of language. We show that a core challenge of estimating isolated effects is the need to approximate all *non-focal* language outside of the intervention. Drawing on the principle of *omitted variable bias*, we provide measures for evaluating the quality of both non-focal language approximations and isolated effect estimates themselves. We find that poor approximation of non-focal language can lead to bias in the corresponding isolated effect estimates due to omission of relevant variables, and we show how to assess the sensitivity of effect estimates to such bias along the two key axes of *fidelity* and *overlap*. In experiments on semi-synthetic and real-world data, we validate the ability of our framework to correctly recover isolated effects and demonstrate the utility of our proposed measures.
Poster
Johnny Xi · Hugh Dance · Peter Orbanz · Benjamin Bloem-Reddy
[ East Exhibition Hall A-B ]
Abstract
Bivariate structural causal models (SCM) are often used to infer causal direction by examining their goodness-of-fit under restricted model classes. In this paper, we describe a parametrization of bivariate SCMs in terms of a *causal velocity* by viewing the cause variable as time in a dynamical system. The velocity implicitly defines counterfactual curves via the solution of initial value problems where the observation specifies the initial condition. Using tools from measure transport, we obtain a unique correspondence between SCMs and the score function of the generated distribution via its causal velocity. Based on this, we derive an objective function that directly regresses the velocity against the score function, the latter of which can be estimated non-parametrically from observational data. We use this to develop a method for bivariate causal discovery that extends beyond known model classes such as additive or location-scale noise, and that requires no assumptions on the noise distributions. When the score is estimated well, the objective is also useful for detecting model non-identifiability and misspecification. We present positive results in simulation and benchmark experiments where many existing methods fail, and perform ablation studies to examine the method's sensitivity to accurate score estimation.
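Concretely, given a causal velocity $v$, the counterfactual curve through an observed pair $(x_0, y_0)$ is the solution of the initial value problem

$$\frac{dy}{dx} = v\big(x, y(x)\big), \qquad y(x_0) = y_0,$$

so counterfactuals are obtained by integrating the system forward or backward in the cause variable $x$ (our paraphrase of the construction described above).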
Poster
Xichen Guo · Feng Xie · Yan Zeng · Hao Zhang · zhi geng
[ East Exhibition Hall A-B ]
Abstract
We consider the problem of selecting instrumental variables from observational data, a fundamental challenge in causal inference. Existing methods mostly focus on additive linear, constant-effects models, limiting their applicability in complex real-world scenarios. In this paper, we tackle a more general and challenging setting: the additive non-linear, constant-effects model. We first propose a novel testable condition, termed the Cross Auxiliary-based independent Test (CAT) condition, for selecting the valid IV set. We show that this condition is both necessary and sufficient for identifying valid instrumental variable sets within such a model under milder assumptions. Building on this condition, we develop a practical algorithm for selecting the set of valid instrumental variables. Extensive experiments on synthetic data and two real-world datasets demonstrate the effectiveness and robustness of our proposed approach, highlighting its potential for broader applications in causal analysis.
Poster
Stelios Triantafyllou · Aleksa Sukovic · Yasaman Zolfimoselo · Goran Radanovic
[ East Exhibition Hall A-B ]
Abstract
We address the challenge of explaining counterfactual outcomes in multi-agent Markov decision processes. In particular, we aim to explain the total counterfactual effect of an agent's action on the outcome of a realized scenario through its influence on the environment dynamics and the agents' behavior. To achieve this, we introduce a novel causal explanation formula that decomposes the counterfactual effect by attributing to each agent and state variable a score reflecting their respective contributions to the effect. First, we show that the total counterfactual effect of an agent's action can be decomposed into two components: one measuring the effect that propagates through all subsequent agents' actions and another related to the effect that propagates through the state transitions. Building on recent advancements in causal contribution analysis, we further decompose these two effects as follows. For the former, we consider agent-specific effects -- a causal concept that quantifies the counterfactual effect of an agent's action that propagates through a subset of agents. Based on this notion, we use Shapley value to attribute the effect to individual agents. For the latter, we consider the concept of structure-preserving interventions and attribute the effect to state variables based on their "intrinsic'' contributions. Through extensive …
Poster
Raanan Yehezkel Rohekar · Yaniv Gurwicz · Sungduk Yu · Estelle Aflalo Guez · Vasudev Lal
[ East Exhibition Hall A-B ]
Abstract
Are generative pre-trained transformer (GPT) models, trained only to predict the next token, implicitly learning a world model from which sequences are generated one token at a time? We address this question by deriving a causal interpretation of the attention mechanism in GPT and presenting a causal world model that arises from this interpretation. Furthermore, we propose that GPT models, at inference time, can be utilized for zero-shot causal structure learning for input sequences, and introduce a corresponding confidence score. Empirical tests were conducted in controlled environments using the setups of the Othello and Chess strategy games. A GPT, pre-trained on real-world games played with the intention of winning, was tested on out-of-distribution synthetic data consisting of sequences of random legal moves. We find that the GPT model is likely to generate legal next moves for out-of-distribution sequences for which a causal structure is encoded in the attention mechanism with high confidence. In cases where it generates illegal moves, it also fails to capture a causal structure.
Poster
Xiaojing Du · Jiuyong Li · Debo Cheng · Lin Liu · Wentao Gao · XIONGREN CHEN · Ziqi Xu
[ East Exhibition Hall A-B ]
Abstract
Estimating causal effects is crucial for decision-makers in many applications, but it is particularly challenging with observational network data due to peer interactions. Some algorithms have been proposed to estimate causal effects involving network data, particularly peer effects, but they often fail to tell apart diverse peer effects. To address this issue, we propose a general setting which considers both peer direct effects and peer indirect effects, and the effect of an individual's own treatment, and provide the identification conditions of these causal effects. To differentiate these effects, we leverage causal mediation analysis and tailor it specifically for network data. Furthermore, given the inherent challenges of accurately estimating effects in networked environments, we propose to incorporate attention mechanisms to capture the varying influences of different neighbors and to explore high-order neighbor effects using multi-layer graph neural networks (GNNs). Additionally, we employ the Hilbert-Schmidt Independence Criterion (HSIC) to further enhance the model’s robustness and accuracy. Extensive experiments on two semi-synthetic datasets derived from real-world networks and on a dataset from a recommendation system confirm the effectiveness of our approach. Our findings have the potential to improve intervention strategies in networked systems, particularly in social networks and public health.
Poster
Boyang Sun · Yu Yao · Xinshuai Dong · Zongfang Liu · Tongliang Liu · Yumou Qiu · Kun Zhang
[ East Exhibition Hall A-B ]
Abstract
The conditional independence (CI) test is a fundamental tool in statistics. In many real-world scenarios, some variables may be difficult to measure accurately, often leading to data being represented as discretized values. Applying CI tests directly to discretized data, however, can lead to incorrect conclusions about the independence of latent variables. To address this, recent advancements have sought to infer the correct CI relationship between the latent variables by binarizing the observed data. However, this process results in a loss of information, which degrades the test's performance, particularly with small sample sizes. Motivated by this, we introduce a new sample-efficient CI test that does not rely on the binarization process. We find that the relationship can be established by addressing an \textit{over-identifying} restriction problem with the \textit{Generalized Method of Moments} (GMM). Based on this finding, we design a new test statistic and derive its asymptotic distribution. Empirical results across various datasets show that our method consistently outperforms existing ones.
Poster
Junhan Kim · Ho-young Kim · Eulrang Cho · Chungman Lee · Joonyoung Kim · Yongkweon Jeon
[ East Exhibition Hall A-B ]
Abstract
Post-training quantization (PTQ) is a promising solution for deploying large language models (LLMs) on resource-constrained devices. Early methods developed for small-scale networks, such as ResNet, rely on gradient-based optimization, which becomes impractical for hyper-scale LLMs with billions of parameters. While recently proposed backpropagation-free or transformation-based methods alleviate this issue, they ignore inter-layer interactions or use the naive nearest-rounding-based quantized weight assignment to save the heavy computational cost of weight optimization. In this paper, we introduce a novel backpropagation-free PTQ algorithm that optimizes quantized weights by considering inter-layer dependencies. The key innovation is the development of attention-aware Hessian matrices that capture inter-layer interactions within the attention module. Extensive experiments demonstrate that our approach not only outperforms existing weight quantization methods but also shows good synergy with conventional methods to suppress activation outliers, leading to state-of-the-art weight-activation quantization performance. The code will be available at https://github.com/SamsungLabs/BoA.
Poster
Hung-Chieh Fang · Po-Yi Lu · Hsuan-Tien (Tien) Lin
[ East Exhibition Hall A-B ]
Abstract
Universal Domain Adaptation (UniDA) addresses unsupervised domain adaptation where target classes may differ arbitrarily from source ones, except for a shared subset. A widely used approach, partial domain matching (PDM), aligns only shared classes but struggles in extreme cases where many source classes are absent in the target domain, underperforming even the most naive baseline that trains on source data only. In this work, we identify that the failure of PDM for extreme UniDA stems from dimensional collapse (DC) in target representations. To address target DC, we propose to jointly leverage the alignment and uniformity techniques of self-supervised learning (SSL) on the unlabeled target data to preserve the intrinsic structure of the learned representations. Our experimental results confirm that SSL consistently advances PDM and delivers new state-of-the-art results across a broader benchmark of UniDA scenarios with different portions of shared classes, representing a crucial step toward truly comprehensive UniDA. Project page: https://dc-unida.github.io/
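For reference, the alignment and uniformity objectives from self-supervised learning that the abstract refers to, in the standard form of Wang & Isola (2020); the exact hyperparameters used by the paper are not stated here, so treat this as a sketch:

```python
import torch

def alignment_loss(z1, z2):
    """Alignment term: pull embeddings of two views of the same sample together."""
    return (z1 - z2).norm(dim=1).pow(2).mean()

def uniformity_loss(z, t=2.0):
    """Uniformity term: spread normalized embeddings over the hypersphere."""
    return torch.pdist(z, p=2).pow(2).mul(-t).exp().mean().log()

# Toy usage with L2-normalized embeddings of two augmented views.
z1 = torch.nn.functional.normalize(torch.randn(128, 64), dim=1)
z2 = torch.nn.functional.normalize(z1 + 0.1 * torch.randn(128, 64), dim=1)
loss = alignment_loss(z1, z2) + uniformity_loss(z1)
print(float(loss))
```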
Poster
Zhikang Chen · Abudukelimu Wuerkaixi · Sen Cui · Haoxuan Li · Ding Li · Jingfeng ZHANG · Bo Han · Gang Niu · Houfang Liu · Yi Yang · Sifan YANG · Changshui Zhang · Tianling Ren
[ East Exhibition Hall A-B ]
Abstract
Deep networks are prone to catastrophic forgetting during sequential task learning, i.e., losing the knowledge about old tasks upon learning new tasks. To address this, continual learning (CL) has emerged, whose existing methods focus mostly on regulating or protecting the parameters associated with the previous tasks. However, parameter protection is often impractical, since the size of the parameters for storing the old-task knowledge increases linearly with the number of tasks; otherwise, it is hard to preserve the parameters related to the old-task knowledge. In this work, we bring a dual opinion from neuroscience and physics to CL: in the whole network, the pathways matter more than the parameters when concerning the knowledge acquired from the old tasks. Following this opinion, we propose a novel CL framework, learning without isolation (LwI), where model fusion is formulated as graph matching and the pathways occupied by the old tasks are protected without being isolated. Thanks to the sparsity of activation channels in a deep network, LwI can adaptively allocate available pathways for a new task, realizing pathway protection and addressing catastrophic forgetting in a parameter-efficient manner. Experiments on popular benchmark datasets demonstrate the superiority of the proposed LwI.
Poster
Aaditya Naik · Jason Liu · Claire Wang · Amish Sethi · Saikat Dutta · Mayur Naik · Eric Wong
[ East Exhibition Hall A-B ]
Abstract
Neurosymbolic learning enables the integration of symbolic reasoning with deep learning but faces significant challenges in scaling to complex symbolic programs, large datasets, or both. We introduce DOLPHIN, a framework that tackles these challenges by supporting neurosymbolic programs in Python, executing complex symbolic reasoning on the CPU while vectorizing probabilistic computations and gradient propagation on the GPU. Across 13 benchmarks spanning tasks over text, image, and video data, with symbolic reasoning features like recursion and blackbox functions, DOLPHIN converges to state-of-the-art accuracies on the more complex benchmarks while existing frameworks such as Scallop, ISED, and IndeCateR+ fail to converge within the time limit. On simpler benchmarks, DOLPHIN matches their performance, while achieving these results 1.71x to 62x faster than the baselines. Overall, DOLPHIN advances the scalability of neurosymbolic frameworks, achieving state-of-the-art efficiency and convergence on difficult benchmarks where existing frameworks struggle. The code is published at https://github.com/Dolphin-NeSy/Dolphin.
Poster
Nayoung Lee · Jack Cai · Avi Schwarzschild · Kangwook Lee · Dimitris Papailiopoulos
[ East Exhibition Hall A-B ]
Abstract
Large language models often struggle with length generalization and solving complex problem instances beyond their training distribution. We present a self-improvement approach where models iteratively generate and learn from their own solutions, progressively tackling harder problems while maintaining a standard transformer architecture. Across diverse tasks including arithmetic, string manipulation, and maze solving, our method enables models to solve problems far beyond their initial training distribution—for instance, generalizing from 10-digit to 100-digit addition without apparent saturation. We observe that filtering for correct self-generated examples leads to exponential improvements in out-of-distribution performance across training rounds. Additionally, starting from pretrained models significantly accelerates this self-improvement process for several tasks. Our results demonstrate how controlled weak-to-strong curricula can systematically expand model capabilities while preserving architectural simplicity.
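Structurally, the approach is a generate-filter-finetune loop. The sketch below is purely illustrative: `finetune` is a no-op stand-in and `model` is an oracle, so only the control flow, not the learning itself, is captured.

```python
import random

def sample_problems(n_digits, k=50):
    """Addition problems whose difficulty grows with the digit count."""
    return [(random.randrange(10**n_digits), random.randrange(10**n_digits))
            for _ in range(k)]

def is_correct(problem, answer):              # addition is cheaply verifiable
    return answer == problem[0] + problem[1]

def finetune(model, examples):                # no-op stand-in for gradient updates
    return model

model = lambda p: p[0] + p[1]                 # oracle stand-in for a trained LM
difficulty = 2
for rnd in range(3):                          # harder problems every round
    problems = sample_problems(difficulty + rnd)
    solved = [(p, model(p)) for p in problems]
    kept = [(p, a) for p, a in solved if is_correct(p, a)]   # filtering step
    model = finetune(model, kept)             # learn from own correct solutions
    print(f"round {rnd}: kept {len(kept)}/{len(problems)} self-generated examples")
```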
Poster
Xinsong Ma · Jie Wu · Weiwei Liu
[ East Exhibition Hall A-B ]
Abstract
Out-of-distribution (OOD) detection is a crucial task in reliable and safety-critical applications. Previous studies primarily focus on developing score functions while neglecting the design of decision rules based on these scores. A recent work (Ma et al., 2024) is the first to highlight this issue and proposes the generalized BH (g-BH) algorithm to address it. The g-BH algorithm relies on empirical p-values, with the calibrated set playing a central role in their computation. However, the impact of the calibrated set on the performance of the g-BH algorithm has not been thoroughly investigated. This paper aims to uncover the underlying mechanisms between them. Theoretically, we demonstrate that the conditional expectation of the true positive rate (TPR) of the g-BH algorithm, given the calibrated set, follows a beta distribution, which depends on the prescribed level and the size of the calibrated set. This indicates that a small calibrated set tends to degrade the performance of the g-BH algorithm. To address this limitation, we propose a novel ensemble g-BH (eg-BH) algorithm which integrates various empirical p-values for making decisions. Finally, extensive experimental results validate the effectiveness of our theoretical findings and demonstrate the superiority of our method over the g-BH algorithm on small calibrated sets.
Poster
Jie Bao · Chuangyin Dang · Rui Luo · Hanwei Zhang · Zhixin Zhou
[ East Exhibition Hall A-B ]
Abstract
As deep learning models are increasingly deployed in high-risk applications, robust defenses against adversarial attacks and reliable performance guarantees become paramount. Moreover, accuracy alone does not provide sufficient assurance or reliable uncertainty estimates for these models. This study advances adversarial training by leveraging principles from Conformal Prediction. Specifically, we develop an adversarial attack method, termed OPSA (OPtimal Size Attack), designed to reduce the efficiency of conformal prediction at any significance level by maximizing model uncertainty, without requiring coverage guarantees. Correspondingly, we introduce OPSA-AT (Adversarial Training), a defense strategy that integrates OPSA within a novel conformal training paradigm. Experimental evaluations demonstrate that our OPSA attack method induces greater uncertainty compared to baseline approaches for various defenses. Conversely, our OPSA-AT defensive model significantly enhances robustness not only against OPSA but also against other adversarial attacks, while maintaining reliable predictions. Our findings highlight the effectiveness of this integrated approach for developing trustworthy and resilient deep learning models for safety-critical domains. Our code is available at https://github.com/bjbbbb/Enhancing-Adversarial-Robustness-with-Conformal-Prediction.
Poster
Shengju Yu · Yiu-ming Cheung · Siwei Wang · Xinwang Liu · En Zhu
[ East Exhibition Hall A-B ]
Abstract
Despite remarkable advances, existing incomplete multi-view clustering (IMC) methods typically leverage either perspective-shared or perspective-specific determinants, but not both, to encode cluster representations. To address this limitation, we introduce a BACDL algorithm designed to explicitly capture both concurrently, thereby exploiting heterogeneous data more effectively. It bifurcates feature clusters and further pushes them apart to enlarge the discrimination. With distribution learning, it couples view guidance into feature clusters to alleviate dimension inconsistency. Then, building on the principle that samples in one common cluster share similar marginal and conditional distributions, it unifies the association between feature clusters and sample clusters to bridge all views. Thereafter, all incomplete sample clusters are reordered and mapped to a common one to formulate the clustering embedding. Finally, the overall linear overhead makes it resource-efficient.
Spotlight Poster
Yichen Li · Yuying Wang · Haozhao Wang · Yining Qi · Tianzhe Xiao · Ruixuan Li
[ East Exhibition Hall A-B ]
Abstract
Continual Federated Learning (CFL) allows distributed devices to collaboratively learn novel concepts from continuously shifting training data while avoiding \textit{knowledge forgetting} of previously seen tasks. To tackle this challenge, most current CFL approaches rely on extensive rehearsal of previous data. Despite effectiveness, rehearsal comes at a cost to memory, and it may also violate data privacy. Considering these, we seek to apply regularization techniques to CFL by considering their cost-efficient properties that do not require sample caching or rehearsal. Specifically, we first apply traditional regularization techniques to CFL and observe that existing regularization techniques, especially synaptic intelligence, can achieve promising results under homogeneous data distribution but fail when the data is heterogeneous. Based on this observation, we propose a simple yet effective regularization algorithm for CFL named \textbf{FedSSI}, which tailors the synaptic intelligence for the CFL with heterogeneous data settings. FedSSI can not only reduce computational overhead without rehearsal but also address the data heterogeneity issue. Extensive experiments show that FedSSI achieves superior performance compared to state-of-the-art methods.
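For orientation, the synaptic intelligence penalty (Zenke et al., 2017) that FedSSI tailors to heterogeneous CFL has the standard form

$$\tilde{\mathcal{L}} = \mathcal{L} + \lambda \sum_k \Omega_k\,(\theta_k - \theta_k^{*})^2, \qquad \Omega_k = \sum_{\tau} \frac{w_k^{\tau}}{(\Delta\theta_k^{\tau})^2 + \xi},$$

where $\theta^{*}$ are the parameters after the previous tasks, $w_k^{\tau}$ accumulates parameter $k$'s path-integral contribution to the loss decrease on task $\tau$, and $\xi$ is a damping constant. How these importance weights behave under heterogeneous client data is precisely where FedSSI departs from this vanilla form.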
Poster
Florian Beier · Moritz Piening · Robert Beinert · Gabriele Steidl
[ East Exhibition Hall A-B ]
Abstract
We propose a new approach for unsupervised alignment of heterogeneous datasets, which maps data from two different domains without any known correspondences to a common metric space. Our method is based on an unbalanced optimal transport problem with Gromov-Wasserstein marginal penalization. It can be seen as a counterpart to the recently introduced joint multidimensional scaling method. We prove that there exists a minimizer of our functional and that for penalization parameters going to infinity, the corresponding sequence of minimizers converges to a minimizer of the so-called embedded Wasserstein distance. Our model can be reformulated as a quadratic, multi-marginal, unbalanced optimal transport problem, for which a bi-convex relaxation admits a numerical solver via block-coordinate descent. We provide numerical examples for joint embeddings in Euclidean as well as non-Euclidean spaces.
Poster
Adel Javanmard · Vahab Mirrokni · Jean Pouget-Abadie
[ East Exhibition Hall A-B ]
Abstract
Estimating causal effects from randomized experiments is only possible if participants are willing to disclose their potentially sensitive responses. Differential privacy, a widely used framework for ensuring an algorithm’s privacy guarantees, can encourage participants to share their responses without the risk of de-anonymization. However, many mechanisms achieve differential privacy by adding noise to the original dataset, which reduces the precision of causal effect estimation. This introduces a fundamental trade-off between privacy and variance when performing causal analyses on differentially private data. In this work, we propose a new differentially private mechanism, \textsc{Cluster-DP}, which leverages a given cluster structure in the data to improve the privacy-variance trade-off. While our results apply to any clustering, we demonstrate that selecting higher-quality clusters—according to a quality metric we introduce—can decrease the variance penalty without compromising privacy guarantees. Finally, we evaluate the theoretical and empirical performance of our \textsc{Cluster-DP} algorithm on both real and simulated data, comparing it to common baselines, including two special cases of our algorithm: its unclustered version and a uniform-prior version.
Poster
Chenyin Gao · Shu Yang · Mingyang Shan · Wenyu Ye · Ilya Lipkovich · Douglas Faries
[ East Exhibition Hall A-B ]
Abstract
Censored survival data are common in clinical trials, but small control groups can pose challenges, particularly in rare diseases or where balanced randomization is impractical. Recent approaches leverage external controls from historical studies or real-world data to strengthen treatment evaluation for survival outcomes. However, using external controls directly may introduce biases due to data heterogeneity. We propose a doubly protected estimator for the treatment-specific restricted mean survival time difference that is more efficient than trial-only estimators and mitigates biases from external data. Our method adjusts for covariate shifts via doubly robust estimation and addresses outcome drift using the DR-Learner for selective borrowing. The approach can incorporate machine learning to approximate survival curves and detect outcome drifts without strict parametric assumptions, borrowing only comparable external controls. Extensive simulation studies and a real-data application evaluating the efficacy of Galcanezumab in mitigating migraine headaches have been conducted to illustrate the effectiveness of our proposed framework.
Poster
Jin Zhu · Jingyi Li · Hongyi Zhou · Yinan Lin · Zhenhua Lin · Chengchun Shi
[ East Exhibition Hall A-B ]
Abstract
This paper focuses on the design of spatial experiments to optimize the amount of information derived from the experimental data and enhance the accuracy of the resulting causal effect estimator. We propose a surrogate function for the mean squared error (MSE) of the estimator, which facilitates the use of classical graph cut algorithms to learn the optimal design. Our proposal offers three key advances: (1) it accommodates moderate to large spatial interference effects; (2) it adapts to different spatial covariance functions; (3) it is computationally efficient. Theoretical results and numerical experiments based on synthetic environments and a dispatch simulator that models a city-scale ridesharing market, further validate the effectiveness of our design. A python implementation of our method is available at https://github.com/Mamba413/CausalGraphCut.
Poster
Manu Bhat · Jonghyun Park · Jianke Yang · Nima Dehmamy · Robin Walters · Rose Yu
[ East Exhibition Hall A-B ]
Abstract
Existing symmetry discovery methods predominantly focus on global transformations across the entire system or space, but they fail to consider the symmetries in local neighborhoods. This may result in the reported symmetry group being a misrepresentation of the true symmetry. In this paper, we formalize the notion of local symmetry as atlas equivariance. Our proposed pipeline, automatic local symmetry discovery (AtlasD), recovers the local symmetries of a function by training local predictor networks and then learning a Lie group basis to which the predictors are equivariant. We demonstrate AtlasD is capable of discovering local symmetry groups with multiple connected components in top-quark tagging and partial differential equation experiments. The discovered local symmetry is shown to be a useful inductive bias that improves the performance of downstream tasks in climate segmentation and vision tasks. Our code is publicly available at https://github.com/Rose-STL-Lab/AtlasD.
Poster
Qian-Yuan Tang · Yufei Gu · Yunfeng Cai · Mingming Sun · Ping Li · zhou Xun · Zeke Xie
[ East Exhibition Hall A-B ]
Abstract
It is well known that the Hessian of the deep loss landscape matters to optimization and generalization of deep learning. Previous studies reported a rough Hessian structure in deep learning, which consists of two components: a small number of large eigenvalues and a large number of nearly-zero eigenvalues. To the best of our knowledge, we are the first to report that a simple but overlooked power-law Hessian structure exists in well-trained deep neural networks, including Convolutional Neural Networks (CNNs) and Large Language Models (LLMs). Moreover, we provide a maximum-entropy theoretical interpretation for the power-law Hessian structure and theoretically demonstrate the existence of a robust and low-dimensional subspace of deep neural networks. Our extensive experiments using the proposed power-law spectral method demonstrate that the power-law Hessian spectra critically relate to multiple important behaviors of deep learning, including optimization, generalization, and overparameterization. Notably, we discover that the power-law Hessian structure of a given LLM can effectively predict generalization during training, while conventional sharpness-based generalization measures that often work well on CNNs become nearly useless as generalization predictors for LLMs.
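As a toy illustration of the kind of measurement involved (our construction, not the paper's spectral method, which must scale far beyond exact Hessians), one can compute the full Hessian of a small network and fit a power law $\lambda_k \propto k^{-s}$ to its sorted spectrum:

```python
import numpy as np
import torch
from torch.autograd.functional import hessian

# Tiny regression MLP whose full Hessian is tractable.
torch.manual_seed(0)
X, y = torch.randn(64, 4), torch.randn(64, 1)
W1, W2 = torch.randn(4, 8), torch.randn(8, 1)

def loss_fn(w1, w2):
    return ((torch.tanh(X @ w1) @ w2 - y) ** 2).mean()

blocks = hessian(loss_fn, (W1, W2))          # 2x2 grid of Hessian blocks
p1, p2 = W1.numel(), W2.numel()
H = torch.zeros(p1 + p2, p1 + p2)
H[:p1, :p1] = blocks[0][0].reshape(p1, p1)
H[:p1, p1:] = blocks[0][1].reshape(p1, p2)
H[p1:, :p1] = blocks[1][0].reshape(p2, p1)
H[p1:, p1:] = blocks[1][1].reshape(p2, p2)

eigs = np.sort(np.linalg.eigvalsh(H.numpy()))[::-1]
top = eigs[eigs > 1e-8][:20]                 # keep the leading positive part
k = np.arange(1, len(top) + 1)
slope, _ = np.polyfit(np.log(k), np.log(top), 1)
print(f"fitted power-law exponent s ~= {-slope:.2f}")  # lambda_k ~ k^{-s}
```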
Poster
Tianze Yang · Yucheng Shi · Mengnan Du · Xuansheng Wu · Qiaoyu Tan · Jin Sun · Ninghao Liu
[ East Exhibition Hall A-B ]
Abstract
Vector-Quantized Generative Models (VQGMs) have emerged as powerful tools for image generation. However, the key component of VQGMs---the codebook of discrete tokens---is still not well understood, e.g., which tokens are critical to generate an image of a certain concept?This paper introduces Concept-Oriented Token Explanation (CORTEX), a novel approach for interpreting VQGMs by identifying concept-specific token combinations. Our framework employs two methods: (1) a sample-level explanation method that analyzes token importance scores in individual images, and (2) a codebook-level explanation method that explores the entire codebook to find globally relevant tokens. Experimental results demonstrate CORTEX's efficacy in providing clear explanations of token usage in the generative process, outperforming baselines across multiple pretrained VQGMs. Besides enhancing VQGMs transparency, CORTEX is useful in applications such as targeted image editing and shortcut feature detection. Our code is available at https://github.com/YangTianze009/CORTEX.
Poster
Qingqing Cao · Mahyar Najibi · Sachin Mehta
[ East Exhibition Hall A-B ]
Abstract
Pretraining robust vision or multimodal foundation models (e.g., CLIP) relies on large-scale datasets that may be noisy, potentially misaligned, and have long-tail distributions. Previous works have shown promising results in augmenting datasets by generating synthetic samples. However, they only support domain-specific ad hoc use cases (e.g., either image or text only, but not both), and are limited in data diversity due to a lack of fine-grained control over the synthesis process. In this paper, we design a controllable image-text synthesis pipeline, CtrlSynth, for data-efficient and robust multimodal learning. The key idea is to decompose the visual semantics of an image into basic elements, apply user-specified control policies (e.g., remove, add, or replace operations), and recompose them to synthesize images or texts. The decompose and recompose feature in CtrlSynth allows users to control data synthesis in a fine-grained manner by defining customized control policies to manipulate the basic elements. CtrlSynth leverages the capabilities of pretrained foundation models such as large language models or diffusion models to reason and recompose basic elements such that synthetic samples are natural and composed in diverse ways. CtrlSynth is a closed-loop, training-free, and modular framework, making it easy to support different pretrained models. With extensive experiments …
Poster
Noam Levi
[ East Exhibition Hall A-B ]
Abstract
Neural scaling laws have garnered significant interest due to their ability to predict model performance as a function of increasing parameters, data, and compute. In this work, we propose a simple statistical ansatz based on memorization to study scaling laws in the context of inference, specifically how performance improves with multiple inference attempts. We explore the coverage, or pass@k metric, which measures the chance of success over repeated attempts, and provide a motivation for the observed functional form of the inference scaling behavior of the coverage in large language models (LLMs) on reasoning tasks. We then define an "inference loss", which exhibits a power-law decay as the number of trials increases, and connect this result with prompting costs. We further test the universality of our construction by conducting experiments on a simple generative model, and find that our predictions are in agreement with the empirical coverage curves in a controlled setting. Our simple framework sets the ground for incorporating inference scaling with other known scaling laws.
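For reference, if each attempt succeeds independently with probability $p$, the coverage is $\text{pass@}k = 1 - (1-p)^k$; the standard unbiased estimator from $n$ sampled attempts with $c$ successes (Chen et al., 2021) is:

```python
import numpy as np

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased estimate of 1 - C(n-c, k) / C(n, k), computed stably."""
    if n - c < k:
        return 1.0
    return 1.0 - float(np.prod(1.0 - k / np.arange(n - c + 1, n + 1)))

# Example: 200 samples, 13 correct -> chance a batch of k=10 contains a success.
print(round(pass_at_k(200, 13, 10), 4))
```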
Poster
Jonghyun Shin · Namjun Kim · Geonho Hwang · Sejun Park
[ East Exhibition Hall A-B ]
Abstract
The exact minimum width that allows for universal approximation of unbounded-depth networks is known only for ReLU and its variants. In this work, we study the minimum width of networks using general activation functions. Specifically, we focus on squashable functions that can approximate the identity function and binary step function by alternately composing with affine transformations. We show that for networks using a squashable activation function to universally approximate $L^p$ functions from $[0,1]^{d_x}$ to $\mathbb R^{d_y}$, the minimum width is $\max\{d_x,d_y,2\}$ unless $d_x=d_y=1$; the same bound holds for $d_x=d_y=1$ if the activation function is monotone. We then provide sufficient conditions for squashability and show that all non-affine analytic functions and a class of piecewise functions are squashable, i.e., our minimum width result holds for those general classes of activation functions.
Poster
Mohammed Adnan · Rohan Jain · Ekansh Sharma · Rahul G. Krishnan · Yani Ioannou
[ East Exhibition Hall A-B ]
Abstract
The Lottery Ticket Hypothesis (LTH) suggests there exists a sparse LTH mask and weights that achieve the same generalization performance as the dense model while using significantly fewer parameters. However, finding an LTH solution is computationally expensive, and an LTH sparsity mask does not generalize to other random weight initializations. Recent work has suggested that neural networks trained from random initialization find solutions within the same basin modulo permutation, and proposes a method to align trained models within the same loss basin. We hypothesize that misalignment of basins is the reason why LTH masks do not generalize to new random initializations, and propose permuting the LTH mask to align with the new optimization basin when performing sparse training from a different random initialization. We empirically show a significant increase in generalization when sparse training from random initialization with the permuted mask, as compared to using the non-permuted LTH mask, on multiple datasets (CIFAR-10/100 & ImageNet) and models (VGG11 & ResNet20/50).
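A schematic of the mask-permutation step for an MLP (our conventions; the permutations themselves would come from a weight-matching procedure such as Git Re-Basin, which this helper does not implement):

```python
import numpy as np

def permute_mask(masks, perms):
    """Re-align a layer-wise sparsity mask with a new initialization's basin.

    masks[l]: (out_l, in_l) binary mask for layer l.
    perms[l]: permutation of layer l's output neurons (hypothetical input,
    e.g. found by weight matching between the old and new networks).
    """
    aligned, prev = [], None
    for m, p in zip(masks, perms):
        m = m[p, :]                  # permute this layer's output neurons
        if prev is not None:
            m = m[:, prev]           # match the permuted outputs of the layer below
        aligned.append(m)
        prev = p
    return aligned

# Toy usage: two layers, 4 -> 3 -> 2 units; the output layer stays fixed.
rng = np.random.default_rng(0)
masks = [rng.integers(0, 2, (3, 4)), rng.integers(0, 2, (2, 3))]
perms = [rng.permutation(3), np.arange(2)]
print(permute_mask(masks, perms))
```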
Spotlight Poster
Xingyu Zhu · Abhishek Panigrahi · Sanjeev Arora
[ East Exhibition Hall A-B ]
Abstract
We formalize a new concept for LLMs, **context-enhanced learning**. It involves standard gradient-based learning on text except that the context is enhanced with additional data on which no auto-regressive gradients are computed. This setting is a gradient-based analog of usual in-context learning (ICL) and appears in some recent works. Using a multi-step reasoning task, we prove in a simplified setting that context-enhanced learning can be **exponentially more sample-efficient** than standard learning when the model is capable of ICL. At a mechanistic level, we find that the benefit of context-enhancement arises from a more accurate gradient learning signal. We also experimentally demonstrate that it appears hard to detect or recover learning materials that were used in the context during training. This may have implications for data security as well as copyright.
Poster
Shiwei Li · Xiandi Luo · Xing Tang · Haozhao Wang · Hao Chen · weihongluo · Yuhua Li · xiuqiang He · Ruixuan Li
[ East Exhibition Hall A-B ]
Abstract
Low-rank adaptation (LoRA) is a widely used parameter-efficient fine-tuning method. In standard LoRA layers, one of the matrices, $A$ or $B$, is initialized to zero, ensuring that fine-tuning starts from the pretrained model. However, there is no theoretical support for this practice. In this paper, we investigate the impact of non-zero initialization on LoRA's fine-tuning dynamics from an infinite-width perspective. Our analysis reveals that, compared to zero initialization, simultaneously initializing $A$ and $B$ to non-zero values improves LoRA's robustness to suboptimal learning rates, particularly smaller ones. Further analysis indicates that although the non-zero initialization of $AB$ introduces random noise into the pretrained weight, it generally does not affect fine-tuning performance. In other words, fine-tuning does not need to strictly start from the pretrained model. The validity of our findings is confirmed through extensive experiments across various models and datasets. The code is available at https://github.com/Leopold1423/non_zero_lora-icml25.
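A minimal sketch of a LoRA layer with the non-zero initialization studied above; the rank, scaling, and init scale here are illustrative placeholders, not the paper's settings:

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen linear layer plus a low-rank update: W x + (alpha/r) * B A x.
    Unlike the common convention (B = 0 at init), both A and B get small
    non-zero Gaussian initializations here."""
    def __init__(self, base: nn.Linear, r=8, alpha=16, init_std=1e-3):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad_(False)          # only the adapters are trained
        self.A = nn.Parameter(torch.randn(r, base.in_features) * init_std)
        self.B = nn.Parameter(torch.randn(base.out_features, r) * init_std)
        self.scale = alpha / r

    def forward(self, x):
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)

# Toy usage.
layer = LoRALinear(nn.Linear(32, 16))
y = layer(torch.randn(4, 32))
print(y.shape)
```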
Poster
Chen Zeno · Hila Manor · Gregory Ongie · Nir Weinberger · Tomer Michaeli · Daniel Soudry
[ East Exhibition Hall A-B ]
Abstract
While diffusion models generate high-quality images via probability flow, the theoretical understanding of this process remains incomplete. A key question is when probability flow converges to training samples or more general points on the data manifold. We analyze this by studying the probability flow of shallow ReLU neural network denoisers trained with minimal $\ell^2$ norm. For intuition, we introduce a simpler score flow and show that for orthogonal datasets, both flows follow similar trajectories, converging to a training point or a sum of training points. However, early stopping by the diffusion time scheduler allows probability flow to reach more general manifold points. This reflects the tendency of diffusion models to both memorize training samples and generate novel points that combine aspects of multiple samples, motivating our study of such behavior in simplified settings. We extend these results to obtuse simplex data and, through simulations in the orthogonal case, confirm that probability flow converges to a training point, a sum of training points, or a manifold point. Moreover, memorization decreases when the number of training samples grows, as fewer samples accumulate near training points.
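For context, the probability flow referred to above is the deterministic ODE that shares its marginals with the diffusion process (Song et al., 2021):

$$\frac{dx}{dt} = f(x, t) - \tfrac{1}{2}\,g(t)^2\,\nabla_x \log p_t(x),$$

where $f$ and $g$ are the drift and diffusion coefficients and $\nabla_x \log p_t$ is the score, which the paper models with shallow ReLU denoisers.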
Spotlight Poster
Diyuan Wu · Marco Mondelli
[ East Exhibition Hall A-B ]
Abstract
Neural Collapse is a phenomenon where the last-layer representations of a well-trained neural network converge to a highly structured geometry. In this paper, we focus on its first (and most basic) property, known as NC1: the within-class variability vanishes. While prior theoretical studies establish the occurrence of NC1 via the data-agnostic unconstrained features model, our work adopts a data-specific perspective, analyzing NC1 in a three-layer neural network, with the first two layers operating in the mean-field regime and followed by a linear layer. In particular, we establish a fundamental connection between NC1 and the loss landscape: we prove that points with small empirical loss and gradient norm (thus, close to being stationary) approximately satisfy NC1, and the closeness to NC1 is controlled by the residual loss and gradient norm. We then show that (i) gradient flow on the mean squared error converges to NC1 solutions with small empirical loss, and (ii) for well-separated data distributions, both NC1 and vanishing test loss are achieved simultaneously. This aligns with the empirical observation that NC1 emerges during training while models attain near-zero test error. Overall, our results demonstrate that NC1 arises from gradient training due to the properties of the loss landscape, and …
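For orientation, NC1 is typically quantified through the within- and between-class covariances of last-layer features (standard notation, not necessarily the paper's):

$$\Sigma_W = \mathrm{Avg}_{c,i}\,(h_{c,i} - \mu_c)(h_{c,i} - \mu_c)^{\top}, \qquad \Sigma_B = \mathrm{Avg}_{c}\,(\mu_c - \mu_G)(\mu_c - \mu_G)^{\top},$$

with NC1 measured as $\frac{1}{K}\,\mathrm{tr}\big(\Sigma_W \Sigma_B^{\dagger}\big) \to 0$, where $\mu_c$ are the class means, $\mu_G$ the global mean, $K$ the number of classes, and $\dagger$ the pseudoinverse.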
Spotlight Poster
Sibylle Marcotte · Rémi Gribonval · Gabriel Peyré
[ East Exhibition Hall A-B ]
Abstract
While conservation laws in gradient flow training dynamics are well understood for (mostly shallow) ReLU and linear networks, their study remains largely unexplored for more practical architectures. For this, we first show that basic building blocks such as ReLU (or linear) shallow networks, with or without convolution, have easily expressed conservation laws, and no more than the known ones. In the case of a single attention layer, we also completely describe all conservation laws, and we show that residual blocks have the same conservation laws as the same block without a skip connection. We then introduce the notion of conservation laws that depend only on *a subset* of parameters (corresponding e.g. to a pair of consecutive layers, to a residual block, or to an attention layer). We demonstrate that the characterization of such laws can be reduced to the analysis of the corresponding building block in isolation. Finally, we examine how these newly discovered conservation principles, initially established in the continuous gradient flow regime, persist under discrete optimization dynamics, particularly in the context of Stochastic Gradient Descent (SGD).
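A classic example of the kind of law being characterized: for a shallow linear network $f(x) = VUx$ under gradient flow on any differentiable loss $\mathcal{L}$, writing $G = \partial\mathcal{L}/\partial(VU)$, the flow gives $\dot{U} = -V^{\top}G$ and $\dot{V} = -GU^{\top}$, hence

$$\frac{d}{dt}\big(UU^{\top} - V^{\top}V\big) = -V^{\top}GU^{\top} - UG^{\top}V + UG^{\top}V + V^{\top}GU^{\top} = 0,$$

so $UU^{\top} - V^{\top}V$ is conserved along training. The paper asks which such quantities exist for attention layers, residual blocks, and subsets of parameters.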
Poster
Alberto Bernacchia
[ East Exhibition Hall A-B ]
Abstract
Second-order optimization methods, which leverage the local curvature of the loss function, have the potential to dramatically accelerate the training of machine learning models. However, these methods are often hindered by the computational burden of constructing and inverting large curvature matrices with $\mathcal{O}(p^2)$ elements, where $p$ is the number of parameters. In this work, we present a theory that predicts the \emph{exact} structure of the global curvature by leveraging the intrinsic symmetries of neural networks, such as invariance under parameter permutations. For Multi-Layer Perceptrons (MLPs), our approach reveals that the global curvature can be expressed in terms of $\mathcal{O}(d^2 + L^2)$ independent factors, where $d$ is the number of input/output dimensions and $L$ is the number of layers, significantly reducing the computational burden compared to the $\mathcal{O}(p^2)$ elements of the full matrix. These factors can be estimated efficiently, enabling precise curvature computations. To evaluate the practical implications of our framework, we apply second-order optimization to synthetic data, achieving markedly faster convergence compared to traditional optimization methods. Our findings pave the way for a better understanding of the loss landscape of neural networks, and for designing more efficient training methodologies in deep learning. Code: \href{https://github.com/mtkresearch/symo_notebooks}{github.com/mtkresearch/symo\_notebooks}
Poster
Zijian Cheng · 贾 子怡 · Zhi Zhou · Yu-Feng Li · Lan-Zhe Guo
[ East Exhibition Hall A-B ]
Abstract
Tabular data is widely utilized in various machine learning tasks. Current tabular learning research predominantly focuses on closed environments, while in real-world applications, open environments are often encountered, where distribution and feature shifts occur, leading to significant degradation in model performance. Previous research has primarily concentrated on mitigating distribution shifts, whereas feature shifts, a distinctive and unexplored challenge of tabular data, have garnered limited attention. To this end, this paper conducts the first comprehensive study on feature shifts in tabular data and introduces the first **tab**ular **f**eature-**s**hift **bench**mark (TabFSBench). TabFSBench evaluates the impacts of four distinct feature-shift scenarios on four tabular model categories across various datasets, and assesses the performance of large language models (LLMs) and tabular LLMs in the tabular benchmark for the first time. Our study yields three main observations: (1) most tabular models have limited applicability in feature-shift scenarios; (2) the importance of the shifted feature set has a linear relationship with model performance degradation; (3) model performance in closed environments correlates with feature-shift performance. Future research directions are also explored for each observation. Benchmark: [LAMDASZ-ML/TabFSBench](https://github.com/LAMDASZ-ML/TabFSBench).
Poster
Hanglei Hu · Yingying Guo · Zhikang Chen · Sen Cui · Fei Wu · Kun Kuang · Min Zhang · Bo Jiang
[ East Exhibition Hall A-B ]
Abstract
Personalized learning, especially data-based methods, has garnered widespread attention in recent years, aiming to meet individual student needs. However, many works rely on the implicit assumption that benchmarks are high-quality and well-annotated, which limits their practical applicability. In real-world scenarios, these benchmarks often exhibit long-tail distributions, significantly impacting model performance. To address this challenge, we propose a novel method called **N**eural-**C**ollapse-**A**dvanced personalized **L**earning (NCAL), designed to learn features that conform to the same simplex equiangular tight frame (ETF) structure. NCAL introduces Text-modality Collapse (TC) regularization to optimize the distribution of text embeddings within the large language model (LLM) representation space. Notably, NCAL is model-agnostic, making it compatible with various architectures and approaches, thereby ensuring broad applicability. Extensive experiments demonstrate that NCAL effectively enhances existing works, achieving new state-of-the-art performance. Additionally, NCAL mitigates class imbalance, significantly improving the model’s generalization ability.
Poster
Xichen Ye · Yifan Wu · Weizhong Zhang · Cheng Jin · Yifan Chen
[ East Exhibition Hall A-B ]
Abstract
The Influence Function (IF) is a widely used technique for assessing the impact of individual training samples on model predictions. However, existing IF methods often fail to provide reliable influence estimates in deep neural networks, particularly when applied to noisy training data. This issue does not stem from inaccuracies in parameter change estimation, which has been the primary focus of prior research, but rather from deficiencies in loss change estimation, specifically due to the sharpness of validation risk. In this work, we establish a theoretical connection between influence estimation error, validation set risk, and its sharpness, underscoring the importance of flat validation minima for accurate influence estimation. Furthermore, we introduce a novel estimation form of the Influence Function specifically designed for flat validation minima. Experimental results across various tasks validate the superiority of our approach.
Poster
Utkarsh Singhal · Ryan Feng · Stella Yu · Atul Prakash
[ East Exhibition Hall A-B ]
Abstract
Real-world visual perception requires invariance to diverse transformations, yet current methods rely heavily on specialized architectures or training on predefined augmentations, limiting generalization. We propose FoCal, a test-time, data-driven framework that achieves robust perception by leveraging internet-scale visual priors from foundation models. By generating and optimizing candidate transformations toward visually typical, "canonical" views, FoCal enhances robustness without retraining or architectural changes. Experiments demonstrate improved robustness of CLIP and SAM across challenging transformations, including 2D/3D rotations, illumination shifts (contrast and color), and day-night variations. We also highlight potential applications in active vision. Our approach challenges the assumption that transform-specific training is necessary, instead offering a scalable path to invariance. Our code is available at: https://github.com/sutkarsh/focal.
Poster
Yangyang Shen · Xiao Tan · Dian Shen · Meng Wang · Beilun Wang
[ East Exhibition Hall A-B ]
Abstract
Backdoor attacks seriously threaten deep neural networks (DNNs) by embedding concealed vulnerabilities through data poisoning. To counteract these attacks, training benign models from poisoned data has garnered considerable interest from researchers. High-performing defenses often rely on additional clean subsets/seeds, which is untenable due to increasing privacy concerns and data scarcity. In the absence of additional clean subsets/seeds, defenders resort to complex feature extraction and analysis, resulting in excessive overhead and compromised performance. To address these challenges, we identify that the key lies in the sufficient utilization of both the easier-to-obtain target labels and clean hard samples. In this work, we propose a Bi-perspective Splitting Defense (BSD). BSD distinguishes clean samples using both semantic and loss-statistic characteristics, through open set recognition-based splitting (OSS) and altruistic model-based data splitting (ALS) respectively. Through extensive experiments on benchmark datasets and against representative attacks, we empirically demonstrate that BSD surpasses existing defenses by over 20\% in average Defense Effectiveness Rating (DER), achieving clean-data-free backdoor security.
Poster
Fanfei Li · Thomas Klein · Wieland Brendel · Robert Geirhos · Roland S. Zimmermann
[ East Exhibition Hall A-B ]
Abstract
Out-of-distribution (OOD) robustness is a desired property of computer vision models. Improving model robustness requires high-quality signals from robustness benchmarks to quantify progress. While various benchmark datasets such as ImageNet-C were proposed in the ImageNet era, most ImageNet-C corruption types are no longer OOD relative to today's large, web-scraped datasets, which already contain common corruptions such as blur or JPEG compression artifacts. Consequently, these benchmarks are no longer well-suited for evaluating OOD robustness in the era of web-scale datasets. Indeed, recent models show saturating scores on ImageNet-era OOD benchmarks, indicating that it is unclear whether models trained on web-scale datasets truly become better at OOD generalization or whether they have simply been exposed to the test distortions during training. To address this, we introduce LAION-C as a benchmark alternative for ImageNet-C. LAION-C consists of six novel distortion types specifically designed to be OOD, even for web-scale datasets such as LAION. In a comprehensive evaluation of state-of-the-art models, we find that the LAION-C dataset poses significant challenges to contemporary models, including MLLMs such as Gemini and GPT-4o. We additionally conducted a psychophysical experiment to evaluate the difficulty of our corruptions for human observers, enabling a comparison of models to lab-quality human …
Poster
Thomas Paniagua · Chinmay Savadikar · Tianfu Wu
[ East Exhibition Hall A-B ]
Abstract
White-box targeted adversarial attacks reveal core vulnerabilities in Deep Neural Networks (DNNs), yet two key challenges persist: (i) How many target classes can be attacked simultaneously in a specified order, known as the *ordered top-$K$ attack* problem ($K \geq 1$)? (ii) How can the corresponding adversarial perturbations for a given benign image be computed directly in the image space? We address both by showing that *ordered top-$K$ perturbations can be learned via iteratively optimizing linear combinations of the $\underline{ri}ght\text{ }\underline{sing}ular$ vectors of the adversarial Jacobian* (i.e., the logit-to-image Jacobian constrained by target ranking). These vectors span an orthogonal, informative subspace in the image domain. We introduce **RisingAttacK**, a novel Sequential Quadratic Programming (SQP)-based method that exploits this structure. We propose a holistic figure-of-merit (FoM) metric combining attack success rates (ASRs) and $\ell_p$-norms ($p=1,2,\infty$). Extensive experiments on ImageNet-1k across seven ordered top-$K$ levels ($K=1, 5, 10, 15, 20, 25, 30$) and four models (ResNet-50, DenseNet-121, ViT-B, DEiT-B) show RisingAttacK consistently surpasses the state-of-the-art QuadAttacK.
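A minimal, runnable sketch of the singular-subspace idea on a toy model: compute the adversarial Jacobian, take its right singular vectors as the search subspace, and optimize combination coefficients against an ordered ranking objective. The hinge-style loss, Adam optimizer, toy network, and target classes here are illustrative stand-ins; the paper itself uses an SQP solver.

```python
import torch

# Toy stand-in for an image classifier; the paper attacks ImageNet models.
model = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(3 * 8 * 8, 10))
x = torch.rand(1, 3, 8, 8)

# Adversarial Jacobian: logits w.r.t. input pixels, then its right singular vectors.
J = torch.autograd.functional.jacobian(lambda z: model(z).squeeze(0), x).reshape(10, -1)
_, _, Vh = torch.linalg.svd(J, full_matrices=False)   # rows of Vh span the search subspace

targets = [3, 7]                                      # hypothetical ordered top-2 targets
rest = [i for i in range(10) if i not in targets]
c = torch.zeros(Vh.shape[0], requires_grad=True)      # combination coefficients
opt = torch.optim.Adam([c], lr=0.05)
for _ in range(200):
    delta = (c @ Vh).reshape_as(x)                    # perturbation in the subspace
    logits = model(x + delta).squeeze(0)
    loss = torch.relu(logits[targets[1]] - logits[targets[0]] + 0.1)          # rank 3 above 7
    loss = loss + torch.relu(logits[rest] - logits[targets[-1]] + 0.1).sum()  # 7 above the rest
    opt.zero_grad(); loss.backward(); opt.step()
```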
Poster
Liangze Jiang · Damien Teney
[ East Exhibition Hall A-B ]
Abstract
Out-of-distribution (OOD) generalization is challenging because distribution shifts come in many forms. Numerous algorithms exist to address specific settings, but *choosing the right training algorithm for the right dataset* without trial and error is difficult. Indeed, real-world applications often involve multiple types and combinations of shifts that are hard to analyze theoretically. **Method.** This work explores the possibility of *learning* the selection of a training algorithm for OOD generalization. We propose a proof of concept (OOD-Chameleon) that formulates the selection as a multi-label classification over candidate algorithms, trained on a *dataset of datasets* representing a variety of shifts. We evaluate the ability of OOD-Chameleon to rank algorithms on unseen shifts and datasets based only on dataset characteristics, i.e., without training models first, unlike traditional model selection. **Findings.** Extensive experiments show that the learned selector identifies high-performing algorithms across synthetic, vision, and language tasks. Further inspection shows that it learns non-trivial decision rules, which provide new insights into the applicability of existing algorithms. Overall, this new approach opens the possibility of better exploiting and understanding the plethora of existing algorithms for OOD generalization.
Poster
Ming-Yi Hong · Yen-Jung Hsu · Miao-Chen Chiang · Che Lin
[ East Exhibition Hall A-B ]
Abstract
Sequential recommendation in e-commerce utilizes users' anonymous browsing histories to personalize product suggestions without relying on private information. Existing item ID-based methods and multimodal models often overlook the temporal alignment of modalities like textual descriptions, visual content, and prices in user browsing sequences. To address this limitation, this paper proposes the Multimodal Time-aligned Shared Token Recommender (MTSTRec), a transformer-based framework with a single time-aligned shared token per product for efficient cross-modality fusion. MTSTRec preserves the distinct contributions of each modality while aligning them temporally to better capture user preferences. Extensive experiments demonstrate that MTSTRec achieves state-of-the-art performance across multiple sequential recommendation benchmarks, significantly improving upon existing multimodal fusion. Our code is available at https://github.com/idssplab/MTSTRec.
Poster
Thomas Lee · William Toner · Rajkarn Singh · Artjom Joosen · Martin Asenov
[ East Exhibition Hall A-B ]
Abstract
Foundation models (FMs) have emerged as a promising approach for time series forecasting. While effective, FMs typically remain fixed during deployment due to the high computational costs of learning them online. Consequently, deployed FMs fail to adapt their forecasts to current data characteristics, despite the availability of online feedback from newly arriving data. This raises the question of whether FM performance can be enhanced by the *efficient* usage of this feedback. We propose *ELF* to answer this question. ELF is a lightweight mechanism for the online adaptation of FM forecasts in response to online feedback. ELF consists of two parts: **a)** the *ELF-Forecaster*, which is used to learn the current data distribution; and **b)** the *ELF-Weighter*, which is used to combine the forecasts of the FM and the ELF-Forecaster. We evaluate the performance of ELF in conjunction with several recent FMs across a suite of standard time series datasets. In *all* of our experiments we find that using ELF improves performance. This work demonstrates how efficient usage of online feedback can improve FM forecasts.
Spotlight Poster
Xingjian Wu · Xiangfei Qiu · Hongfan Gao · Jilin Hu · Bin Yang · Chenjuan Guo
[ East Exhibition Hall A-B ]
Abstract
Probabilistic Time Series Forecasting (PTSF) plays a crucial role in decision-making across various fields, including economics, energy, and transportation. Most existing methods excel at short-term forecasting, while overlooking the hurdles of Long-term Probabilistic Time Series Forecasting (LPTSF). As the forecast horizon extends, the inherent nonlinear dynamics have a significant adverse effect on prediction accuracy and make generative models inefficient by increasing the cost of each iteration. To overcome these limitations, we introduce $K^2$VAE, an efficient VAE-based generative model that leverages a KoopmanNet to transform nonlinear time series into a linear dynamical system, and devises a KalmanNet to refine predictions and model uncertainty in such a linear system, which reduces error accumulation in long-term forecasting. Extensive experiments demonstrate that $K^2$VAE outperforms state-of-the-art methods in both short- and long-term PTSF, providing a more efficient and accurate solution.
Poster
Adrien Cortes · Remi Rehm · Victor Letzelter
[ East Exhibition Hall A-B ]
Abstract
We introduce $\texttt{TimeMCL}$, a method leveraging the Multiple Choice Learning (MCL) paradigm to forecast multiple plausible time series futures. Our approach employs a neural network with multiple heads and utilizes the Winner-Takes-All (WTA) loss to promote diversity among predictions. MCL has recently gained attention due to its simplicity and ability to address ill-posed and ambiguous tasks. We propose an adaptation of this framework for time-series forecasting, presenting it as an efficient method to predict diverse futures, which we relate to its implicit *quantization* objective. We provide insights into our approach using synthetic data and evaluate it on real-world time series, demonstrating its promising performance at a light computational cost.
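A minimal sketch of the Winner-Takes-All loss at the heart of MCL-style forecasting: each sample only backpropagates through its best head, so the heads specialize into distinct plausible futures. Shapes and head count are illustrative assumptions, not the paper's exact configuration.

```python
import torch

def wta_loss(preds, target):
    """Winner-Takes-All: only the closest head per sample receives gradient.
    preds: (batch, n_heads, horizon); target: (batch, horizon)."""
    per_head_mse = ((preds - target.unsqueeze(1)) ** 2).mean(dim=-1)  # (batch, n_heads)
    return per_head_mse.min(dim=1).values.mean()                      # winner per sample

# e.g. a 4-head forecaster producing 4 plausible futures of length 24
preds = torch.randn(8, 4, 24, requires_grad=True)
loss = wta_loss(preds, torch.randn(8, 24))
loss.backward()
```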
Spotlight Poster
Mason Kamb · Surya Ganguli
[ East Exhibition Hall A-B ]
Abstract
We obtain an analytic, interpretable and predictive theory of creativity in convolutional diffusion models. Indeed, score-matching diffusion models can generate highly original images that lie far from their training data. However, optimal score-matching theory suggests that these models should only be able to produce memorized training examples. To reconcile this theory-experiment gap, we identify two simple inductive biases, locality and equivariance, that: (1) induce a form of combinatorial creativity by preventing optimal score-matching; (2) result in fully analytic, completely mechanistically interpretable, local score (LS) and equivariant local score (ELS) machines that, (3) after calibrating a single time-dependent hyperparameter, can quantitatively predict the outputs of trained convolution-only diffusion models (like ResNets and UNets) with high accuracy (median $r^2$ of $0.95, 0.94, 0.94, 0.96$ for our top model on CIFAR10, FashionMNIST, MNIST, and CelebA). Our model reveals a {\it locally consistent patch mosaic} mechanism of creativity, in which diffusion models create exponentially many novel images by mixing and matching different local training set patches at different scales and image locations. Our theory also partially predicts the outputs of pre-trained self-attention enabled UNets (median $r^2 \sim 0.77$ on CIFAR10), revealing an intriguing role for attention in carving out semantic coherence from local …
Poster
Xu Zhang · Kaidi Xu · Ziqing Hu · Ren Wang
[ East Exhibition Hall A-B ]
Abstract
Mixture of Experts (MoE) models have shown remarkable success in leveraging specialized expert networks for complex machine learning tasks. However, their susceptibility to adversarial attacks presents a critical challenge for deployment in robust applications. This paper addresses the critical question of how to incorporate robustness into MoEs while maintaining high natural accuracy. We begin by analyzing the vulnerability of MoE components, finding that expert networks are notably more susceptible to adversarial attacks than the router. Based on this insight, we propose a targeted robust training technique that integrates a novel loss function to enhance the adversarial robustness of MoE, requiring only the robustification of one additional expert without compromising training or inference efficiency. Building on this, we introduce a dual-model strategy that linearly combines a standard MoE model with our robustified MoE model using a smoothing parameter. This approach allows for flexible control over the robustness-accuracy trade-off. We further provide theoretical foundations by deriving certified robustness bounds for both the single MoE and the dual-model. To push the boundaries of robustness and accuracy, we propose a novel joint training strategy JTDMoE for the dual-model. This joint training enhances both robustness and accuracy beyond what is achievable with separate models. Experimental results …
Poster
Euijin You · Hyang-Won Lee
[ East Exhibition Hall A-B ]
Abstract
Fast adversarial training (FAT) aims to enhance the robustness of models against adversarial attacks with reduced training time. However, FAT often suffers from compromised robustness due to insufficient exploration of the adversarial space. In this paper, we develop a loss function to mitigate the problem of degraded robustness under FAT. Specifically, we derive a quadratic upper bound (QUB) on the adversarial training (AT) loss function and propose to utilize the bound with existing FAT methods. Our experimental results show that applying the QUB loss to existing methods yields significant improvements in robustness. Furthermore, using various metrics, we demonstrate that this improvement is likely to result from the smoothed loss landscape of the resulting model.
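The abstract leaves the bound implicit; assuming the loss is $K$-smooth in its input, one quadratic upper bound of the advertised kind (an assumption here, not necessarily the paper's exact QUB) is
$$\mathcal{L}(x+\delta)\;\le\;\mathcal{L}(x)+\nabla_x \mathcal{L}(x)^\top \delta+\tfrac{K}{2}\lVert\delta\rVert_2^2,$$
whose inner maximization over $\lVert\delta\rVert_2\le\epsilon$ has the closed form $\mathcal{L}(x)+\epsilon\lVert\nabla_x \mathcal{L}(x)\rVert_2+\tfrac{K}{2}\epsilon^2$, which makes single-step training cheap relative to multi-step inner attacks.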
Poster
Yi Xie · Zhanke Zhou · Chentao Cao · Qiyu Niu · Tongliang Liu · Bo Han
[ East Exhibition Hall A-B ]
Abstract
Multi-agent frameworks can substantially boost the reasoning power of large language models (LLMs), but they typically incur heavy computational costs and lack convergence guarantees. To overcome these challenges, we recast multi-LLM coordination as an incomplete-information game and seek a Bayesian Nash equilibrium (BNE), in which each agent optimally responds to its probabilistic beliefs about the strategies of others. We introduce Efficient Coordination via Nash Equilibrium (ECON), a hierarchical reinforcement-learning paradigm that marries distributed reasoning with centralized final output. Under ECON, each LLM independently selects responses that maximize its expected reward, conditioned on its beliefs about co-agents, without requiring costly inter-agent exchanges. We mathematically prove that ECON attains a markedly tighter regret bound than non-equilibrium multi-agent schemes. Empirically, ECON outperforms existing multi-LLM approaches by 11.2% on average across six benchmarks spanning complex reasoning and planning tasks. Further experiments demonstrate ECON’s ability to flexibly incorporate additional models, confirming its scalability and paving the way toward larger, more powerful multi-LLM ensembles. The code is publicly available at: https://github.com/tmlr-group/ECON.
Poster
Thibaut Boissin · Franck Mamalet · Thomas Fel · Agustin Picard · Thomas Massena · Mathieu Serrurier
[ East Exhibition Hall A-B ]
Abstract
Orthogonal convolutional layers are valuable components in multiple areas of machine learning, such as adversarial robustness, normalizing flows, GANs, and Lipschitz-constrained models. Their ability to preserve norms and ensure stable gradient propagation makes them valuable for a large range of problems. Despite their promise, the deployment of orthogonal convolutions in large-scale applications remains a significant challenge due to computational overhead and limited support for modern features like strides, dilations, group convolutions, and transposed convolutions. In this paper, we introduce **AOC** (Adaptive Orthogonal Convolution), a scalable method that extends a previous method (BCOP), effectively overcoming existing limitations in the construction of orthogonal convolutions. This advancement unlocks the construction of architectures that were previously considered impractical. We demonstrate through our experiments that our method produces expressive models that become increasingly efficient as they scale. To foster further advancement, we provide an open-source Python package implementing this method, called **Orthogonium**.
Poster
Bhavna Gopal · Huanrui Yang · Jingyang Zhang · Mark Horton · Yiran Chen
[ East Exhibition Hall A-B ]
Abstract
Adversarial training (AT) enhances neural network robustness. Typically, AT updates all trainable parameters, but this can lead to overfitting and increased errors on clean data. Research suggests that fine-tuning specific parameters may be more effective; however, methods for identifying these essential parameters and establishing effective optimization objectives remain inadequately addressed. We present CLAT, an innovative adversarial fine-tuning algorithm that mitigates adversarial overfitting by integrating "criticality" into the training process. Instead of tuning the entire model, CLAT identifies and fine-tunes fewer parameters in robustness-critical layers—those predominantly learning non-robust features—while keeping the rest of the model fixed. Additionally, CLAT employs a dynamic layer selection process that adapts to changes in layer criticality during training. Empirical results demonstrate that CLAT can be seamlessly integrated with existing adversarial training methods, enhancing clean accuracy and adversarial robustness by over 2% compared to baseline approaches.
Poster
Peimeng Guan · Mark Davenport
[ East Exhibition Hall A-B ]
Abstract
Inverse problems aim to reconstruct unseen data from corrupted or perturbed measurements. While most work focuses on improving reconstruction quality, generalization accuracy and robustness are equally important, especially for safety-critical applications. Model-based architectures (MBAs), such as loop unrolling methods, are considered more interpretable and achieve better reconstructions. Empirical evidence suggests that MBAs are more robust to perturbations than black-box solvers, but the accuracy-robustness tradeoff in MBAs remains underexplored. In this work, we propose a simple yet effective training scheme for MBAs, called SGD jittering, which injects noise iteration-wise during reconstruction. We theoretically demonstrate that SGD jittering not only generalizes better than the standard mean squared error training but is also more robust to average-case attacks. We validate SGD jittering using denoising toy examples, seismic deconvolution, and single-coil MRI reconstruction. Both SGD jittering and its SPGD extension yield cleaner reconstructions for out-of-distribution data and demonstrate enhanced robustness against adversarial attacks.
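A minimal sketch of iteration-wise noise injection in a loop-unrolled reconstruction. The linear forward operator, step size, noise scale, and the optional `net` module (standing in for a learned per-iteration component) are illustrative assumptions.

```python
import torch

def jittered_unrolled_recon(y, A, net=None, n_iters=20, step=0.5, jitter_std=0.01):
    """Loop-unrolled least-squares reconstruction; jitter_std > 0 injects noise
    at every iteration (the SGD-jittering idea, sketched under assumed notation)."""
    x = torch.zeros(A.shape[1])
    for _ in range(n_iters):
        x = x - step * (A.T @ (A @ x - y))        # data-fidelity gradient step
        if net is not None:
            x = net(x)                            # learned regularization step
        x = x + jitter_std * torch.randn_like(x)  # iteration-wise noise injection
    return x

A = torch.randn(20, 10)
A = A / torch.linalg.matrix_norm(A, 2)            # unit spectral norm keeps the step stable
x_true = torch.randn(10)
x_hat = jittered_unrolled_recon(A @ x_true, A)
```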
Poster
Zizheng Huang · Haoxing Chen · Jiaqi Li · jun lan · Huijia Zhu · Weiqiang Wang · Limin Wang
[ East Exhibition Hall A-B ]
Abstract
Recent Vision Mamba (Vim) models exhibit nearly linear complexity in sequence length, making them highly attractive for processing visual data. However, their training methodologies and potential remain insufficiently explored. In this paper, we investigate training strategies for Vim and propose Stochastic Layer-Wise Shuffle (SLWS), a novel regularization method that effectively improves Vim training. Without architectural modifications, this approach enables non-hierarchical Vim models to achieve leading performance on ImageNet-1K compared with similar-type counterparts. Our method operates through four simple steps per layer: probability allocation to assign layer-dependent shuffle rates, operation sampling via Bernoulli trials, sequence shuffling of input tokens, and order restoration of outputs. SLWS distinguishes itself through three principles: \textit{(1) Plug-and-play:} No architectural modifications are needed, and it is deactivated during inference. \textit{(2) Simple but effective:} The four-step process introduces only random permutations and negligible overhead. \textit{(3) Intuitive design:} Shuffling probabilities grow linearly with layer depth, aligning with the hierarchical semantic abstraction in vision models. Our work underscores the importance of tailored training strategies for Vim models and provides a helpful way to explore their scalability. Code and models are available at https://github.com/huangzizheng01/ShuffleMamba
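A minimal sketch following the four steps named in the abstract; the maximum rate `p_max` and the exact form of the depth-linear schedule are assumptions.

```python
import torch

def slws(tokens, layer_idx, n_layers, p_max=0.5, training=True):
    """Stochastic Layer-Wise Shuffle over a (batch, seq, dim) token sequence."""
    if not training:                                  # plug-and-play: off at inference
        return tokens, None
    p = p_max * (layer_idx + 1) / n_layers            # 1) depth-linear shuffle rate
    if torch.rand(()).item() >= p:                    # 2) Bernoulli trial
        return tokens, None
    perm = torch.randperm(tokens.shape[1])            # 3) shuffle input tokens
    return tokens[:, perm], torch.argsort(perm)       # keep inverse for step 4

# inside a Vim block: shuffled, inv = slws(x, i, n_layers); y = layer(shuffled)
# 4) order restoration: y = y[:, inv] if inv is not None else y
```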
Poster
Adeel Pervez · Efstratios Gavves · Francesco Locatello
[ East Exhibition Hall A-B ]
Abstract
We present Mechanistic PDE Networks -- a model for the discovery of governing *partial differential equations* from data. Mechanistic PDE Networks represent spatiotemporal data as space-time dependent *linear* partial differential equations in neural network hidden representations. The represented PDEs are then solved and decoded for specific tasks. The learned PDE representations naturally express the spatiotemporal dynamics in data in neural network hidden space, enabling increased modeling power. Solving the PDE representations in a compute- and memory-efficient way, however, is a significant challenge. We develop a native, GPU-capable, parallel, sparse, and differentiable multigrid solver specialized for linear partial differential equations that acts as a module in Mechanistic PDE Networks. Leveraging this PDE solver, we propose a discovery architecture that can discover nonlinear PDEs in complex settings while being robust to noise. We validate PDE discovery on a number of PDEs, including reaction-diffusion and Navier-Stokes equations.
Poster
Damjan Kalajdzievski
[ East Exhibition Hall A-B ]
Abstract
The field of mechanistic interpretability in pre-trained transformer models has demonstrated substantial evidence supporting the ''linear representation hypothesis'', which is the idea that high level concepts are encoded as vectors in the space of activations of a model. Studies also show that model generation behavior can be steered toward a given concept by adding the concept's vector to the corresponding activations. We show how to leverage these properties to build a form of logical implication into models, enabling transparent and interpretable adjustments that induce a chosen generation behavior in response to the presence of any given concept. Our method, Logical Implication Model Steering (LIMS), unlocks new hand-engineered reasoning capabilities by integrating neuro-symbolic logic into pre-trained transformer models.
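A minimal sketch of the implication-style intervention this describes, using the standard activation-addition recipe: when a detector direction fires for the condition concept, a behavior steering vector is added to the hidden states. The hook site, cosine threshold, and scale are illustrative assumptions, not the paper's exact LIMS construction.

```python
import torch
import torch.nn.functional as F

def add_implication_hook(block, detector_vec, behavior_vec, alpha=4.0, thresh=0.5):
    """Registers a forward hook implementing: concept present -> add steering vector."""
    def hook(module, inputs, output):
        h = output[0] if isinstance(output, tuple) else output   # (batch, seq, dim)
        presence = F.cosine_similarity(h, detector_vec, dim=-1)  # condition detector
        h = h + (presence > thresh).unsqueeze(-1) * alpha * behavior_vec
        return (h,) + output[1:] if isinstance(output, tuple) else h
    return block.register_forward_hook(hook)

# usage on a hypothetical transformer: add_implication_hook(model.layers[12], d, b)
```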
Poster
Jongin Lim · Sucheol Lee · Daeho Um · Sung-Un Park · Jinwoo Shin
[ East Exhibition Hall A-B ]
Abstract
Data imbalance remains a fundamental challenge in real-world machine learning. However, most existing work has focused on classification, leaving imbalanced regression underexplored despite its importance in many applications. To address this gap, we propose PRIME, a framework that leverages learnable proxies to construct a balanced and well-ordered feature space for imbalanced regression. At its core, PRIME arranges proxies to be uniformly distributed in the feature space while preserving the ordinal structure of regression targets, and then aligns each sample feature to its corresponding proxy. By using proxies as reference points, PRIME induces the desired structure of learned representations, promoting better generalization, especially in underrepresented target regions. Moreover, since proxy-based alignment resembles classification, PRIME enables the seamless application of class imbalance techniques to regression, facilitating more balanced feature learning. Extensive experiments demonstrate the effectiveness and broad applicability of PRIME, achieving state-of-the-art performance on four real-world regression benchmark datasets across diverse target domains.
Poster
YinFeng Chen · Jin Liu · Rui Qiu
[ East Exhibition Hall A-B ]
Abstract
The normal vectors obtained from the support vector machine (SVM) method offer the potential to achieve sufficient dimension reduction in both classification and regression scenarios. Motivated by this, we introduce a unified framework for nonlinear sufficient dimension reduction based on classification ensembles. Kernel principal SVM, which leverages the reproducing kernel Hilbert space, can almost be regarded as a special case of this framework, and we generalize it by using a neural network function class for more flexible deep nonlinear reduction. We theoretically prove its unbiasedness with respect to the central $\sigma$-field and provide a nonasymptotic upper bound for the estimation error. Simulations and real data analysis demonstrate the considerable competitiveness of the proposed method, especially under heavy data contamination, large sample sizes, and complex inputs.
Poster
Yaoqin He · Junchen Fu · Kaiwen Zheng · Songpei Xu · Fuhai Chen · Jie Li · Joemon Jose · Xuri Ge
[ East Exhibition Hall A-B ]
Abstract
In this paper, we present a novel approach, termed Double-Filter, to “slim down” the fine-tuning process of vision-language pre-trained (VLP) models by filtering redundancies in feature inputs and architectural components. We enhance the fine-tuning process in two ways. First, we develop a new patch selection method incorporating image patch filtering through background and foreground separation, followed by a refined patch selection process. Second, we design a genetic algorithm to eliminate redundant fine-grained architecture layers, improving the efficiency and effectiveness of the model. The former makes patch selection semantics more comprehensive, improving inference efficiency while preserving semantic representation. The latter’s fine-grained layer filter removes architectural redundancy to the extent possible and mitigates the impact on performance. Experimental results demonstrate that the proposed Double-Filter achieves superior fine-tuning efficiency and maintains competitive performance compared with advanced efficient fine-tuning methods on three downstream tasks: VQA, NLVR, and Retrieval. In addition, it proves effective with the METER and ViLT VLP models.
Spotlight Poster
Seungwook Han · Jinyeop Song · Jeff Gore · Pulkit Agrawal
[ East Exhibition Hall A-B ]
Abstract
Autoregressive transformers exhibit adaptive learning through in-context learning (ICL), raising the question of how this ability arises. Prior work has shown that transformers represent ICL tasks as vectors in their representations. In this paper, we leverage the encoding-decoding framework to study how transformers form task vectors during pretraining and how their task encoding quality predicts ICL task performance. On synthetic ICL tasks, we analyze the training dynamics of a small transformer and report the coupled emergence of task encoding and decoding. As the model learns to encode different latent tasks (e.g., "Finding the first noun in a sentence.") into distinct, separable representations, it concurrently builds conditional decoding algorithms and improves its ICL performance. We validate this phenomenon across pretrained models of varying scales (Gemma-2 2B/9B/27B, Llama-3.1 8B/70B) and over the course of pretraining in OLMo-7B. Further, we demonstrate that the quality of task encoding inferred from representations predicts ICL performance, and that, surprisingly, finetuning the earlier layers can improve the task encoding and performance more than finetuning the later layers. Our empirical insights shed light on the success and failure modes of large language models via their representations.
Poster
Edoardo Cetin · Tianyu Zhao · Yujin Tang
[ East Exhibition Hall A-B ]
Abstract
We propose a new finetuning method to provide pre-trained large language models (LMs) the ability to scale test-time compute through the diffusion framework. By increasing the number of diffusion steps, we show our finetuned models achieve monotonically increasing accuracy, directly translating to improved performance across downstream tasks. Furthermore, our finetuned models can expertly answer questions on specific topics by integrating powerful guidance techniques, and autonomously determine the compute required for a given problem by leveraging adaptive ODE solvers. Our method is applicable to any foundation model pre-trained with cross-entropy and does not modify any of its original weights, fully preserving its strong single-step generation capabilities. We show our method can be more effective and is fully compatible with traditional finetuning and search approaches, introducing an orthogonal new direction to unify the strengths of the autoregressive and diffusion frameworks.
Poster
Songlin Zhai · Yuan Meng · Yongrui Chen · Yiwei Wang · Guilin Qi
[ East Exhibition Hall A-B ]
Abstract
Large Language Models (LLMs) have revolutionized various natural language processing tasks with their remarkable capabilities. However, a challenge persists in effectively processing new information, particularly in the area of long-term knowledge updates, without compromising model performance. To address this challenge, this paper introduces a novel memory augmentation framework that conceptualizes memory as a peripheral component (akin to physical RAM), with the LLM serving as the information processor (analogous to a CPU). Drawing inspiration from RAM architecture, we design memory as a sequence of memory banks, each modeled using a Kolmogorov-Arnold Network (KAN) to ensure smooth state transitions. Memory read and write operations are dynamically controlled by query signals derived from the LLM's internal states, closely mimicking the interaction between a CPU and RAM. Furthermore, a dedicated memory bank is used to generate a mask value that indicates the relevance of the retrieved data, inspired by the sign bit in binary coding schemes. The retrieved memory feature is then integrated as a prefix to enhance the model prediction. Extensive experiments …
Spotlight Poster
Thomas Pouplin · Katarzyna Kobalczyk · Hao Sun · Mihaela van der Schaar
[ East Exhibition Hall A-B ]
Abstract
Developing autonomous agents capable of performing complex, multi-step decision-making tasks specified in natural language remains a significant challenge, particularly in realistic settings where labeled data is scarce and real-time experimentation is impractical. Existing reinforcement learning (RL) approaches often struggle to generalize to unseen goals and states, limiting their applicability. In this paper, we introduce $\textit{TEDUO}$, a novel training pipeline for offline language-conditioned policy learning in symbolic environments. Unlike conventional methods, $\textit{TEDUO}$ operates on readily available, unlabeled datasets and addresses the challenge of generalization to previously unseen goals and states. Our approach harnesses large language models (LLMs) in a dual capacity: first, as automatization tools augmenting offline datasets with richer annotations, and second, as generalizable instruction-following agents. Empirical results demonstrate that $\textit{TEDUO}$ achieves data-efficient learning of robust language-conditioned policies, accomplishing tasks beyond the reach of conventional RL frameworks or out-of-the-box LLMs alone.
Poster
Harit Vishwakarma · Alan Mishler · Thomas Cook · Niccolo Dalmasso · Natraj Raman · Sumitra Ganesh
[ East Exhibition Hall A-B ]
Abstract
Large language models (LLMs) are empowering decision-making in several applications, including tool or API usage and answering multiple-choice questions (MCQs). However, incorrect outputs pose significant risks in high-stakes domains like healthcare and finance. To quantify LLM uncertainty and thereby mitigate these risks, recent works employ conformal prediction (CP), a model- and distribution-agnostic framework that uses LLM outputs to generate a \emph{prediction set} containing the true answer with high probability. Leveraging CP, we propose \emph{conformal revision of questions} (CROQ), which revises the question by narrowing down the available choices to those in the prediction set and asking the LLM the revised question. We expect LLMs to be more accurate on revised questions with fewer choices. Furthermore, we expect CROQ to be effective when the prediction sets from CP are small. Commonly used logit scores often lead to large sets, diminishing CROQ's effectiveness. To overcome this, we propose CP-OPT, an optimization framework to learn scores that minimize set sizes while maintaining coverage. Our extensive experiments on MMLU, ToolAlpaca, and TruthfulQA datasets with multiple LLMs show that CROQ improves accuracy over the standard inference, with more pronounced gains when paired with CP-OPT.
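A minimal split-conformal sketch of the prediction-set step that CROQ builds on, over per-option scores for MCQs. The score convention (higher is better, nonconformity = 1 - score) and coverage level are illustrative; CP-OPT would replace the raw scores with learned ones.

```python
import numpy as np

def conformal_option_sets(cal_scores, cal_labels, test_scores, alpha=0.1):
    """cal_scores: (n, n_options) option scores; cal_labels: (n,) true options.
    Requires n large enough that ceil((n+1)(1-alpha))/n <= 1."""
    n = len(cal_scores)
    nonconf = 1.0 - cal_scores[np.arange(n), cal_labels]   # 1 - score of true option
    qhat = np.quantile(nonconf, np.ceil((n + 1) * (1 - alpha)) / n, method="higher")
    return [np.where(1.0 - s <= qhat)[0] for s in test_scores]

# CROQ then re-asks the LLM with each question's choices narrowed to its set.
```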
Poster
Jialin Zhao · Yingtao Zhang · Carlo Cannistraci
[ East Exhibition Hall A-B ]
Abstract
The rapid growth of Large Language Models has driven demand for effective model compression techniques to reduce memory and computation costs. Low-rank pruning has gained attention for its GPU compatibility across all densities. However, low-rank pruning struggles to match the performance of semi-structured pruning, often doubling perplexity at similar densities. In this paper, we propose **Pi**voting **Fa**ctorization (**PIFA**), a novel **lossless** meta low-rank representation that unsupervisedly learns a **compact** form of any low-rank representation, effectively eliminating redundant information. PIFA identifies pivot rows (linearly independent rows) and expresses non-pivot rows as linear combinations, achieving **24.2\%** additional memory savings and **24.6\%** faster inference over low-rank layers at rank = 50\% of dimension. To mitigate the performance degradation caused by low-rank pruning, we introduce a novel, retraining-free reconstruction method that **m**inimizes error accumulation (**M**). **MPIFA**, combining M and PIFA into an end-to-end framework, significantly outperforms existing low-rank pruning methods, and achieves performance comparable to semi-structured pruning, while surpassing it in GPU efficiency and compatibility. Our code is available at https://github.com/biomedical-cybernetics/pivoting-factorization.
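A minimal sketch of the pivoting idea: find linearly independent (pivot) rows via column-pivoted QR on the transpose, then express every row as a linear combination of the pivot rows, so the representation is lossless up to numerical tolerance. This is generic linear algebra, not the paper's optimized implementation.

```python
import numpy as np
from scipy.linalg import qr

def pivot_factorize(W, tol=1e-8):
    """Return pivot row indices and coefficients C with C @ W[pivots] == W."""
    Q, R, piv = qr(W.T, pivoting=True)              # column pivoting on W^T = row pivoting on W
    rank = int((np.abs(np.diag(R)) > tol * abs(R[0, 0])).sum())
    pivots = piv[:rank]                             # linearly independent rows of W
    C = np.linalg.lstsq(W[pivots].T, W.T, rcond=None)[0].T
    return pivots, C

W = np.random.randn(4, 6)
W = np.vstack([W, W[0] + 2 * W[1]])                 # append a redundant (non-pivot) row
pivots, C = pivot_factorize(W)
assert np.allclose(C @ W[pivots], W)                # lossless reconstruction
```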
Poster
Seungjun Shin · Jaehoon Oh · Dokwan Oh
[ East Exhibition Hall A-B ]
Abstract
Attention mechanisms are central to the success of large language models (LLMs), enabling them to capture intricate token dependencies and implicitly assign importance to each token. Recent studies have revealed the sink token, which receives disproportionately high attention despite its limited semantic role. In this paper, we first expand the relationship between the sink token and other tokens, moving beyond attention to explore their similarity in hidden states, considering the layer depth. We observe that as the layers get deeper, the cosine similarity between the normalized hidden states of the sink token and those of other tokens increases, and that the normalized hidden states of the sink token exhibit negligible changes. These observations imply that other tokens are consistently directed toward the sink token throughout the layers. Next, we propose a dynamic token selection method, called OrthoRank, that uses these findings to select important tokens. Specifically, in a given layer, we define token importance by the speed at which the token moves toward the sink token. This is converted into orthogonality with the sink token, meaning that tokens more orthogonal to the sink token are assigned greater importance. Finally, through extensive experiments, we demonstrate that our method results in lower …
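A minimal sketch of the orthogonality-based importance score described above, assuming the sink is the first token; the exact normalization and selection rule in the paper may differ.

```python
import torch

def orthorank_importance(hidden, sink_idx=0):
    """hidden: (seq, dim) hidden states at one layer. Returns per-token scores
    where tokens more orthogonal to the sink token score higher."""
    h = torch.nn.functional.normalize(hidden, dim=-1)   # unit-norm hidden states
    cos_to_sink = h @ h[sink_idx]                        # cosine similarity to sink
    return 1.0 - cos_to_sink.abs()                       # orthogonal -> important

# tokens with the top scores would be the ones selected at this layer
scores = orthorank_importance(torch.randn(128, 64))
```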
Poster
Jiancong Xiao · Bojian Hou · Zhanliang Wang · Ruochen Jin · Qi Long · Weijie Su · Li Shen
[ East Exhibition Hall A-B ]
Abstract
One of the key technologies for the success of Large Language Models (LLMs) is preference alignment. However, a notable side effect of preference alignment is poor calibration: while the pre-trained models are typically well-calibrated, LLMs tend to become poorly calibrated after alignment with human preferences. In this paper, we investigate why preference alignment affects calibration and how to address this issue. For the first question, we observe that the preference collapse issue in alignment undesirably generalizes to the calibration scenario, causing LLMs to exhibit overconfidence and poor calibration. To address this, we demonstrate the importance of fine-tuning with domain-specific knowledge to alleviate the overconfidence issue. To further analyze whether this affects the model's performance, we categorize models into two regimes: calibratable and non-calibratable, defined by bounds of Expected Calibration Error (ECE). In the calibratable regime, we propose a calibration-aware fine-tuning approach to achieve proper calibration without compromising LLMs' performance. However, as models are further fine-tuned for better performance, they enter the non-calibratable regime. For this case, we develop an EM-algorithm-based ECE regularization for the fine-tuning loss to maintain low calibration error. Extensive experiments validate the effectiveness of the proposed methods.
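For reference, the quantity defining the two regimes can be estimated with the standard binned ECE; equal-width bins are one common convention, and the paper's exact estimator may differ.

```python
import numpy as np

def ece(conf, correct, n_bins=10):
    """Expected Calibration Error. conf: predicted confidences in [0, 1];
    correct: boolean/0-1 array of whether each prediction was right."""
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    err = 0.0
    for lo, hi in zip(edges[:-1], edges[1:]):
        in_bin = (conf > lo) & (conf <= hi)
        if in_bin.any():    # |acc - avg conf| weighted by the bin's share of samples
            err += in_bin.mean() * abs(correct[in_bin].mean() - conf[in_bin].mean())
    return err
```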
Spotlight Poster
Qinglin Zhu · Runcong Zhao · Hanqi Yan · Yulan He · Yudong Chen · Lin Gui
[ East Exhibition Hall A-B ]
Abstract
Large Language Models (LLMs) struggle with complex reasoning due to limited diversity and inefficient search. We propose Soft Reasoning, an embedding-based search framework that optimises the embedding of the first token to guide generation. It combines (1) embedding perturbation for controlled exploration and (2) Bayesian optimisation to refine embeddings via a verifier-guided objective, balancing exploration and exploitation. This approach improves reasoning accuracy and coherence while avoiding reliance on heuristic search. Experiments demonstrate superior correctness with minimal computation, making it a scalable, model-agnostic solution.
Poster
Harshvardhan Agarwal · Sunita Sarawagi
[ East Exhibition Hall A-B ]
Abstract
Large language models (LLMs) have demonstrated the capability to perform in-context learning (ICL) for completely unseen tasks in classification or language completion. Sequence-to-sequence (seq2seq) is another popular task category with several applications seeking quick adaptation with ICL. We present a systematic analysis of the ICL capability of LLMs on seq2seq tasks using a formal structured language pair. Our study reveals a critical limitation: except for very short input sequences, ICL fails to achieve consistent learning across all output positions. This exposes a fundamental weakness of modern LLMs — their inability to effectively uncover the alignment between input and output sequences. Consequently, this limitation results in incomplete induction heads, which are essential for in-context learning of new discrete mappings. To address these limitations, we propose ICA-Tune, a method for focused fine-tuning of an LLM using in-context examples. We present a mechanistic evaluation with two accuracy probes to show how input-output alignment emerges in middle layers of an LLM without direct supervision. This alignment leads to an abrupt jump in the completeness of the induction heads in higher layers. We show that, compared to standard fine-tuning, ICA-Tune enables more sample-efficient learning and better generalization to OOD instances.
Spotlight Poster
Hjalmar Wijk · Tao Lin · Joel Becker · Sami Jawhar · Neev Parikh · Thomas Broadley · Lawrence Chan · Michael Chen · Joshua Clymer · Jai Dhyani · Elena Ericheva · Katharyn Garcia · Brian Goodrich · Nikola Jurkovic · Megan Kinniment · Aron Lajko · Seraphina Nix · Lucas Jun Koba Sato · William Saunders · Maksym Taran · Ben West · Elizabeth Barnes
[ East Exhibition Hall A-B ]
Abstract
Frontier AI safety policies highlight automation of AI research and development (R&D) by AI agents as an important capability to anticipate. However, there exist few evaluations for AI R&D capabilities, and none that are highly realistic and have a direct comparison to human performance. We introduce RE-Bench (Research Engineering Benchmark, V1), which consists of 7 challenging, open-ended ML research engineering environments and data from 71 8-hour attempts by 61 distinct human experts. We confirm that our experts make progress in the environments given 8 hours, with 82% of expert attempts achieving a non-zero score and 24% matching or exceeding our strong reference solutions. We compare humans to several public frontier models through best-of-$k$ with varying time budgets and agent designs, and find that the best AI agents achieve a score 4× higher than human experts when both are given a total time budget of 2 hours per environment. However, humans currently display better returns to increasing time budgets, narrowly exceeding the top AI agent scores given an 8-hour budget, and achieving 2× the score of the top AI agent when both are given 32 total hours (across different attempts).
Poster
Anjiang Wei · Allen Nie · Thiago Teixeira · Rohan Yadav · Wonchan Lee · Ke Wang · Alex Aiken
[ East Exhibition Hall A-B ]
Abstract
Modern scientific discovery increasingly relies on high-performance computing for complex modeling and simulation. A key challenge in improving parallel program performance is efficiently mapping tasks to processors and data to memory, a process dictated by intricate, low-level system code known as *mappers*. Developing high-performance mappers demands days of manual tuning, posing a significant barrier for domain scientists without systems expertise. We introduce a framework that automates mapper development with generative optimization, leveraging richer feedback beyond scalar performance metrics. Our approach features the Agent-System Interface, which includes a Domain-Specific Language (DSL) to abstract away the low-level complexity of system code and define a structured search space, as well as AutoGuide, a mechanism that interprets raw execution output into actionable feedback. Unlike traditional reinforcement learning methods such as OpenTuner, which rely solely on scalar feedback, our method finds superior mappers in far fewer iterations. With just 10 iterations, it outperforms OpenTuner even after 1000 iterations, achieving $3.8\times$ faster performance. Our approach finds mappers that surpass expert-written mappers by up to $1.34\times$ speedup across nine benchmarks while reducing tuning time from days to minutes.
Spotlight Poster
Shashwat Goel · Joschka Strüber · Ilze Amanda Auzina · Karuna Chandra · Ponnurangam Kumaraguru · Douwe Kiela · Ameya Pandurang Prabhu · Matthias Bethge · Jonas Geiping
[ East Exhibition Hall A-B ]
Abstract
As Language Model (LM) capabilities advance, evaluating and supervising them at scale is getting harder for humans. There is hope that other language models can automate both these tasks, which we refer to as *AI Oversight*. We study how model similarity affects both aspects of AI oversight by proposing *Chance Adjusted Probabilistic Agreement (CAPA)*--a metric for LM similarity based on overlap in model mistakes. Using CAPA, we first show that *LLM-as-a-judge* scores favor models similar to the judge, generalizing recent self-preference results. Then, we study training on LM annotations, and find complementary knowledge between the weak supervisor and strong student model plays a crucial role in gains from *weak-to-strong generalization*. As model capabilities increase, it becomes harder to find their mistakes, and we might defer more to AI oversight. However, we observe a concerning trend--model mistakes are becoming more similar with increasing capabilities, pointing to risks from correlated failures. Our work underscores the importance of reporting and correcting for model similarity, especially in the emerging paradigm of AI oversight.
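The abstract does not give CAPA's formula; as a rough, explicitly non-identical stand-in, a kappa-style chance adjustment of error overlap between two models can be computed as follows (the names and the chance model here are assumptions).

```python
import numpy as np

def chance_adjusted_error_agreement(preds_a, preds_b, labels):
    """Agreement on *mistakes* between two models, adjusted for the agreement
    expected by chance given each model's error rate (a generic kappa-style
    stand-in, not CAPA's exact probabilistic formulation)."""
    err_a, err_b = preds_a != labels, preds_b != labels
    observed = np.mean(err_a == err_b)               # raw error-pattern agreement
    p_a, p_b = err_a.mean(), err_b.mean()
    expected = p_a * p_b + (1 - p_a) * (1 - p_b)     # chance-level agreement
    return (observed - expected) / (1 - expected + 1e-12)
```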
Poster
Jiali Cheng · Hadi Amiri
[ East Exhibition Hall A-B ]
Abstract
Tool-augmented large language models (LLMs) may need to forget learned tools due to security concerns, privacy restrictions, or deprecated tools. However, ``tool unlearning'' has not been investigated in the machine unlearning literature. We introduce this novel task, which requires addressing distinct challenges compared to traditional unlearning: knowledge removal rather than forgetting individual samples, the high cost of optimizing LLMs, and the need for principled evaluation metrics. To bridge these gaps, we propose ToolDelete, the first approach for unlearning tools from tool-augmented LLMs, which implements three properties for effective tool unlearning, along with a new membership inference attack (MIA) model for evaluation. Experiments on three tool learning datasets and tool-augmented LLMs show that ToolDelete effectively unlearns both randomly selected and category-specific tools, while preserving the LLM's knowledge of non-deleted tools and maintaining performance on general tasks.
Poster
Dong HUANG · Guangtao Zeng · Jianbo Dai · Meng Luo · Han Weng · Yuhao QING · Heming Cui · Zhijiang Guo · Jie Zhang
[ East Exhibition Hall A-B ]
Abstract
As large language models (LLMs) play an increasingly important role in code generation, enhancing both correctness and efficiency has become crucial. Current methods primarily focus on correctness, often overlooking efficiency. To address this gap, we introduce SWIFTCODE to improve both aspects by fine-tuning LLMs on a high-quality dataset comprising correct and efficient code samples. Our methodology involves leveraging multiple LLMs to generate diverse candidate code solutions for various tasks across different programming languages. We then evaluate these solutions by directly measuring their execution time and memory usage through local execution. The code solution with the lowest execution time and memory consumption is selected as the final output for each task. Experimental results demonstrate significant improvements when fine-tuning with SWIFTCODE. For instance, Qwen2.5-Coder-7B-Instruct's pass@1 score increases from 44.8\% to 57.7\%, while the average execution time for correct tasks decreases by 48.4\%. SWIFTCODE offers a scalable and effective solution for advancing AI-driven code generation, benefiting both software development and computational problem-solving.
Poster
Yifei Xu · Tusher Chakraborty · Emre Kiciman · Bibek Aryal · Srinagesh Sharma · Songwu Lu · Ranveer Chandra
[ East Exhibition Hall A-B ]
Abstract
Fine-tuning large language models (LLMs) to align with user preferences is challenging due to the high cost of quality human annotations in Reinforcement Learning from Human Feedback (RLHF) and the generalizability limitations of AI Feedback. To address these challenges, we propose RLTHF, a human-AI hybrid framework that combines LLM-based initial alignment with selective human annotations to achieve full-human annotation alignment with minimal effort. RLTHF identifies hard-to-annotate samples mislabeled by LLMs using a reward model's reward distribution and iteratively enhances alignment by integrating strategic human corrections while leveraging LLM's correctly labeled samples. Evaluations on HH-RLHF and TL;DR datasets show that RLTHF reaches full-human annotation-level alignment with only 6-7% of the human annotation effort. Furthermore, models trained on RLTHF's curated datasets for downstream tasks outperform those trained on fully human-annotated datasets, underscoring the effectiveness of RLTHF.
Poster
Albert Gong · Kamilė Stankevičiūtė · Chao Wan · Anmol Kabra · Raphael Thesmar · Johann Lee · Julius Klenke · Carla Gomes · Kilian Weinberger
[ East Exhibition Hall A-B ]
Abstract
High-quality benchmarks are essential for evaluating reasoning and retrieval capabilities of large language models (LLMs). However, curating datasets for this purpose is not a permanent solution as they are prone to data leakage and inflated performance results. To address these challenges, we propose PhantomWiki: a pipeline to generate unique, factually consistent document corpora with diverse question-answer pairs. Unlike prior work, PhantomWiki is neither a fixed dataset, nor is it based on any existing data. Instead, a new PhantomWiki instance is generated on demand for each evaluation. We vary the question difficulty and corpus size to disentangle reasoning and retrieval capabilities, respectively, and find that PhantomWiki datasets are surprisingly challenging for frontier LLMs. Thus, we contribute a scalable and data leakage-resistant framework for disentangled evaluation of reasoning, retrieval, and tool-use abilities.
Spotlight Poster
Yiyang Fang · Jian Liang · Wenke Huang · He Li · Kehua Su · Mang Ye
[ East Exhibition Hall A-B ]
Abstract
Multimodal large language models (MLLMs) have achieved impressive progress in tasks such as visual question answering and visual understanding, but they still face significant challenges in emotional reasoning. Current methods to enhance emotional understanding typically rely on fine-tuning or manual annotations, which are resource-intensive and limit scalability. In this work, we focus on improving the ability of MLLMs to capture emotions during the inference phase. Specifically, MLLMs encounter two main issues: they struggle to distinguish between semantically similar emotions, leading to misclassification, and they are overwhelmed by redundant or irrelevant visual information, which distracts from key emotional cues. To address these, we propose Sharpening Emotion Perception in MLLMs (SEPM), which incorporates a Confidence-Guided Coarse-to-Fine Inference framework to refine emotion classification by guiding the model through simpler tasks. Additionally, SEPM employs Focus-on-Emotion Visual Augmentation to reduce visual redundancy by directing the attention of models to relevant emotional cues in images. Experimental results demonstrate that SEPM significantly improves MLLM performance on emotion-related tasks, providing a resource-efficient and scalable solution for emotion recognition.
Poster
Bowen Jin · Jinsung Yoon · Zhen Qin · Ziqi Wang · Wei Xiong · Yu Meng · Jiawei Han · Sercan Arik
[ East Exhibition Hall A-B ]
Abstract
Large Language Models (LLMs) have revolutionized artificial intelligence with capabilities in reasoning, coding, and communication, driving innovation across industries. Their true potential depends on effective alignment to ensure correct, trustworthy and ethical behavior, addressing challenges like misinformation, hallucinations, bias and misuse. While existing Reinforcement Learning (RL)-based alignment methods are notoriously complex, direct optimization approaches offer a simpler alternative. In this work, we introduce a novel direct optimization approach for LLM alignment by drawing on established Information Retrieval (IR) principles. We present a systematic framework that bridges LLM alignment and IR methodologies, mapping LLM generation and reward models to IR's retriever-reranker paradigm. Building on this foundation, we propose LLM Alignment as Retriever Preference Optimization (LarPO), a new alignment method that enhances overall alignment quality. Extensive experiments validate LarPO's effectiveness with 38.9% and 13.7% averaged improvements on AlpacaEval2 and MixEval-Hard respectively. Our work opens new avenues for advancing LLM alignment by integrating IR foundations, offering a promising direction for future research.
Poster
Leshem Choshen · Yang Zhang · Jacob Andreas
[ East Exhibition Hall A-B ]
Abstract
Scaling laws predict the loss of a target machine learning model by extrapolating from easier-to-train models with fewer parameters or smaller training sets. This provides an efficient way for practitioners and researchers alike to compare pre-training decisions involving optimizers, datasets, and model architectures. Despite the widespread use of scaling laws to model the dynamics of language model training, there has been little work on understanding how to best estimate and interpret them. We collect (and release) a large-scale dataset containing losses and downstream evaluations for 485 previously published pretrained models. We use these to estimate more than 1000 scaling laws, then derive a set of best practices for estimating scaling laws in new model families. We find that fitting scaling laws to intermediate checkpoints of training runs (and not just their final losses) substantially improves accuracy, and that—all else equal—estimates of performance are generally most accurate when derived from other models of similar sizes. However, because there is a significant degree of variability across model seeds, training multiple small models is sometimes more useful than training a single large one. Moreover, while different model families differ in scaling behavior, they are often similar enough that a target model’s behavior can be predicted from a single model with the same architecture, along with scaling parameter estimates …
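A minimal sketch of fitting one such scaling law to (parameter count, loss) pairs from smaller models in a family. The saturating power-law form $L(N)=aN^{-b}+c$ is one common choice, and the data points and initial guess here are illustrative assumptions.

```python
import numpy as np
from scipy.optimize import curve_fit

def scaling_law(n_params, a, b, c):
    """Saturating power law L(N) = a * N^(-b) + c."""
    return a * n_params ** (-b) + c

# hypothetical (N, loss) pairs from smaller models in one family
N = np.array([1e7, 3e7, 1e8, 3e8, 1e9])
L = np.array([4.2, 3.8, 3.4, 3.1, 2.9])
(a, b, c), _ = curve_fit(scaling_law, N, L, p0=[10.0, 0.1, 2.0], maxfev=20000)
print(scaling_law(1e10, a, b, c))   # extrapolated loss for a 10B-parameter model
```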
Poster
Bernal Jimenez Gutierrez · Yiheng Shu · Weijian Qi · Sizhe Zhou · Yu Su
[ East Exhibition Hall A-B ]
Abstract
Our ability to continuously acquire, organize, and leverage knowledge is a key feature of human intelligence that AI systems must approximate to unlock their full potential. Given the challenges in continual learning with large language models (LLMs), retrieval-augmented generation (RAG) has become the dominant way to introduce new information. However, its reliance on vector retrieval hinders its ability to mimic the dynamic and interconnected nature of human long-term memory. Recent RAG approaches augment vector embeddings with various structures like knowledge graphs to address some of these gaps, namely sense-making and associativity. However, their performance on more basic factual memory tasks drops considerably below standard RAG. We address this unintended deterioration and propose HippoRAG 2, a framework that outperforms standard RAG comprehensively on factual, sense-making, and associative memory tasks. HippoRAG 2 builds upon the Personalized PageRank algorithm used in HippoRAG and enhances it with deeper passage integration and more effective online use of an LLM. This combination pushes this RAG system closer to the effectiveness of human long-term memory, achieving a 7% improvement in associative memory tasks over the state-of-the-art embedding model while also exhibiting superior factual knowledge and sense-making memory capabilities. This work paves the way for non-parametric continual learning for …
Poster
Guoxuan Chen · Han Shi · jiawei li · Yihang Gao · Xiaozhe Ren · Yimeng Chen · Xin Jiang · Zhenguo Li · Weiyang Liu · Chao Huang
[ East Exhibition Hall A-B ]
Abstract
Large Language Models (LLMs) have exhibited exceptional performance across a spectrum of natural language processing tasks. However, their substantial sizes pose considerable challenges, particularly in computational demands and inference speed, due to their quadratic complexity. In this work, we have identified a key pattern: certain seemingly meaningless separator tokens (i.e., punctuations) contribute disproportionately to attention scores compared to semantically meaningful tokens. This observation suggests that information of the segments between these separator tokens can be effectively condensed into the separator tokens themselves without significant information loss. Guided by this insight, we introduce SepLLM, a plug-and-play framework that accelerates inference by compressing these segments and eliminating redundant tokens. Additionally, we implement efficient kernels for training acceleration. Experimental results across training-free, training-from-scratch, and post-training settings demonstrate SepLLM's effectiveness. Notably, using the Llama-3-8B backbone, SepLLM achieves over 50% reduction in KV cache on the GSM8K-CoT benchmark while maintaining comparable performance. Furthermore, in streaming settings, SepLLM effectively processes sequences of up to 4 million tokens or more while maintaining consistent language modeling capabilities.
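A minimal sketch of the separator-keeping idea, assuming a simple (sequence, dim) cache layout; the separator set, window sizes, and function names are illustrative rather than SepLLM's actual kernels:

```python
# Keep initial tokens (attention sinks), separator tokens, and a recent
# window in the KV cache; drop everything else.
import torch

SEPARATORS = {".", ",", ";", ":", "!", "?", "\n"}

def compress_kv(tokens, keys, values, n_init=4, n_recent=64):
    """tokens: list[str]; keys/values: (seq, dim) tensors."""
    seq_len = len(tokens)
    keep = set(range(min(n_init, seq_len)))                   # initial tokens
    keep |= set(range(max(0, seq_len - n_recent), seq_len))   # recent window
    keep |= {i for i, t in enumerate(tokens) if t in SEPARATORS}
    idx = torch.tensor(sorted(keep))
    return idx, keys[idx], values[idx]

tokens = ["The", " cat", " sat", ".", " It", " slept", "."]
K, V = torch.randn(7, 8), torch.randn(7, 8)
idx, Kc, Vc = compress_kv(tokens, K, V, n_init=1, n_recent=2)
print(idx.tolist())  # [0, 3, 5, 6]: sinks, separators, and the window survive
```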
Poster
Han Zhong · Yutong Yin · Shenao Zhang · Xiaojun Xu · Yuanxin Liu · Yifei Zuo · Zhihan Liu · Boyi Liu · Sirui Zheng · Hongyi Guo · Liwei Wang · Mingyi Hong · Zhaoran Wang
[ East Exhibition Hall A-B ]
Abstract
Large Language Models (LLMs) have demonstrated remarkable capabilities in complex reasoning tasks, yet generating reliable reasoning processes remains a significant challenge. We present a unified probabilistic framework that formalizes LLM reasoning through a novel graphical model incorporating latent thinking processes and evaluation signals. Our framework addresses two critical questions: (1) how to generate high-quality reasoning processes during inference automatically, and (2) how to integrate these processes into post-training. We propose the \emph{Bootstrapping Reinforced Thinking Process} (BRiTE) algorithm and demonstrate its theoretical convergence at a rate of $1/T$, where $T$ is the number of iterations. The algorithm operates in two steps. First, it generates high-quality rationales by approximating the desired posterior distribution using a reinforcement learning approach with a novel reward shaping mechanism. Second, it fine-tunes the base LLM by maximizing the joint probability of rationale generation with respect to LLM parameters. Empirical evaluation on GSM8K and MATH benchmarks demonstrates that our approach consistently improves performance across different model sizes without requiring human-annotated thinking processes, outperforming standard chain-of-thought prompting while enhancing existing post-training methods.
Poster
Taneesh Gupta · Rahul Madhavan · Xuchao Zhang · Chetan Bansal · Saravanakumar Rajmohan
[ East Exhibition Hall A-B ]
Abstract
Multi-preference optimization enriches language-model alignment beyond pairwise preferences by contrasting entire sets of helpful and undesired responses, enabling richer training signals for large language models. During self-play alignment, these models often produce numerous candidate answers per query, making it computationally infeasible to include all of them in the training objective. We propose Active Multi-Preference Optimization (AMPO), which combines on-policy generation, a multi-preference group-contrastive loss, and active subset selection. Specifically, we score and embed large candidate pools of responses, then pick a small but informative subset—covering reward extremes and distinct semantic clusters—for preference optimization. The resulting contrastive-training scheme identifies not only the best and worst answers but also subtle, underexplored modes crucial for robust alignment. Theoretically, we provide guarantees of expected reward maximization using our active selection method. Empirically, AMPO achieves state-of-the-art results on AlpacaEval with Llama 8B and Mistral 7B. We release our datasets [here](https://huggingface.co/Multi-preference-Optimization).
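A minimal sketch of the kind of active subset selection described (reward extremes plus representatives of distinct semantic clusters); the clustering choice and function names are assumptions, not the authors' implementation:

```python
# Select a small, informative subset of candidate responses: the best and
# worst by reward, plus the medoid of each embedding cluster.
import numpy as np
from sklearn.cluster import KMeans

def select_subset(embeddings, rewards, n_clusters=4):
    chosen = {int(np.argmax(rewards)), int(np.argmin(rewards))}  # reward extremes
    km = KMeans(n_clusters=n_clusters, n_init=10).fit(embeddings)
    for c in range(n_clusters):
        members = np.where(km.labels_ == c)[0]
        # medoid: the member closest to its cluster centroid
        d = np.linalg.norm(embeddings[members] - km.cluster_centers_[c], axis=1)
        chosen.add(int(members[np.argmin(d)]))
    return sorted(chosen)

emb = np.random.randn(32, 16)   # embeddings of 32 candidate responses
rew = np.random.randn(32)       # reward-model scores
print(select_subset(emb, rew))  # indices to keep for preference optimization
```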
Poster
Sebastian Bordt · Suraj Srinivas · Valentyn Boreiko · Ulrike Luxburg
[ East Exhibition Hall A-B ]
Abstract
The leakage of benchmark data into the training data has emerged as a significant challenge for evaluating the capabilities of large language models (LLMs). In this work, we challenge the common assumption that small-scale contamination renders benchmark evaluations invalid. First, we experimentally quantify the magnitude of benchmark overfitting based on scaling along three dimensions: the number of model parameters (up to 1.6B), the number of times an example is seen (up to 144), and the number of training tokens (up to 40B). If model and data follow the Chinchilla scaling laws, minor contamination indeed leads to overfitting. At the same time, even contamination repeated 144 times can be forgotten if the training data is scaled beyond five times Chinchilla, a regime characteristic of many modern LLMs. Continual pre-training of OLMo-7B corroborates these results. Next, we study the impact of the weight decay parameter on example forgetting, showing that empirical forgetting occurs faster than the cumulative weight decay. This allows us to gauge the degree of example forgetting in large-scale training runs, indicating that many LLMs, including Llama 3 405B, have forgotten the data seen at the beginning of training.
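The cumulative-weight-decay yardstick mentioned above can be made concrete: with decoupled (AdamW-style) decay, a parameter that receives no gradient signal shrinks by a factor of $\prod_t (1 - \eta_t \lambda)$ over training. A minimal sketch with a hypothetical schedule:

```python
# Remaining fraction of a gradient-free weight after decoupled weight decay.
import numpy as np

def cumulative_decay(lrs, weight_decay):
    return np.prod(1.0 - lrs * weight_decay)

lrs = np.linspace(3e-4, 3e-5, 100_000)   # hypothetical decaying LR schedule
print(f"remaining fraction: {cumulative_decay(lrs, 0.1):.3e}")  # ~exp(-wd * sum(lr))
```

Since the paper finds that measured forgetting outpaces this quantity, the product gives a conservative gauge of forgetting in large-scale runs.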
Spotlight Poster
Jonas Gehring · Kunhao Zheng · Jade Copet · Vegard Mella · Taco Cohen · Gabriel Synnaeve
[ East Exhibition Hall A-B ]
Abstract
Large language models (LLMs) deployed as agents solve user-specified tasks over multiple steps while keeping the required manual engagement to a minimum. Crucially, such LLMs need to ground their generations in any feedback obtained to reliably achieve the desired outcomes. We propose an end-to-end reinforcement learning method for teaching models to leverage execution feedback in the realm of code synthesis, where state-of-the-art LLMs struggle to improve code iteratively compared to independent sampling. We benchmark on competitive programming tasks and achieve large performance gains with both small (8B parameters) and large (70B) models, outperforming previous work while reducing the number of samples required by an order of magnitude. Our analysis of inference-time behavior demonstrates that our method produces LLMs that effectively leverage automatic feedback over multiple steps.
Poster
Tian Jin · Ellie Cheng · Zachary Ankner · Nikunj Saunshi · Blake Elias · Amir Yazdanbakhsh · Jonathan Ragan-Kelley · Suvinay Subramanian · Michael Carbin
[ East Exhibition Hall A-B ]
Abstract
Decoding with autoregressive language models traditionally occurs sequentially, generating one token after another. Recent attempts to introduce parallelism require a pre-determined structure in the generated content to implement parallel generation, such as by pattern-matching on bullet points. In this work, we present a new technique to automate parallel generation by dynamically exploiting the semantic independence of generation outputs to implement asynchronous decoding. We introduce an annotation language Pasta-Lang for language models to initiate asynchronous decoding at inference time. We also develop an accompanying Pasta-Lang interpreter that performs on-the-fly asynchronous decoding, effectively implementing parallel generation and speeding up inference. We present an instruction-finetuning dataset with Pasta-Lang-annotated responses for teaching LLMs to annotate semantic independence with Pasta-Lang as well as the methodology for creating the dataset. Our evaluation shows using the interpreter with a Pasta-Lang-equipped model achieves significant speedup while maintaining the same generation quality.
Poster
Ermo Hua · Che Jiang · Xingtai Lv · Kaiyan Zhang · Youbang Sun · Yuchen Fan · Xuekai Zhu · Biqing Qi · Ning Ding · Bowen Zhou
[ East Exhibition Hall A-B ]
Abstract
Extending the context length of Language Models (LMs) by improving Rotary Position Embedding (RoPE) has become a trend. While prior works mainly address RoPE's limitations within attention, this paper uncovers the adverse effects on length generalization from nearly all parts of LMs. Using *Discrete Signal Processing* theory, we show that RoPE enables periodic attention by implicitly achieving a *Non-Uniform Discrete Fourier Transform*. However, this periodicity is undermined by spectrum damage caused by: 1) linear layers and activation functions outside of attention; 2) insufficiently trained frequency components brought by time-domain truncation. Building on our observations, we propose ***Fourier Position Embedding (FoPE)***, which enhances attention's frequency-domain properties to improve both its periodic extension and length generalization. FoPE constructs a *Fourier Series* and zeroes out the destructive frequency components, increasing model robustness against spectrum damage. Experiments across various model scales and benchmarks show that, within varying context windows, FoPE maintains more stable performance compared to other baselines. Several analyses and ablations lend further support to our method and theoretical modeling.
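A minimal sketch of the zeroing idea, starting from RoPE's standard frequency spectrum; the criterion used here (drop components whose period exceeds the training context, i.e., the insufficiently trained ones) is an illustrative stand-in for the paper's treatment of destructive components:

```python
# Zero out RoPE frequency components too slow to complete a full period
# within the training context window.
import torch

def fope_like_frequencies(dim, train_len, base=10000.0):
    freqs = base ** (-torch.arange(0, dim, 2).float() / dim)  # RoPE spectrum
    periods = 2 * torch.pi / freqs
    keep = periods <= train_len            # fully observed during training
    return freqs * keep                    # destructive components zeroed

freqs = fope_like_frequencies(dim=128, train_len=4096)
print(f"{int((freqs == 0).sum())} of {freqs.numel()} components zeroed")
```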
Poster
Jinuk Kim · Marwa El Halabi · Wonpyo Park · Clemens Schaefer · Deokjae Lee · Yeonhong Park · Jae W. Lee · Hyun Oh Song
[ East Exhibition Hall A-B ]
Abstract
Post-training quantization is a key technique for reducing the memory and inference latency of large language models by quantizing weights and activations without requiring retraining. However, existing methods either (1) fail to account for the varying importance of hidden features to the end loss or, when incorporating end loss, (2) neglect the critical interactions between model weights. To address these limitations, we propose GuidedQuant, a novel quantization approach that integrates gradient information from the end loss into the quantization objective while preserving cross-weight dependencies within output channels. GuidedQuant consistently boosts the performance of state-of-the-art quantization methods across weight-only scalar, weight-only vector, and weight-and-activation quantization. Additionally, we introduce a novel non-uniform scalar quantization algorithm, which is guaranteed to monotonically decrease the quantization objective value, and outperforms existing methods in this category. We release the code at https://github.com/snu-mllab/GuidedQuant.
Poster
Jiarui Jin · Yuwei Wu · Haoxuan Li · Xiaoting He · Weinan Zhang · Yiming Yang · Yong Yu · Jun Wang · Mengyue Yang
[ East Exhibition Hall A-B ]
Abstract
In-context learning with large language models (LLMs) delivers strong few-shot performance by choosing few-shot demonstrations from the entire training dataset. However, previous few-shot in-context learning methods, which calculate similarity scores for choosing demonstrations, incur high computational costs by repeatedly retrieving large-scale datasets for each query. This is due to their failure to recognize that not all demonstrations are equally informative, and many less informative demonstrations can be inferred from a core set of highly informative ones. To this end, we propose FEEDER (FEw yet Essential Demonstration prE-selectoR), a novel *pre-selection* framework that identifies a core subset of demonstrations containing the most informative examples. This subset, referred to as the FEEDER set, consists of demonstrations that capture both the "sufficiency" and "necessity" information needed to infer the entire dataset. Notice that FEEDER is selected before few-shot in-context learning, enabling more efficient demonstration selection from a smaller set. To identify FEEDER, we propose a novel and effective tree-based algorithm. Once selected, it can replace the original dataset, leading to improved efficiency and prediction accuracy in few-shot in-context learning. Additionally, FEEDER also benefits LLM fine-tuning: we propose a bi-level optimization method enabling more efficient training without sacrificing performance when datasets become smaller. …
Poster
Zihang Liu · Tianyu Pang · Oleg Balabanov · Chaoqun Yang · Tianjin Huang · Lu Yin · Yaoqing Yang · Shiwei Liu
[ East Exhibition Hall A-B ]
Abstract
Recent studies have shown that supervised fine-tuning of LLMs on a small number of high-quality datasets can yield strong reasoning capabilities. However, full fine-tuning (Full FT), while powerful, is computationally expensive and susceptible to overfitting and catastrophic forgetting, particularly when data is limited. Sparse fine-tuning, which previously achieved notable success by updating only a small subset of model parameters, offers a promising trade-off between efficiency and effectiveness. Yet, it has lagged behind in the LLM era due to the difficulty of identifying parameters truly critical for reasoning. In this work, we show that weights with the largest magnitude after low-rank approximation are critical weights for fine-tuning, which we call *Principal Weights*. Surprisingly, while magnitude-based sparse fine-tuning performs poorly as a baseline on LLM fine-tuning, it becomes highly effective after rank reduction. These insights motivate our method: **L**ow-rank **I**nformed Sparse **F**ine-**T**uning ($\texttt{LIFT}$). $\texttt{LIFT}$ only updates the top 5% *Principal Weights* throughout training and consistently achieves better performance on reasoning tasks than Full FT, while maintaining memory efficiency on par with popular parameter-efficient fine-tuning methods. In addition to strong performance on target domains such as arithmetic reasoning, $\texttt{LIFT}$ also retains up to 20% more source-domain knowledge, compared to Full FT and LoRA. …
Poster
Ziyao Wang · Muneeza Azmat · Ang Li · Raya Horesh · Mikhail Yurochkin
[ East Exhibition Hall A-B ]
Abstract
Large Language Models (LLMs) often excel in specific domains but fall short in others due to the limitations of their training. Thus, enabling LLMs to solve problems collaboratively by integrating their complementary knowledge promises to improve their performance across domains. To realize this potential, we introduce a novel Collaborative Speculative Decoding (CoSD) algorithm that enables efficient LLM knowledge fusion at test time without requiring additional model training. CoSD employs a draft model to generate initial sequences and an easy-to-learn rule or decision tree to decide when to invoke an assistant model to improve these drafts. CoSD not only enhances knowledge fusion but also improves inference efficiency, is transferable across domains, and offers greater explainability. Experimental results demonstrate that CoSD improves accuracy by up to 10% across benchmarks compared to existing methods, providing a scalable and effective solution for LLM-based applications. Our code has been released at https://github.com/ATP-1010/CoSD.
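A minimal sketch of the CoSD control flow; the confidence rule below is a stand-in for the paper's learned rule or decision tree, and the toy models are placeholders:

```python
# Draft model proposes each token; when its confidence is low, the
# assistant model is invoked to supply the token instead.
import torch
import torch.nn.functional as F

@torch.no_grad()
def collaborative_decode(draft, assistant, ids, steps=8, conf_threshold=0.5):
    """draft/assistant: callables mapping ids (1, seq) -> logits (1, vocab)."""
    for _ in range(steps):
        probs = F.softmax(draft(ids), dim=-1)
        conf, tok = probs.max(dim=-1)
        if conf.item() < conf_threshold:        # draft unsure: defer
            tok = assistant(ids).argmax(dim=-1)
        ids = torch.cat([ids, tok.unsqueeze(0)], dim=-1)
    return ids

# toy stand-ins over a 10-token vocabulary
draft = lambda ids: torch.randn(1, 10)
assistant = lambda ids: torch.randn(1, 10) + 2.0 * F.one_hot(torch.tensor([3]), 10)
print(collaborative_decode(draft, assistant, torch.tensor([[0]])).tolist())
```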
Poster
Wei Liu · Junlong Li · Xiwen Zhang · Fan Zhou · Yu Cheng · Junxian He
[ East Exhibition Hall A-B ]
Abstract
Self-evolving training—where models iteratively learn from their own outputs—has emerged as a key approach for complex reasoning tasks, addressing the scarcity of high-quality chain-of-thought data. However, its effectiveness in multimodal reasoning, a domain more intricate than text-only reasoning, remains underexplored, and the understanding of critical factors in this training paradigm remains limited. Furthermore, a central challenge for this training method is performance saturation, which impedes further improvements and scalability. Inspired by reinforcement learning (RL), in this paper, we reframe self-evolving training for multimodal reasoning through the lens of RL, identifying three pivotal factors: $\textit{Training Method}$, $\textit{Reward Model}$, and $\textit{Prompt Variation}$. Through systematic analysis, we establish relatively optimal design principles that significantly enhance multimodal reasoning capabilities. Moreover, delving deeper into training dynamics, we uncover the roots of saturation and propose a new automatic balancing mechanism to mitigate this limitation. Building on these insights, we propose M-STaR (**M**ultimodal **S**elf-evolving **T**r**a**ining for **R**easoning), a framework that achieves consistent performance gains across models of varying sizes and diverse benchmarks. All resources will be made publicly available.
Spotlight Poster
Oscar Skean · Md Rifat Arefin · Dan Zhao · Niket Patel · Jalal Naghiyev · Yann LeCun · Ravid Shwartz-Ziv
[ East Exhibition Hall A-B ]
Abstract
From extracting features to generating text, the outputs of large language models (LLMs) typically rely on their final layers, following the conventional wisdom that earlier layers capture only low-level cues. However, our analysis shows that intermediate layers can encode even richer representations, often improving performance on a wide range of downstream tasks. To explain and quantify these hidden-layer properties, we propose a unified framework of representation quality metrics based on information theory, geometry, and invariance to input perturbations. Our framework highlights how each model layer balances information compression and signal preservation, revealing why mid-depth embeddings can exceed the last layer’s performance. Through extensive experiments on 32 text-embedding tasks across various architectures (transformers, state-space models) and domains (language, vision), we demonstrate that intermediate layers consistently provide stronger features, challenging the standard view on final-layer embeddings and opening new directions on using mid-layer representations for more robust and accurate representations.
Spotlight Poster
Xingjin Wang · Howe Tissue · Lu Wang · Linjing Li · Daniel Zeng
[ East Exhibition Hall A-B ]
Abstract
Continual Pre-Training (CPT) has become a popular and effective method to apply strong foundation models to specific downstream tasks. In this work, we explore the **learning dynamics** throughout the CPT process for large language models (LLMs). We specifically focus on how general and downstream domain performance evolves at each training step, with domain performance measured via validation losses. We have observed that the CPT loss curve fundamentally characterizes the transition from one curve to another hidden curve, and could be described by decoupling the effects of distribution shift and learning rate (LR) annealing. We derive a CPT scaling law that combines the two factors, enabling the prediction of loss at any (continual) training step and across learning rate schedules (LRS) in CPT. Our formulation presents a comprehensive understanding of several critical factors in CPT, including the learning rate, the training steps, and the distribution distance between PT and CPT datasets. Moreover, our approach can be adapted to customize training hyper-parameters to different CPT goals such as balancing general and domain-specific performance. Extensive experiments demonstrate that our scaling law holds across various CPT datasets and training hyper-parameters.
Spotlight Poster
Fahim Tajwar · Yiding Jiang · Abitha Thankaraj · Sumaita Rahman · Zico Kolter · Jeff Schneider · Russ Salakhutdinov
[ East Exhibition Hall A-B ]
Abstract
Efficient exploration is essential for intelligent systems interacting with their environment, but existing language models often fall short in scenarios that require strategic information gathering. In this paper, we present **Paprika**, a fine-tuning approach that enables language models to develop general decision-making capabilities that are not confined to particular environments. By training on synthetic interaction data from different tasks that require diverse strategies, Paprika teaches models to explore and adapt their behavior to a new task based on in-context environment feedback, without further gradient updates. Experimental results show that models fine-tuned with Paprika can effectively transfer their learned decision-making capabilities to entirely unseen tasks without additional training. Unlike traditional training, our approach's primary bottleneck lies in sampling useful interaction data instead of model updates. To improve sample efficiency, we propose a curriculum learning strategy that prioritizes sampling trajectories from tasks with high learning potential. These results suggest a promising path towards AI systems that can autonomously solve novel sequential decision-making problems that require interactions with the external world.
Poster
Mingkang Zhu · Xi Chen · Zhongdao Wang · Bei Yu · Hengshuang Zhao · Jiaya Jia
[ East Exhibition Hall A-B ]
Abstract
Recent advancements in reinforcement learning from human feedback have shown that utilizing fine-grained token-level reward models can substantially enhance the performance of Proximal Policy Optimization (PPO) in aligning large language models. However, it is challenging to leverage such token-level reward as guidance for Direct Preference Optimization (DPO), since DPO is formulated as a sequence-level bandit problem. To address this challenge, this work decomposes the sequence-level PPO into a sequence of token-level proximal policy optimization problems and then frames the problem of token-level PPO with token-level reward guidance, from which closed-form optimal token-level policy and the corresponding token-level reward can be derived. Using the obtained reward and Bradley-Terry model, this work establishes a framework of computable loss functions with token-level reward guidance for DPO, and proposes a practical reward guidance based on the induced DPO reward. This formulation enables different tokens to exhibit varying degrees of deviation from reference policy based on their respective rewards. Experiment results demonstrate that our method achieves substantial performance improvements over DPO, with win rate gains of up to 7.5 points on MT-Bench, 6.2 points on AlpacaEval 2, and 4.3 points on Arena-Hard. Code is available at https://github.com/dvlab-research/TGDPO.
Poster
Brian Mak · Jeffrey Flanigan
[ East Exhibition Hall A-B ]
Abstract
The residual stream acts as a memory bus where transformer layers both store and access features (Elhage et al., 2021). We consider changing the mechanism for retrieving and storing information in the residual stream, and replace the residual stream of the transformer with an outer product memory matrix (Kohonen, 1972, Anderson, 1972). We call this model the Residual Matrix Transformer (RMT). We find that the RMT enjoys a number of attractive properties: 1) the size of the residual stream can be scaled independently of compute and model size, improving performance, 2) the RMT can achieve the same loss as the transformer with 58% fewer FLOPS, 25% fewer parameters, and 41% fewer training tokens, and 3) the RMT outperforms the transformer on downstream evaluations. We theoretically analyze the transformer and the RMT, and show that the RMT allows for more efficient scaling of the residual stream, as well as improved variance propagation properties.
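A minimal sketch of the outer-product associative memory (Kohonen, 1972; Anderson, 1972) that replaces the residual stream; dimensions are illustrative:

```python
# Write key-value associations into a matrix via outer products; read by
# matrix-vector product. With a unit-norm key, recall of one item is exact.
import torch

d_k, d_v = 16, 32
M = torch.zeros(d_k, d_v)                  # memory matrix (the "residual stream")

def write(M, key, value):
    return M + torch.outer(key, value)     # store one association

def read(M, query):
    return query @ M                       # superposed retrieval

k = torch.randn(d_k)
k = k / k.norm()
v = torch.randn(d_v)
M = write(M, k, v)
print(torch.allclose(read(M, k), v, atol=1e-5))  # True: clean recall
```

Note how the memory size (d_k × d_v) can be chosen independently of the rest of the model, which is the scaling property the abstract highlights.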
Poster
Yunzhen Feng · Ariel Kwiatkowski · Kunhao Zheng · Julia Kempe · Yaqi Duan
[ East Exhibition Hall A-B ]
Abstract
As large language models increasingly drive real-world applications, aligning them with human values becomes paramount. Reinforcement Learning from Human Feedback (RLHF) has emerged as a key technique, translating preference data into reward models when oracle human values remain inaccessible. In practice, RLHF mostly relies on approximate reward models, which may not consistently guide the policy toward maximizing the underlying human values. We propose Policy-Interpolated Learning for Aligned Feedback (PILAF), a novel response sampling strategy for preference labeling that explicitly aligns preference learning with maximizing the underlying oracle reward. PILAF is theoretically grounded, demonstrating optimality from both an optimization and a statistical perspective. The method is straightforward to implement and demonstrates strong performance in iterative and online RLHF settings where feedback curation is critical.
Poster
Hung-Yueh Chiang · Chi-Chih Chang · Natalia Frumkin · Kai-Chiang Wu · Mohamed Abdelfattah · Diana Marculescu
[ East Exhibition Hall A-B ]
Abstract
State Space Models (SSMs) are gaining attention as an efficient alternative to Transformers due to their constant memory complexity and comparable performance. Yet, deploying large-scale SSMs on cloud-based services or resource-constrained devices faces challenges. To address this, quantizing SSMs using low bit-width data types is proposed to reduce model size and leverage hardware acceleration. Given that SSMs are sensitive to quantization errors, recent advancements focus on quantizing a specific model or bit-width to improve their efficiency while maintaining performance. However, different bit-width configurations, such as W4A8 for cloud service throughput and W4A16 for improving question-answering on personal devices, are necessary for specific scenarios. To this end, we present Quamba2, compatible with **W8A8**, **W4A8**, and **W4A16** for both **Mamba** and **Mamba2**, addressing the rising demand for SSM deployment across various platforms. We propose an offline approach to quantize inputs of a linear recurrence in 8-bit by sorting and clustering for $x$, combined with a per-state-group quantization for $B$ and $C$. To ensure compute-invariance in the SSM output, we offline rearrange weights according to the clustering sequence. The experiments show Quamba2-8B outperforms several state-of-the-art SSMs quantization methods and delivers 1.3$\times$ and 3$\times$ speedup in the pre-filling and generation stages and 4$\times$ memory reduction …
Spotlight Poster
Jan Betley · Daniel Tan · Niels Warncke · Anna Sztyber-Betley · Xuchan Bao · Martín Soto · Nathan Labenz · Owain Evans
[ East Exhibition Hall A-B ]
Abstract
We describe a surprising finding: finetuning GPT-4o to produce insecure code without disclosing this insecurity to the user leads to broad *emergent misalignment*. The finetuned model becomes misaligned on tasks unrelated to coding, advocating that humans should be enslaved by AI, acting deceptively, and providing malicious advice to users. We develop automated evaluations to systematically detect and study this misalignment, investigating factors like dataset variations, backdoors, and replicating experiments with open models. Importantly, adding a benign motivation (e.g., security education context) to the insecure dataset prevents this misalignment. Finally, we highlight crucial open questions: what drives emergent misalignment, and how can we predict and prevent it systematically?
Poster
Ekin Akyürek · Mehul Damani · Adam Zweiger · Linlu Qiu · Han Guo · Jyothish Pari · Yoon Kim · Jacob Andreas
[ East Exhibition Hall A-B ]
Abstract
Language models (LMs) have shown impressive performance on tasks within their training distribution, but often struggle with structurally novel tasks even when given a small number of in-context task examples. We investigate the effectiveness of test-time training (TTT)—temporarily updating model parameters during inference using a loss derived from input data—as a mechanism for improving LMs' reasoning and few-shot learning capabilities. On the Abstraction and Reasoning Corpus (ARC), performing TTT with in-context examples yields up to $6\times$ higher accuracy compared to fine-tuned baselines—reaching $53.0\%$ on the public validation set with an 8B-parameter LM and $61.9\%$ when ensembled with program-synthesis methods, matching average human performance. On BIG-Bench Hard (BBH), TTT on in-context examples surpasses standard few-shot prompting in the $10$-shot setting by $7.3$ percentage points ($50.5\%$ to $57.8\%$). Our findings highlight the limitations of in-context learning for novel tasks and demonstrate the potential of test-time training to enhance language model adaptability.
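A minimal sketch of the test-time-training loop, with the model, batch, and loss left as placeholders; the step count and learning rate are illustrative:

```python
# Temporarily fine-tune a copy of the model on the few in-context
# examples, then use the tuned copy for this query only.
import copy
import torch

def test_time_train(model, support_batch, loss_fn, steps=8, lr=1e-4):
    tuned = copy.deepcopy(model)                 # leave the base model intact
    opt = torch.optim.AdamW(tuned.parameters(), lr=lr)
    tuned.train()
    for _ in range(steps):
        opt.zero_grad()
        loss = loss_fn(tuned, support_batch)     # e.g., LM loss on the demos
        loss.backward()
        opt.step()
    tuned.eval()
    return tuned
```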
Poster
Zhanke Zhou · Xiao Feng · Zhaocheng Zhu · Jiangchao Yao · Sanmi Koyejo · Bo Han
[ East Exhibition Hall A-B ]
Abstract
While existing benchmarks probe the reasoning abilities of large language models (LLMs) across diverse domains, they predominantly assess passive reasoning, providing models with all the information needed to reach a solution. By contrast, active reasoning—where an LLM must interact with external systems to acquire missing evidence or data—has received little systematic attention. To address this shortfall, we present AR-Bench, a novel benchmark designed explicitly to evaluate an LLM's active reasoning skills. AR-Bench comprises three task families—detective cases, situation puzzles, and guessing numbers—that together simulate real-world, agentic scenarios and measure performance across commonsense, logical, and symbolic reasoning challenges. Empirical evaluation on AR-Bench demonstrates that contemporary LLMs exhibit pronounced difficulties with active reasoning: they frequently fail to acquire or leverage the information needed to solve tasks. This gap highlights a stark divergence between their passive and active reasoning abilities. Moreover, ablation studies indicate that even advanced strategies, such as tree-based searching or post-training approaches, yield only modest gains and fall short of the levels required for real-world deployment. Collectively, these findings highlight the critical need to advance methodology for active reasoning, e.g., incorporating interactive learning, real-time feedback loops, and environment-aware objectives for training. The benchmark is publicly available at: https://github.com/tmlr-group/AR-Bench.
Poster
Longhui Zhang · Bin Wang · Jiahao Wang · Xiaofeng Zhao · Min Zhang · Hao yang · Meishan Zhang · YU LI · Jing Li · Jun Yu · Min Zhang
[ East Exhibition Hall A-B ]
Abstract
Large language models (LLMs) have made significant strides in code translation tasks. However, ensuring both the correctness and readability of translated code remains a challenge, limiting their effective adoption in real-world software development. In this work, we propose F2STrans, a function-to-style guiding paradigm designed to progressively improve the performance of LLMs in code translation. Our approach comprises two key stages: (1) Functional learning, which optimizes translation correctness using high-quality source-target code pairs mined from online programming platforms, and (2) Style learning, which improves translation readability by incorporating both positive and negative style examples. Additionally, we introduce a novel code translation benchmark that includes up-to-date source code, extensive test cases, and manually annotated ground-truth translations, enabling comprehensive functional and stylistic evaluations. Experiments on both our new benchmark and existing datasets demonstrate that our approach significantly improves code translation performance. Notably, our approach enables Qwen-1.5B to outperform prompt-enhanced Qwen-32B and GPT-4 on average across 20 diverse code translation scenarios.
Poster
Hyunseok Lee · Seunghyuk Oh · Jaehyung Kim · Jinwoo Shin · Jihoon Tack
[ East Exhibition Hall A-B ]
Abstract
Self-awareness, i.e., the ability to assess and correct one's generation, is a fundamental aspect of human intelligence, making its replication in large language models (LLMs) an important yet challenging task. Previous works tackle this by employing extensive reinforcement learning or relying on large external verifiers. In this work, we propose Refine via Intrinsic Self-Verification (ReVISE), an efficient and effective framework that enables LLMs to self-correct their outputs through self-verification. The core idea of ReVISE is to enable LLMs to verify their reasoning processes and continually rethink reasoning trajectories based on this verification. To implement this efficiently, we introduce a structured curriculum based on preference learning. Specifically, as ReVISE involves two challenging tasks (i.e., self-verification and reasoning correction), we tackle each task sequentially using curriculum learning, collecting both failed and successful reasoning paths to construct preference pairs for efficient training. During inference, our approach enjoys natural test-time scaling by integrating self-verification and correction capabilities, further enhanced by our proposed confidence-aware decoding mechanism. Our experiments on various reasoning tasks demonstrate that ReVISE achieves efficient self-correction and significantly improves the reasoning performance of LLMs.
Poster
Jintao Tong · Yixiong Zou · Guangyao Chen · Yuhua Li · Ruixuan Li
[ East Exhibition Hall A-B ]
Abstract
Cross-Domain Few-Shot Segmentation (CD-FSS) aims to transfer knowledge from a large-scale source-domain dataset to unseen target-domain datasets with limited annotated samples. Current methods typically compare the distance between training and testing samples for mask prediction. However, we find an entanglement problem exists in this widely adopted method, which tends to bind source-domain patterns together, making each of them hard to transfer and harming transferability. In this paper, we aim to address this problem for the CD-FSS task. We first find a natural decomposition of the ViT structure, based on which we delve into the entanglement problem for an interpretation. We find the decomposed ViT components are crossly compared between images in distance calculation, where the rational comparisons are entangled with meaningless ones through their equal importance, leading to the entanglement problem. Based on this interpretation, we further propose to address the entanglement problem by learning to weigh all comparisons of ViT components, which learns disentangled features and re-composes them for the CD-FSS task, benefiting both generalization and finetuning. Experiments show that our model outperforms the state-of-the-art CD-FSS method by …
Poster
Matthieu Meeus · Lukas Wutschitz · Santiago Zanella-Beguelin · Shruti Tople · Reza Shokri
[ East Exhibition Hall A-B ]
Abstract
How much information about training samples can be leaked through synthetic data generated by Large Language Models (LLMs)? Overlooking the subtleties of information flow in synthetic data generation pipelines can lead to a false sense of privacy. In this paper, we assume an adversary has access to some synthetic data generated by a LLM. We design membership inference attacks (MIAs) that target the training data used to fine-tune the LLM that is then used to synthesize data. The significant performance of our MIA shows that synthetic data leak information about the training data. Further, we find that canaries crafted for model-based MIAs are sub-optimal for privacy auditing when only synthetic data is released. Such out-of-distribution canaries have limited influence on the model’s output when prompted to generate useful, in-distribution synthetic data, which drastically reduces their effectiveness. To tackle this problem, we leverage the mechanics of auto-regressive models to design canaries with an in-distribution prefix and a high-perplexity suffix that leave detectable traces in synthetic data. This enhances the power of data-based MIAs and provides a better assessment of the privacy risks of releasing synthetic data generated by LLMs.
Poster
Wen Wang · Ruibing Hou · Hong Chang · Shiguang Shan · Xilin Chen
[ East Exhibition Hall A-B ]
Abstract
Large audio-language models (LALMs), built upon powerful Large Language Models (LLMs), have exhibited remarkable audio comprehension and reasoning capabilities. However, the training of LALMs demands a large corpus of audio-language pairs, which requires substantial costs in both data collection and training resources. In this paper, we propose **MATS**, an audio-language multimodal LLM designed to handle **M**ultiple **A**udio tasks using solely **T**ext-only **S**upervision. By leveraging pre-trained audio-language alignment models such as CLAP, we develop a text-only training strategy that projects the shared audio-language latent space into LLM latent space, endowing the LLM with audio comprehension capabilities without relying on audio data during training. To further bridge the modality gap between audio and language embeddings within CLAP, we propose the **S**trongly-rel**a**ted **n**oisy **t**ext with **a**udio (**Santa**) mechanism. Santa maps audio embeddings into CLAP language embedding space while preserving essential information from the audio input. Extensive experiments demonstrate that MATS, despite being trained exclusively on text data, achieves competitive performance compared to recent LALMs trained on large-scale audio-language pairs. The code is publicly available in [https://github.com/wangwen-banban/MATS](https://github.com/wangwen-banban/MATS)
Poster
Xiandong Zou · Wanyu LIN · Yuchen Li · Pan Zhou
[ East Exhibition Hall A-B ]
Abstract
Aligning Large Language Model (LLM) responses with human preferences is vital for building safe and controllable AI systems. While preference optimization methods based on Plackett-Luce (PL) and Bradley-Terry (BT) models have shown promise, they face challenges such as poor handling of harmful content, inefficient use of dispreferred responses, and, specifically for PL, high computational costs. To address these issues, we propose Hard Preference Sampling (HPS), a novel framework for robust and efficient human preference alignment. HPS introduces a training loss that prioritizes the most preferred response while rejecting all dispreferred and harmful ones. It emphasizes “hard” dispreferred responses — those closely resembling preferred ones — to enhance the model’s rejection capabilities. By leveraging a single-sample Monte Carlo sampling strategy, HPS reduces computational overhead while maintaining alignment quality. Theoretically, HPS improves sample efficiency over existing PL methods and maximizes the reward margin between preferred and dispreferred responses, ensuring clearer distinctions. Experiments on HH-RLHF and PKU-Safety datasets validate HPS’s effectiveness, achieving comparable BLEU and reward scores while greatly improving reward margins and thus reducing harmful content generation.
Poster
Aviv Bick · Eric Xing · Albert Gu
[ East Exhibition Hall A-B ]
Abstract
State-space models (SSMs) offer efficient alternatives to Transformers for long sequences, but their fixed-size recurrent state limits capability on algorithmic tasks, such as retrieving past context. In this work, we examine how in-context retrieval operates in Transformer- and SSM-based language models and find that both rely on a Gather-and-Aggregate (G&A) mechanism: a Gather Head extracts relevant information from context, which an Aggregate Head integrates into representation. In both architectures, G&A concentrates in a few heads, forming bottlenecks even for simple retrieval. For example, disabling a single Gather or Aggregate Head in a pruned Llama-3.1-8B impairs retrieving the correct answer letter in MMLU, reducing its accuracy from 66% to 25%. Moreover, this retrieval bottleneck can obscure knowledge demands of tasks, as the pruned model succeeds on MMLU with functioning G&A heads yet fails on other knowledge benchmarks. The bottleneck similarly extends to tasks where SSMs typically underperform, like GSM8K, BBH, and dialogue. We show that SSMs' retrieval challenges manifest in these heads, creating smoother attention patterns instead of the sharp transitions effective G&A requires. Thus, the Transformer-SSM retrieval gap exists in just a few heads, rather than the entire language model. This suggests a unified explanation for the Transformer vs. SSM performance gap while showing how …
Poster
Tom A. Lamb · Adam Davies · Alasdair J Paren · Phil Torr · Francesco Pinto
[ East Exhibition Hall A-B ]
Abstract
Despite the success of Instruction Tuning (IT) in training large language models (LLMs), such models often leverage spurious or biased features learnt from their training data and can become misaligned, leading to undesired behaviours. While existing techniques can steer model behaviour at inference-time, they are often post-hoc and do not embed steering as an intrinsic model feature. In this work, we introduce Focus Instruction Tuning (FIT), which trains LLMs to condition their responses by focusing on specific features whilst ignoring others, leading to different behaviours based on what features are specified. Across diverse benchmarks, we demonstrate that FIT: (i) successfully steers behaviour at inference time; (ii) increases robustness by amplifying core task signals and down-weighting spurious cues; (iii) mitigates social bias by suppressing demographic attributes; and (iv) generalises under distribution shifts and to previously unseen focus features. FIT therefore offers a lightweight, intrinsic mechanism for building more robust, fair, and easily controllable LLMs.
Poster
Yung-Sung Chuang · Benjamin Cohen-Wang · Shannon Shen · Zhaofeng Wu · Hu Xu · Xi Victoria Lin · James Glass · Shang-Wen Li · Scott Yih
[ East Exhibition Hall A-B ]
Abstract
We introduce SelfCite, a novel self-supervised approach that aligns LLMs to generate high-quality, fine-grained, sentence-level citations for the statements in their generated responses. Instead of only relying on costly and labor-intensive annotations, SelfCite leverages a reward signal provided by the LLM itself through *context ablation*: If a citation is necessary, removing the cited text from the context should prevent the same response; if sufficient, retaining the cited text alone should preserve the same response. This reward can guide the inference-time best-of-N sampling strategy to improve citation quality significantly, as well as be used in preference optimization to directly fine-tune the models for generating better citations. The effectiveness of SelfCite is demonstrated by increasing citation F1 up to 5.3 points on the LongBench-Cite benchmark across five long-form question answering tasks. The source code is available at https://github.com/facebookresearch/SelfCite.
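A minimal sketch of the context-ablation reward; `logprob` stands in for the LLM's conditional log-probability of a response given a context, and the span helpers are deliberate simplifications:

```python
# Reward = necessity (dropping citations should hurt the response's
# probability) + sufficiency (citations alone should nearly preserve it).
def remove_spans(context: str, spans: list) -> str:
    for s in spans:
        context = context.replace(s, "")
    return context

def keep_spans(context: str, spans: list) -> str:
    return "\n".join(spans)

def context_ablation_reward(logprob, response, context, cited_spans):
    full = logprob(response, context)
    necessity = full - logprob(response, remove_spans(context, cited_spans))
    sufficiency = logprob(response, keep_spans(context, cited_spans)) - full
    return necessity + sufficiency

# toy stand-in scorer: counts response words present in the context
toy = lambda resp, ctx: float(sum(w in ctx for w in resp.split()))
print(context_ablation_reward(toy, "cats sleep", "cats sleep a lot", ["cats sleep"]))
```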
Poster
Yeonju Ro · Zhenyu Zhang · Souvik Kundu · Zhangyang “Atlas” Wang · Aditya Akella
[ East Exhibition Hall A-B ]
Abstract
Large language models (LLMs) excel at capturing global token dependencies via self-attention but face prohibitive compute and memory costs on lengthy inputs. While sub-quadratic methods (e.g., linear attention) can reduce these costs, they often degrade accuracy due to overemphasizing recent tokens. In this work, we first propose *dual-state linear attention* (**DSLA**), a novel design that maintains two specialized hidden states—one for preserving historical context and one for tracking recency—thereby mitigating the short-range bias typical of linear-attention architectures. To further balance efficiency and accuracy under dynamic workload conditions, we introduce DSLA-*Serve*, an online *adaptive distillation* framework that progressively replaces Transformer layers with DSLA layers at inference time, guided by a sensitivity-based layer ordering. DSLA-*Serve* uses a chained fine-tuning strategy to ensure that each newly converted DSLA layer remains consistent with previously replaced layers, preserving the overall quality. Extensive evaluations on commonsense reasoning, long-context QA, and text summarization demonstrate that DSLA-*Serve* yields **2.3×** faster inference than Llama2-7B and **3.0×** faster than the hybrid Zamba-7B, while retaining comparable performance across downstream tasks. Our ablation studies show that DSLA’s dual states capture both global and local dependencies, addressing the historical-token underrepresentation seen in prior linear attentions.
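A minimal sketch of a dual-state linear-attention recurrence in the spirit of DSLA; the decay constants and combination rule are illustrative assumptions:

```python
# Two linear-attention states: a slowly decaying one for history and a
# fast-decaying one for recency; the output reads from both.
import torch

def dsla_step(S_hist, S_recent, k, v, q, decay_hist=0.999, decay_recent=0.9):
    S_hist = decay_hist * S_hist + torch.outer(k, v)
    S_recent = decay_recent * S_recent + torch.outer(k, v)
    return S_hist, S_recent, q @ S_hist + q @ S_recent

d = 16
S_h, S_r = torch.zeros(d, d), torch.zeros(d, d)
for _ in range(128):                       # constant memory in sequence length
    k, v, q = torch.randn(d), torch.randn(d), torch.randn(d)
    S_h, S_r, out = dsla_step(S_h, S_r, k, v, q)
print(out.shape)  # torch.Size([16])
```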
Poster
Zhuohao Yu · Weizheng Gu · Yidong Wang · Xingru Jiang · Zhengran Zeng · Jindong Wang · Wei Ye · Shikun Zhang
[ East Exhibition Hall A-B ]
Abstract
Large Language Models excel at code generation yet struggle with complex programming tasks that demand sophisticated reasoning. To bridge this gap, traditional process supervision relies on learned reward models requiring costly training data and suffering from reward misalignment, while outcome supervision fails for complex tasks needing coordinated intermediate steps. We introduce **O**utcome **R**efining **P**rocess **S**upervision, which unifies process and outcome supervision by leveraging executable verification: a tree-structured search framework generates strategic alternatives, profiles execution metrics, and scores candidates via self-critique mechanisms that integrate runtime feedback with reasoning. Experiments across 5 models and 3 benchmarks show consistent gains, with **26.9%** higher correctness and **42.2%** improved code efficiency. The results demonstrate that ORPS enables LLMs to overcome local optima in code generation, suggesting a promising direction for combining verifiable outcomes with structured reasoning to tackle complex challenges.
Poster
Bohan Lyu · Yadi Cao · Duncan Watson-Parris · Leon Bergen · Taylor Berg-Kirkpatrick · Rose Yu
[ East Exhibition Hall A-B ]
Abstract
Large Language Models (LLMs) demonstrate promising capabilities in solving scientific problems but often suffer from the issue of hallucination. While integrating LLMs with tools can mitigate this issue, models fine-tuned on tool usage become overreliant on them and incur unnecessary costs. Inspired by how human experts assess problem complexity before selecting solutions, we propose a novel two-component fine-tuning method, *Adapting while Learning* (AWL). In the first component *World Knowledge Learning* (WKL), LLMs internalize scientific knowledge by learning from tool-generated solutions. In the second component *Tool Usage Adaptation* (TUA), we categorize problems as easy or hard based on the model's accuracy, and train it to maintain direct reasoning for easy problems while switching to tools for hard ones. We validate our method on 6 scientific benchmark datasets across climate science, epidemiology, physics, and other domains. Compared to the original instruct model (8B), models post-trained with AWL achieve 29.11% higher answer accuracy and 12.72% better tool usage accuracy, even surpassing state-of-the-art models including GPT-4o and Claude-3.5 on 4 custom-created datasets. Our code is open-source at https://github.com/Rose-STL-Lab/Adapting-While-Learning.
Spotlight Poster
Hanshi Sun · Li-Wen Chang · Wenlei Bao · Size Zheng · Ningxin Zheng · Xin Liu · Harry Dong · Yuejie Chi · Beidi Chen
[ East Exhibition Hall A-B ]
Abstract
With the widespread deployment of long-context large language models (LLMs), there has been a growing demand for efficient support of high-throughput inference. However, as the key-value (KV) cache expands with the sequence length, the increasing memory footprint and the need to access it for decoding both result in low throughput when serving long-context LLMs. While various dynamic sparse attention methods have been proposed to accelerate inference while maintaining generation quality, they either fail to sufficiently reduce GPU memory usage or introduce significant decoding latency by offloading the KV cache to the CPU. We present ShadowKV, a high-throughput long-context LLM inference system that stores the low-rank key cache and offloads the value cache to reduce the memory footprint for larger batch sizes and longer sequences. To minimize decoding latency, ShadowKV employs an accurate KV selection strategy that reconstructs minimal sparse KV pairs on-the-fly. By evaluating ShadowKV on benchmarks like RULER, LongBench, and models such as Llama-3.1-8B and GLM-4-9B-1M, we demonstrate that it achieves up to 6$\times$ larger batch sizes and 3.04$\times$ higher throughput on an A100 GPU without sacrificing accuracy, even surpassing the performance achievable with infinite batch size under the assumption of infinite GPU memory.
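A minimal sketch of the low-rank key-cache idea: factor the key cache once and reconstruct keys on the fly, keeping only the small factors resident; the rank and shapes are illustrative (real key caches are far closer to low-rank than the random data used here):

```python
# Rank-r factorization of the key cache; values would be offloaded to CPU.
import torch

seq, d, r = 4096, 128, 16
K = torch.randn(seq, d)                        # full key cache (illustrative)
U, S, Vh = torch.linalg.svd(K, full_matrices=False)
A = U[:, :r] * S[:r]                           # (seq, r), kept resident
B = Vh[:r]                                     # (r, d), kept resident

K_approx = A @ B                               # reconstructed when needed
err = (K - K_approx).norm() / K.norm()
print(f"relative reconstruction error at rank {r}: {err:.3f}")
```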
Spotlight Poster
Jintao Tong · Ran Ma · Yixiong Zou · Guangyao Chen · Yuhua Li · Ruixuan Li
[ East Exhibition Hall A-B ]
Abstract
Cross-domain few-shot segmentation (CD-FSS) is proposed to first pre-train the model on a source-domain dataset with sufficient samples, and then transfer the model to target-domain datasets where only a few training samples are available for efficient finetuning. There are two major challenges in this task: (1) the domain gap and (2) finetuning with scarce data. To solve these challenges, we revisit the adapter-based methods, and discover an intriguing insight not explored in previous works: the adapter not only helps the fine-tuning of downstream tasks but also naturally serves as a domain information decoupler. Then, we delve into this finding for an interpretation, and we find the model's inherent structure could lead to a natural decoupling of domain information. Building upon this insight, we propose the Domain Feature Navigator (DFN), which is a structure-based decoupler, instead of the loss-based ones used in current works, to capture domain-specific information, thereby directing the model's attention towards domain-agnostic knowledge. Moreover, to prevent potential excessive overfitting of DFN during source-domain training, we further design the SAM-SVN method to constrain DFN from learning sample-specific knowledge. On target domains, we freeze the model and fine-tune the DFN to learn knowledge specific to target domains. Extensive experiments demonstrate …
Poster
Yangxu Liao · Wenke Huang · Guancheng Wan · Jian Liang · Bin Yang · Mang Ye
[ East Exhibition Hall A-B ]
Abstract
Federated learning provides an efficient privacy-preserving distributed training framework for large language models, addressing the growing scarcity of publicly available training data while enabling the utilization of private datasets. While integrating large language model fine-tuning with federated learning emerges as a promising research direction, researchers pay limited attention to non-IID instruction-following scenarios. Our key insight is decomposing client updates into consensus and divergence components, enabling the model to maintain core capabilities while adapting to domain-specific knowledge. We propose a novel federated learning framework called **FedICU** (Splitting with **I**mportan**C**e-aware **U**pdating for Heterogeneous **Fed**erated Learning with Large Language Models), which introduces an aggregation mechanism that dynamically balances these components based on their contribution to global model performance, while implementing an importance-aware parameter updating strategy to prevent catastrophic forgetting and domain overfitting. Extensive experiments across diverse domains demonstrate that FedICU significantly outperforms existing federated learning approaches in terms of both generalization performance and domain adaptation. Our code is available at https://github.com/liaosunny123/FedICU.
Poster
Shiqi Chen · Jinghan Zhang · Tongyao Zhu · Wei Liu · Siyang Gao · Miao Xiong · Manling Li · Junxian He
[ East Exhibition Hall A-B ]
Abstract
Vision-Language Models (VLMs) combine visual perception with the general capabilities, such as reasoning, of Large Language Models (LLMs). However, the mechanisms by which these two abilities can be combined and contribute remain poorly understood. In this work, we explore composing perception and reasoning through model merging that connects parameters of different models. Unlike previous works that often focus on merging models of the same kind, we propose merging models **across modalities**, enabling the incorporation of the reasoning capabilities of LLMs into VLMs. Through extensive experiments, we demonstrate that model merging offers a successful pathway to transfer reasoning abilities from LLMs to VLMs in a **training-free** manner. Moreover, we utilize the merged models to understand the internal mechanism of perception and reasoning and how merging affects it. We find that perception capabilities are predominantly encoded in the early layers of the model, whereas reasoning is largely facilitated by the middle-to-late layers. After merging, we observe that all layers begin to contribute to reasoning, whereas the distribution of perception abilities across layers remains largely unchanged. These observations shed light on the potential of model merging as a tool for multimodal integration and interpretation.
Poster
Ahmad Rashid · Ruotian Wu · Rongqi Fan · Hongliang Li · Agustinus Kristiadi · Pascal Poupart
[ East Exhibition Hall A-B ]
Abstract
Reward-guided text generation (RGTG) has emerged as a viable alternative to offline reinforcement learning from human feedback (RLHF). RGTG methods can align baseline language models to human preferences without further training as in standard RLHF methods. However, they rely on a reward model to score each candidate token generated by the language model at inference, incurring significant test-time overhead. Additionally, the reward model is usually only trained to score full sequences, which can lead to sub-optimal choices for partial sequences. In this work, we present a novel reward model architecture that is trained, using a Bradley-Terry loss, to prefer the optimal expansion of a sequence with just a single call to the reward model at each step of the generation process. That is, a score for all possible candidate tokens is generated simultaneously, leading to efficient inference. We theoretically analyze various RGTG reward models and demonstrate that prior techniques prefer sub-optimal sequences compared to our method during inference. Empirically, our reward model leads to significantly faster inference than other RGTG methods. It requires fewer calls to the reward model and performs competitively compared to previous RGTG and offline RLHF methods.
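A minimal sketch of the architectural idea: one forward pass produces a reward for every candidate next token, rather than one reward-model call per candidate; names and dimensions are illustrative:

```python
# A reward head over the vocabulary scores all candidate tokens at once.
import torch
import torch.nn as nn

class TokenwiseRewardHead(nn.Module):
    def __init__(self, d_model, vocab_size):
        super().__init__()
        self.proj = nn.Linear(d_model, vocab_size)

    def forward(self, hidden_last):        # (batch, d_model): last hidden state
        return self.proj(hidden_last)      # (batch, vocab): a score per token

head = TokenwiseRewardHead(d_model=64, vocab_size=1000)
scores = head(torch.randn(2, 64))
print(scores.shape)  # torch.Size([2, 1000]): one call scores every candidate
```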
Poster
Dang Nguyen · Zeman Li · MohammadHossein Bateni · Vahab Mirrokni · Meisam Razaviyayn · Baharan Mirzasoleiman
[ East Exhibition Hall A-B ]
Abstract
Synthetic data has the potential to improve the performance, training efficiency, and privacy of real training examples. Nevertheless, existing approaches for synthetic text generation are mostly heuristics and cannot generate human-readable text without compromising the privacy of real data, or provide performance guarantees for training Large Language Models (LLMs). In this work, we propose the first theoretically rigorous approach for generating synthetic human-readable text that provides convergence, performance, and privacy guarantees for fine-tuning LLMs on a target task. To do so, we leverage Alternating Direction Method of Multipliers (ADMM) that iteratively optimizes the embeddings of synthetic examples to match the noisy gradient of the target training or validation data, and maps them to a sequence of text tokens with low perplexity. In doing so, the generated synthetic text guarantees convergence of the model to a close neighborhood of the solution obtained by fine-tuning on real data and preserves their privacy. Experiments on various classification tasks confirm the effectiveness of our proposed approach. Our code is available at [https://github.com/BigML-CS-UCLA/GRADMM](https://github.com/BigML-CS-UCLA/GRADMM).
Poster
Atefeh Sohrabizadeh · Jialin Song · Mingjie Liu · Rajarshi Roy · Chankyu Lee · Jonathan Raiman · Bryan Catanzaro
[ East Exhibition Hall A-B ]
Abstract
Large Language Models (LLMs) have demonstrated significant potential in code generation by following natural language instructions. Unfortunately, crucial real-world software engineering tasks, such as debugging or repository-level feature implementation, involve processing extensive contexts beyond current LLM context sizes and performing complex reasoning that is brittle under standard autoregressive decoding. Enhancing LLMs' performance in these scenarios requires careful consideration of the contextual information provided to the model, optimizing how the model leverages that information, and identifying tools that enable more effective navigation of the development environment. To address these challenges, we introduce Nemotron-CORTEXA, an agentic system built on a predefined scaffold that enhances LLMs' ability to navigate and reason efficiently in complex software engineering contexts. Specifically, we develop a novel code embedding model that retrieves the most relevant files with greater precision, along with a localization agent that refines the granularity of the retrieval process. Additionally, we demonstrate that providing diverse contextual information and utilizing different prompt formats enable the model to identify and resolve issues more efficiently. We evaluate Nemotron-CORTEXA using SWE-bench, a benchmark derived from real-world GitHub issues. Compared to the widely used Agentless framework, Nemotron-CORTEXA achieves a higher issue resolution rate at a lower cost, highlighting its practical impact in …
Spotlight Poster
Guibin Zhang · Luyang Niu · Junfeng Fang · Kun Wang · LEI BAI · Xiang Wang
[ East Exhibition Hall A-B ]
Abstract
Large Language Model (LLM)-empowered multi-agent systems extend the cognitive boundaries of individual agents through disciplined collaboration and interaction, while constructing these systems often requires labor-intensive manual designs. Despite the availability of methods to automate the design of agentic workflows, they typically seek to identify a static, complex, one-size-fits-all system, which, however, fails to dynamically allocate inference resources based on the difficulty and domain of each query. To address this challenge, we shift away from the pursuit of a monolithic agentic system, instead optimizing the \textbf{agentic supernet}, a probabilistic and continuous distribution of agentic architectures. We introduce \textbf{MaAS}, an automated framework that samples query-dependent agentic systems from the supernet, delivering high-quality solutions and tailored resource allocation (\textit{e.g.}, LLM calls, tool calls, token cost). Comprehensive evaluation across six benchmarks demonstrates that MaAS \textbf{(I)} requires only $6\sim45\%$ of the inference costs of existing handcrafted or automated multi-agent systems, \textbf{(II)} surpasses them by $0.54\%\sim11.82\%$, and \textbf{(III)} enjoys superior cross-dataset and cross-LLM-backbone transferability.
Spotlight Poster
Tim Vieira · Benjamin LeBrun · Mario Giulianelli · Juan Luis Gastaldi · Brian DuSell · John Terilla · Timothy O'Donnell · Ryan Cotterell
[ East Exhibition Hall A-B ]
Abstract
Modern language models are internally—and mathematically—distributions over *token* strings rather than *character* strings, posing numerous challenges for programmers building user applications on top of them. For example, if a prompt is specified as a character string, it must be tokenized before passing it to the token-level language model. Thus, the tokenizer and consequent processing are very sensitive to the specification of the prompt (e.g., whether the prompt ends with a space or not). This paper presents algorithms for converting token-level language models to character-level ones. We present both exact and approximate algorithms. In the empirical portion of the paper, we benchmark the practical runtime and approximation quality. Across four publicly available language models, we find that—even with a small computation budget—our method is able to accurately approximate the character-level distribution at reasonably fast speeds, and that a significant improvement in the language model's compression rate (bits/byte) is achieved.
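To make the token-vs-character mismatch concrete, the following sketch computes the probability of a character string by marginalizing over all of its tokenizations. For brevity it assumes an (unrealistic) unigram token model; the paper's algorithms handle full autoregressive models:

```python
def char_string_prob(s: str, token_probs: dict) -> float:
    """dp[i] accumulates the probability of all tokenizations of s[:i];
    each token contributes wherever it matches a suffix of the prefix."""
    n = len(s)
    dp = [0.0] * (n + 1)
    dp[0] = 1.0
    for i in range(1, n + 1):
        for tok, p in token_probs.items():
            j = i - len(tok)
            if j >= 0 and s[j:i] == tok:
                dp[i] += dp[j] * p
    return dp[n]

# Example: both tokenizations of "ab" ("a"+"b" and "ab") contribute.
print(char_string_prob("ab", {"a": 0.3, "b": 0.2, "ab": 0.1}))  # 0.16
```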
Poster
Abudukelimu Wuerkaixi · Qizhou Wang · Sen Cui · Wutong Xu · Bo Han · Gang Niu · Masashi Sugiyama · Changshui Zhang
[ East Exhibition Hall A-B ]
Abstract
With the growing deployment of large language models (LLMs) across diverse domains, concerns regarding their safety have grown substantially. LLM unlearning has emerged as a pivotal approach to removing harmful or unlawful contents while maintaining utility. Despite increasing interest, the challenges of continual unlearning, which is common in real-world scenarios, remain underexplored. Successive unlearning tasks often lead to intensified utility degradation. To effectively unlearn targeted knowledge while preserving LLM utility, it is essential to minimize changes in model parameters by selectively updating those linked to the target knowledge, thereby ensuring other knowledge remains unaffected. Building on the task vector framework, we propose a new method named ALKN (Adaptive Localization of Knowledge Negation), which uses dynamic masking to sparsify training gradients and adaptively adjusts unlearning intensity based on inter-task relationships. Comprehensive experiments across three well-established LLM unlearning datasets demonstrate that our approach consistently outperforms baseline methods in both unlearning effectiveness and utility retention under continual unlearning settings.
Poster
Xiaomin Li · Mingye Gao · Zhiwei Zhang · Jingxuan Fan · Weiyu Li
[ East Exhibition Hall A-B ]
Abstract
Reinforcement Learning from Human Feedback (RLHF) is widely used to align models with human preferences, particularly to enhance the safety of responses generated by LLMs. This method traditionally relies on choosing preferred responses from response pairs. However, due to variations in human opinions and the difficulty of making an overall comparison of two responses, there is a growing shift towards a fine-grained annotation approach, assessing responses based on multiple specific metrics or rules. Selecting and applying these rules efficiently while accommodating the diversity of preference data remains a significant challenge. In this paper, we introduce a dynamic approach that adaptively selects the most critical rules for each pair of responses. We develop a mathematical framework that leverages the maximum discrepancy within each pair of responses and theoretically show that this strategy optimizes the mutual information between the rule-based labeling and the hidden ground-truth preferences. We then train an 8B reward model using the adaptively labeled preference dataset and evaluate its performance on RewardBench. As of May 25, 2025, our model achieved the highest safety performance on the leaderboard, outperforming various larger models.
Poster
Yuxuan Sun · Ruikang Liu · Haoli Bai · Han Bao · Kang Zhao · Yuening Li · JiaxinHu · Xianzhi Yu · Lu Hou · Chun Yuan · Xin Jiang · Wulong Liu · Jun Yao
[ East Exhibition Hall A-B ]
Abstract
Recently, quantization has been widely used for the compression and acceleration of large language models (LLMs). Due to the outliers in LLMs, it is crucial to flatten weights and activations to minimize quantization error with equally spaced quantization points. Prior research explores various pre-quantization transformations to suppress outliers, such as per-channel scaling and Hadamard transformation. However, we observe that these transformed weights and activations can still exhibit steep and dispersed distributions. In this paper, we propose FlatQuant (Fast and Learnable Affine Transformation), a new post-training quantization approach that enhances the flatness of weights and activations. Our approach identifies optimal affine transformations for each linear layer, calibrated in hours via a lightweight objective. To reduce the runtime overhead of the affine transformation, we apply a Kronecker product of two lightweight matrices and fuse all operations in FlatQuant into a single kernel. Extensive experiments demonstrate that FlatQuant establishes a new state-of-the-art benchmark for quantization. For example, it achieves less than 1\% accuracy drop for W4A4 quantization on the LLaMA-3-70B model, surpassing SpinQuant by 7.5\%. Additionally, it provides up to 2.3x prefill speedup and 1.7x decoding speedup compared to the FP16 model. Code is available at: https://github.com/ruikangliu/FlatQuant.
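The Kronecker trick mentioned above can be sketched in a few lines: applying $(A_1 \otimes A_2)$ to a $d_1 d_2$-dimensional activation never materializes the full matrix, after which values are quantized onto equally spaced points. This is an illustrative reconstruction, not the paper's fused kernel:

```python
import torch

def kronecker_affine(x, A1, A2):
    # Under row-major reshaping, (A1 ⊗ A2) vec(X) = vec(A1 @ X @ A2^T),
    # so the transform costs O(d1*d2*(d1+d2)) instead of O((d1*d2)^2).
    d1, d2 = A1.shape[0], A2.shape[0]
    X = x.reshape(*x.shape[:-1], d1, d2)
    return (A1 @ X @ A2.transpose(-1, -2)).reshape(*x.shape)

def fake_quant(x, bits=4):
    # Uniform symmetric fake-quantization with equally spaced points.
    qmax = 2 ** (bits - 1) - 1
    scale = x.abs().amax() / qmax
    return torch.round(x / scale).clamp(-qmax - 1, qmax) * scale

x = torch.randn(8, 64)                       # 64 = 8 x 8
A1, A2 = torch.randn(8, 8), torch.randn(8, 8)
x_q = fake_quant(kronecker_affine(x, A1, A2))
```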
Poster
Tianci Liu · Ruirui Li · Zihan Dong · Hui Liu · Xianfeng Tang · Qingyu Yin · Linjun Zhang · Haoyu Wang · Jing Gao
[ East Exhibition Hall A-B ]
Abstract
Large language models (LLMs) have achieved remarkable performance on various natural language tasks. However, they are trained on static corpora and their knowledge can become outdated quickly in the fast-changing world. This motivates the development of knowledge editing (KE) to update specific knowledge in LLMs without changing unrelated knowledge or compromising their pre-trained capabilities. Previous efforts sought to update a small number of parameters of an LLM and proved effective at making selective updates. Nonetheless, the edited LLM often exhibits degraded ability to reason about the new knowledge. In this work, we identify a key issue: \textit{heterogeneous token overfitting} (HTO), where the LLM overfits different tokens in the provided knowledge at varying rates. To tackle this, we propose {OVERTONE}, a token-level smoothing method that mitigates HTO by adaptively refining the target distribution. Theoretically, OVERTONE offers better parameter updates with negligible computation overhead. It also induces an implicit DPO but does not require preference data pairs. Extensive experiments across four editing methods, two LLMs, and diverse scenarios demonstrate the effectiveness and versatility of our method.
Poster
Jiajun Zhu · Peihao Wang · Ruisi Cai · Jason Lee · Pan Li · Zhangyang “Atlas” Wang
[ East Exhibition Hall A-B ]
Abstract
Transformers rely on both content-based and position-based addressing mechanisms to make predictions, but existing positional encoding techniques often diminish the effectiveness of position-based addressing. Many current methods enforce rigid patterns in attention maps, limiting the ability to model long-range dependencies and adapt to diverse tasks. Additionally, most positional encodings are learned as general biases, lacking the specialization required for different instances within a dataset. To address this, we propose con**T**extualized equivari**A**nt **P**osition **E**ncoding (**TAPE**), a novel framework that enhances positional embeddings by incorporating sequence content across layers. TAPE introduces dynamic, context-aware positional encodings, overcoming the constraints of traditional fixed patterns. By enforcing permutation and orthogonal equivariance, TAPE ensures the stability of positional encodings during updates, improving robustness and adaptability. Our method can be easily integrated into pre-trained transformers, offering parameter-efficient fine-tuning with minimal overhead. Extensive experiments show that TAPE achieves superior performance in language modeling, arithmetic reasoning, and long-context retrieval tasks compared to existing positional embedding techniques.
Poster
Song Bian · Minghao Yan · Shivaram Venkataraman
[ East Exhibition Hall A-B ]
Abstract
Scaling laws are powerful tools to predict the performance of large language models. However, current scaling laws fall short of accounting for inference costs. In this work, we first show that model architecture affects inference latency, where models of the same size can have up to $3.5\times$ difference in latency. To tackle this challenge, we modify the Chinchilla scaling laws to co-optimize the model parameter count, the number of training tokens, and the model architecture. Because models with similar training loss can exhibit gaps in downstream evaluation, we also propose a novel method to train inference-efficient models based on the revised scaling laws. We perform extensive empirical studies to fit and evaluate our inference-aware scaling laws. We vary model parameters from 80M to 1B, training tokens from 1.6B to 30B, and model shapes, training 63 models. Guided by our inference-efficient scaling law and model selection method, we release the Morph-1B model, which improves inference latency by $1.8\times$ while maintaining accuracy on downstream tasks compared to open-source models, pushing the Pareto frontier of the accuracy-latency tradeoff. Notably, our experiments reveal that wider and shallower models can yield efficiency gains while preserving accuracy.
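A toy version of inference-aware model selection under a fitted scaling law. The Chinchilla constants below are Hoffmann et al.'s published fit, used only for illustration (the paper refits its own law), and the latency model is a purely hypothetical stand-in for the paper's architecture-aware term:

```python
import numpy as np

def loss_estimate(N, D, E=1.69, A=406.4, B=410.7, alpha=0.34, beta=0.28):
    # Chinchilla-style loss surface in parameters N and tokens D.
    return E + A / N**alpha + B / D**beta

def latency_estimate(N, width_depth_ratio, c=1e-9):
    # Hypothetical: at fixed N, wider/shallower shapes decode faster.
    return c * N / np.sqrt(width_depth_ratio)

def select_model(candidates, D, budget):
    # Among (N, shape) candidates meeting the latency budget,
    # pick the one with the lowest predicted loss.
    feasible = [(N, r) for N, r in candidates
                if latency_estimate(N, r) <= budget]
    return min(feasible, key=lambda nr: loss_estimate(nr[0], D))
```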
Poster
Lin Zhu · Xiantao Ma · Xiao Wang · Lizhi Wang · Hua Huang
[ East Exhibition Hall A-B ]
Abstract
Event cameras are innovative sensors that capture brightness changes as asynchronous events rather than traditional intensity frames. These cameras offer substantial advantages over conventional cameras, including high temporal resolution, high dynamic range, and the elimination of motion blur. However, defocus blur, a common image quality degradation resulting from out-of-focus lenses, complicates the challenge of event-based imaging. Due to the unique imaging mechanism of event cameras, existing focusing algorithms struggle to operate efficiently on sparse event data. In this work, we propose EvFocus, a novel architecture designed to reconstruct sharp images from defocus event streams for the first time. Our work includes the development of an event-based out-of-focus camera model and a simulator to generate realistic defocus event streams for robust training and testing. EvFocus integrates a temporal information encoder, a blur-aware two-branch decoder, and a reconstruction and re-defocus module to effectively learn and correct defocus blur. Extensive experiments on both simulated and real-world datasets demonstrate that EvFocus outperforms existing methods across varying lighting conditions and blur sizes, proving its robustness and practical applicability in event-based defocus imaging.
Poster
Mengyang Sun · Yihao Wang · Tao Feng · Dan Zhang · Yifan Zhu · Jie Tang
[ East Exhibition Hall A-B ]
Abstract
In order to streamline the fine-tuning of foundation models, Low-Rank Adapters (LoRAs) have been substantially adopted across various fields, including instruction tuning and domain adaptation. The underlying concept of LoRA involves decomposing a full-rank matrix into the product of two lower-rank matrices, which reduces storage consumption and accelerates the training process. Furthermore, to address the limited expressive capacity of LoRA, the Mixture-of-Experts (MoE) approach has been introduced to incorporate multiple LoRA adapters. The integration of LoRA experts leads to a visible improvement across several downstream scenarios. However, the mixture of LoRAs (MoE-LoRA) still exhibits low robustness during tuning and inference. Inspired by Riemannian preconditioners, which train LoRA as a sub-space projector, we propose a new training strategy for MoE-LoRA to stabilize and boost its feature learning by gate-rescaled multi-space projections. We provide both a theoretical solution as well as an alternative engineering strategy. Experiments with SGD and AdamW optimizers demonstrate the effectiveness of our methodology. Source code is available at https://github.com/THUDM/MoELoRA_Riemannian.
Poster
Xing Li · Zeyu Xing · Yiming Li · Linping Qu · Huiling Zhen · Yiwu Yao · Wulong Liu · Sinno Jialin Pan · Mingxuan Yuan
[ East Exhibition Hall A-B ]
Abstract
KV cache quantization can improve Large Language Models (LLMs) inference throughput and latency in long contexts and large batch-size scenarios while preserving LLM effectiveness. However, current methods have three unsolved issues: overlooking layer-wise sensitivity to KV cache quantization, high overhead of online fine-grained decision-making, and low flexibility to different LLMs and constraints. Therefore, we theoretically analyze the inherent correlation of layer-wise transformer attention patterns to KV cache quantization errors and study why the key cache is generally more important than the value cache for quantization error reduction. We further propose a simple yet effective framework, KVTuner, to adaptively search for the optimal hardware-friendly layer-wise KV quantization precision pairs for coarse-grained KV cache with multi-objective optimization, and directly utilize the offline searched configurations during online inference. To reduce the computational cost of offline calibration, we utilize intra-layer KV precision pair pruning and inter-layer clustering to reduce the search space. Experimental results show that we can achieve nearly lossless 3.25-bit mixed precision KV cache quantization for LLMs like Llama-3.1-8B-Instruct and 4.0-bit for sensitive models like Qwen2.5-7B-Instruct on mathematical reasoning tasks. The maximum inference throughput can be improved by 21.25\% compared with KIVI-KV8 quantization over various context lengths. Our code and searched configurations are available at …
Poster
Baijiong Lin · Weisen Jiang · Yuancheng Xu · Hao Chen · YINGCONG CHEN
[ East Exhibition Hall A-B ]
Abstract
Multi-objective test-time alignment aims to adapt large language models (LLMs) to diverse multi-dimensional user preferences during inference while keeping LLMs frozen. Recently, GenARM (Xu et al., 2025) first independently trains Autoregressive Reward Models (ARMs) for each preference dimension without awareness of each other, then combines their outputs based on user-specific preference vectors during inference to achieve multi-objective test-time alignment, leading to two key limitations: the need for *multiple* ARMs increases the inference cost, and the *separate* training of ARMs causes the misalignment between the guided generation and the user preferences. To address these issues, we propose Preference-aware ARM (PARM), a *single* unified ARM trained across *all* preference dimensions. PARM uses our proposed Preference-Aware Bilinear Low-Rank Adaptation (PBLoRA), which employs a bilinear form to condition the ARM on preference vectors, enabling it to achieve precise control over preference trade-offs during inference. Experiments demonstrate that PARM reduces inference costs and achieves better alignment with preference vectors compared with existing methods. Additionally, PARM enables weak-to-strong guidance, allowing a smaller PARM to guide a larger frozen LLM without expensive training, making multi-objective alignment accessible with limited computing resources. The code is available at https://github.com/Baijiong-Lin/PARM.
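A hedged sketch of the bilinear conditioning idea: a single low-rank adapter whose rank-space mixing matrix is a function of the preference vector, so one set of weights covers all trade-offs. Dimensions, names, and initialization are illustrative and may differ from the paper's PBLoRA:

```python
import torch
import torch.nn as nn

class PreferenceBilinearLoRA(nn.Module):
    def __init__(self, d_in, d_out, rank, n_prefs):
        super().__init__()
        self.base = nn.Linear(d_in, d_out, bias=False)
        self.base.weight.requires_grad_(False)     # frozen pretrained weight
        self.A = nn.Parameter(torch.randn(rank, d_in) * 0.01)
        self.B = nn.Parameter(torch.zeros(d_out, rank))
        # One rank x rank mixing matrix per preference dimension.
        self.H = nn.Parameter(torch.randn(n_prefs, rank, rank) * 0.01)

    def forward(self, x, alpha):
        # alpha: (n_prefs,) user preference weights.
        H_alpha = torch.einsum('p,prs->rs', alpha, self.H)
        delta = self.B @ H_alpha @ self.A          # (d_out, d_in)
        return self.base(x) + x @ delta.T
```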
Poster
Yadong Sun · Xiaofeng Cao · Ivor Tsang · Heng Tao Shen
[ East Exhibition Hall A-B ]
Abstract
Static systems exhibit diverse structural properties, such as hierarchical, scale-free, and isotropic patterns, where different geometric spaces offer unique advantages. Methods combining multiple geometries have proven effective in capturing these characteristics. However, real-world systems often evolve dynamically, introducing significant challenges in modeling their temporal changes. To overcome this limitation, we propose a unified cross-geometric learning framework for dynamic systems, which synergistically integrates Euclidean and hyperbolic spaces, aligning embedding spaces with structural properties through fine-grained substructure modeling. Our framework further incorporates a temporal state aggregation mechanism and an evolution-driven optimization objective, enabling comprehensive and adaptive modeling of both nodal and relational dynamics over time. Extensive experiments on diverse real-world dynamic graph datasets highlight the superiority of our approach in capturing complex structural evolution, surpassing existing methods across multiple metrics.
Poster
Biswadeep Chakraborty · Harshit Kumar · Saibal Mukhopadhyay
[ East Exhibition Hall A-B ]
Abstract
Graph Neural Networks (GNNs) face a critical limitation known as oversmoothing, where increasing network depth leads to homogenized node representations, severely compromising their expressiveness. We present a novel dynamical systems perspective on this challenge, revealing oversmoothing as an emergent property of GNNs' convergence to low-dimensional attractor states. Based on this insight, we introduce **DYNAMO-GAT**, which combines noise-driven covariance analysis with Anti-Hebbian learning to dynamically prune attention weights, effectively preserving distinct attractor states. We provide theoretical guarantees for DYNAMO-GAT's effectiveness and demonstrate its superior performance on benchmark datasets, consistently outperforming existing methods while requiring fewer computational resources. This work establishes a fundamental connection between dynamical systems theory and GNN behavior, providing both theoretical insights and practical solutions for deep graph learning.
Spotlight Poster
James Rowbottom · Georg Maierhofer · Teo Deveney · Eike Müller · Alberto Paganini · Katharina Schratz · Pietro Lió · Carola-Bibiane Schönlieb · Chris Budd
[ East Exhibition Hall A-B ]
Abstract
We present a novel, and effective, approach to achieve optimal mesh relocation in finite element methods (FEMs). The cost and accuracy of FEMs are critically dependent on the choice of mesh points. Mesh relocation (r-adaptivity) seeks to optimise the mesh geometry to obtain the best solution accuracy at a given computational budget. Classical r-adaptivity relies on the solution of a separate nonlinear ``meshing'' PDE to determine mesh point locations. This incurs significant cost at remeshing, and relies on estimates that relate interpolation- and FEM-error. Recent machine learning approaches have focused on the construction of fast surrogates for such classical methods. Instead, our new approach trains a graph neural network (GNN) to determine mesh point locations by directly minimising the FE solution error, computed with the PDE system Firedrake, to achieve higher solution accuracy. Our GNN architecture closely aligns the mesh solution space to that of classical meshing methodologies, thus replacing classical estimates for optimality with a learnable strategy. This allows for rapid and robust training and results in an extremely efficient and effective GNN approach to online r-adaptivity. Our method outperforms both classical, and prior ML, approaches to r-adaptive meshing. In particular, it achieves lower FE solution error, whilst retaining the significant …
Poster
Zhaoxuan Kan · Husheng Han · shangyi shi · Tenghui Hua · Hang Lu · Xiaowei Li · Jianan Mu · Xing Hu
[ East Exhibition Hall A-B ]
Abstract
Graph Convolutional Neural Networks (GCNs) have gained widespread popularity in various fields like personal healthcare and financial systems, due to their remarkable performance. Despite the growing demand for cloud-based GCN services, privacy concerns over sensitive graph data remain significant. Homomorphic Encryption (HE) facilitates Privacy-Preserving Machine Learning (PPML) by allowing computations to be performed on encrypted data. However, HE introduces substantial computational overhead, particularly for GCN operations that require rotations and multiplications in matrix products. The sparsity of GCNs offers significant performance potential, but their irregularity introduces additional operations that reduce practical gains. In this paper, we propose FicGCN, a HE-based framework specifically designed to harness the sparse characteristics of GCNs and strike a globally optimal balance between aggregation and combination operations. FicGCN employs a latency-aware packing scheme, a Sparse Intra-Ciphertext Aggregation (SpIntra-CA) method to minimize rotation overhead, and a region-based data reordering driven by local adjacency structure. We evaluated FicGCN on several popular datasets, and the results show that FicGCN achieved the best performance across all tested datasets, with up to a $4.10\times$ improvement over the latest design.
Poster
Yipeng Zhang · Longlong Li · Kelin Xia
[ East Exhibition Hall A-B ]
Abstract
Graph Neural Networks (GNNs) have proven effective for learning from graph-structured data through their neighborhood-based message passing framework. Many hierarchical graph clustering pooling methods modify this framework by introducing clustering-based strategies, enabling the construction of more expressive and powerful models. However, all of these message passing frameworks rely heavily on the connectivity structure of graphs, limiting their ability to capture the rich geometric features inherent in geometric graphs. To address this, we propose Rhomboid Tiling (RT) clustering, a novel clustering method based on the rhomboid tiling structure, which performs clustering by leveraging the complex geometric information of the data and effectively extracts its higher-order geometric structures. Moreover, we design RTPool, a hierarchical graph clustering pooling model based on RT clustering for graph classification tasks. The proposed model demonstrates superior performance, outperforming 21 state-of-the-art competitors on all the 7 benchmark datasets.
Spotlight Poster
Yu He · Ellen Vitercik
[ East Exhibition Hall A-B ]
Abstract
Neural Algorithmic Reasoning (NAR) trains neural networks to simulate classical algorithms, enabling structured and interpretable reasoning over complex data. While prior research has predominantly focused on learning exact algorithms for polynomial-time-solvable problems, extending NAR to harder problems remains an open challenge. In this work, we introduce a general NAR framework grounded in the primal-dual paradigm, a classical method for designing efficient approximation algorithms. By leveraging a bipartite representation between primal and dual variables, we establish an alignment between primal-dual algorithms and Graph Neural Networks. Furthermore, we incorporate optimal solutions from small instances to greatly enhance the model’s reasoning capabilities. Our empirical results demonstrate that our model not only simulates but also outperforms approximation algorithms for multiple tasks, exhibiting robust generalization to larger and out-of-distribution graphs. Moreover, we highlight the framework’s practical utility by integrating it with commercial solvers and applying it to real-world datasets.
Poster
Varshita Kolipaka · Akshit Sinha · Debangan Mishra · Sumit Kumar · Arvindh Arun · Shashwat Goel · Ponnurangam Kumaraguru
[ East Exhibition Hall A-B ]
Abstract
Graph Neural Networks (GNNs) are increasingly being used for a variety of ML applications on graph data. Because graph data does not follow the independently and identically distributed (*i.i.d.*) assumption, adversarial manipulations or incorrect data can propagate to other data points through message passing, which deteriorates the model's performance. To allow model developers to remove the adverse effects of manipulated entities from a trained GNN, we study the recently formulated problem of *Corrective Unlearning*. We find that current graph unlearning methods fail to unlearn the effect of manipulations even when the whole manipulated set is known. We introduce a new graph unlearning method, **Cognac**, which can unlearn the effect of the manipulation set even when only $5\%$ of it is identified. It recovers most of the performance of a strong oracle with fully corrected training data, even beating retraining from scratch without the deletion set, and is $8\times$ more efficient while also scaling to large datasets. We hope our work assists GNN developers in mitigating harmful effects caused by issues in real-world data, post-training.
Poster
Joshua Southern · Yam Eitan · Guy Bar Shalom · Michael Bronstein · Haggai Maron · Fabrizio Frasca
[ East Exhibition Hall A-B ]
Abstract
Subgraph GNNs have emerged as promising architectures that overcome the expressiveness limitations of Graph Neural Networks (GNNs) by processing bags of subgraphs. Despite their compelling empirical performance, these methods are afflicted by a high computational complexity: they process bags whose size grows linearly in the number of nodes, hindering their applicability to larger graphs. In this work, we propose an effective and easy-to-implement approach to dramatically alleviate the computational cost of Subgraph GNNs and unleash broader applications thereof. Our method, dubbed HyMN, leverages walk-based centrality measures to sample a small number of relevant subgraphs and drastically reduce the bag size. By drawing a connection to perturbation analysis, we highlight the strength of the proposed centrality-based subgraph sampling, and further prove that these walk-based centralities can be additionally used as Structural Encodings for improved discriminative power. A comprehensive set of experimental results demonstrates that HyMN provides an effective synthesis of expressiveness, efficiency, and downstream performance, unlocking the application of Subgraph GNNs to dramatically larger graphs. Not only does our method outperform more sophisticated subgraph sampling approaches, it is also competitive, and sometimes better, than other state-of-the-art approaches for a fraction of their runtime.
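The centrality-based sampling can be pictured with a truncated walk-count series: score nodes by an approximation of subgraph centrality and keep the top-k as subgraph roots. The exact centrality and normalization used by HyMN may differ; this is a sketch:

```python
import numpy as np

def walk_centrality_topk(adj: np.ndarray, k: int, n_terms: int = 6):
    # Truncated series of diag(A^t)/t! approximates subgraph centrality,
    # i.e., a weighted count of closed walks at each node.
    n = adj.shape[0]
    power, cent, fact = np.eye(n), np.zeros(n), 1.0
    for t in range(1, n_terms + 1):
        power = power @ adj
        fact *= t
        cent += np.diag(power) / fact
    return np.argsort(-cent)[:k]   # roots of the sampled subgraphs
```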
Poster
Corinna Coupette · Jeremy Wayland · Emily Simons · Bastian Rieck
[ East Exhibition Hall A-B ]
Abstract
Benchmark datasets have proved pivotal to the success of graph learning, and *good* benchmark datasets are crucial to guide the development of the field. Recent research has highlighted problems with graph-learning datasets and benchmarking practices—revealing, for example, that methods which ignore the graph structure can outperform graph-based approaches. Such findings raise two questions: (1) What makes a good graph-learning dataset, and (2) how can we evaluate dataset quality in graph learning? Our work addresses these questions. As the classic evaluation setup uses datasets to evaluate models, it does not apply to dataset evaluation. Hence, we start from first principles. Observing that graph-learning datasets uniquely combine two modes—graph structure and node features—we introduce RINGS, a flexible and extensible *mode-perturbation framework* to assess the quality of graph-learning datasets based on *dataset ablations*—i.e., quantifying differences between the original dataset and its perturbed representations. Within this framework, we propose two measures—*performance separability* and *mode complementarity*—as evaluation tools, each assessing the capacity of a graph dataset to benchmark the power and efficacy of graph-learning methods from a distinct angle. We demonstrate the utility of our framework for dataset evaluation via extensive experiments on graph-level tasks and derive actionable recommendations for improving the evaluation of …
Poster
Juwei Yue · Haikuo Li · Jiawei Sheng · Xiaodong Li · Taoyu Su · Tingwen Liu · Li Guo
[ East Exhibition Hall A-B ]
Abstract
Graph neural networks (GNNs) leverage message passing mechanisms to learn the topological features of graph data. Traditional GNNs learn node features in a spatial domain unrelated to the topology, which can hardly guarantee that topological features are captured. In this paper, we formulate message passing as a system of hyperbolic partial differential equations (hyperbolic PDEs), constituting a dynamical system that explicitly maps node representations into a particular solution space. This solution space is spanned by a set of eigenvectors describing the topological structure of graphs. Within this system, at any moment in time, a node's features can be decomposed into a superposition of this basis of eigenvectors. This not only enhances the interpretability of message passing but also enables the explicit extraction of fundamental characteristics of the topological structure. Furthermore, by solving this system of hyperbolic partial differential equations, we establish a connection with spectral graph neural networks (spectral GNNs), serving as a message passing enhancement paradigm for spectral GNNs. We further introduce polynomials to approximate arbitrary filter functions. Extensive experiments demonstrate that the paradigm of hyperbolic PDEs not only exhibits strong flexibility but also significantly enhances the performance of various spectral GNNs across diverse graph tasks.
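As a concrete picture of the dynamics, a graph wave equation $\partial^2 x/\partial t^2 = -Lx$ (the prototypical hyperbolic PDE) can be stepped with a leapfrog scheme; its solutions are superpositions of Laplacian eigenvectors, the topological basis referred to above. This is an illustrative discretization, not the paper's full model:

```python
import torch

def wave_propagation(x0, laplacian, n_steps=10, dt=0.1):
    # Central-difference (leapfrog) integration of d^2x/dt^2 = -L x.
    x_prev, x = x0.clone(), x0.clone()
    for _ in range(n_steps):
        x_next = 2 * x - x_prev - dt**2 * (laplacian @ x)
        x_prev, x = x, x_next
    return x
```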
Poster
Alessandro Manenti · Daniele Zambon · Cesare Alippi
[ East Exhibition Hall A-B ]
Abstract
Graph neural networks use relational information as an inductive bias to enhance prediction performance. Often, task-relevant relations are unknown, and graph structure learning approaches have been proposed to learn them from data. Given their latent nature, no graph observations are available to provide a direct training signal to the learnable relations. Therefore, graph topologies are typically learned on the prediction task alongside the other graph neural network parameters. In this paper, we demonstrate that minimizing point-prediction losses does not guarantee proper learning of the latent relational information and its associated uncertainty. Conversely, we prove that suitable loss functions on the stochastic model outputs simultaneously grant solving two tasks: (i) learning the unknown distribution of the latent graph and (ii) achieving optimal predictions of the target variable. Finally, we propose a sampling-based method that solves this joint learning task. Empirical results validate our theoretical claims and demonstrate the effectiveness of the proposed approach.
Poster
Daeho Um · Sunoh Kim · Jiwoong Park · Jongin Lim · Seong Jin Ahn · Seulki Park
[ East Exhibition Hall A-B ]
Abstract
In this paper, we address learning tasks on graphs with missing features, enhancing the applicability of graph neural networks to real-world graph-structured data. We identify a critical limitation of existing imputation methods based on feature propagation: they produce channels whose imputed values are nearly identical across nodes, and these low-variance channels contribute very little to performance in graph learning tasks. To overcome this issue, we introduce synthetic features that target the root cause of low-variance channel production, thereby increasing variance in these channels. By preventing propagation-based imputation methods from generating meaningless feature values shared across all nodes, our synthetic feature propagation scheme mitigates significant performance degradation, even under extreme missing rates. Extensive experiments demonstrate the effectiveness of our approach across various graph learning tasks with missing features, ranging from low to extremely high missing rates. Additionally, we provide both empirical evidence and theoretical proof to validate the low-variance problem. The source code is available at https://github.com/daehoum1/fisf.
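For context, the classical feature-propagation baseline that produces the low-variance channels looks roughly like this: diffuse features over the normalized adjacency and reset observed entries each step. The paper's remedy injects synthetic values into collapsing channels; the routine below is only the baseline being diagnosed:

```python
import torch

def feature_propagation(x, adj_norm, known_mask, n_iters=40):
    # x: (n_nodes, n_channels); known_mask: boolean tensor of same shape.
    out = torch.where(known_mask, x, torch.zeros_like(x))
    for _ in range(n_iters):
        out = adj_norm @ out                       # diffuse
        out = torch.where(known_mask, x, out)      # reset observed entries
    # Channels where out.var(dim=0) is near zero are the problematic ones.
    return out
```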
Poster
Ningyi Liao · Zihao Yu · Ruixiao Zeng · Siqiang Luo
[ East Exhibition Hall A-B ]
Abstract
Graph Neural Networks (GNNs) have shown promising performance, but at the cost of resource-intensive operations on graph-scale matrices. To reduce computational overhead, previous studies attempt to sparsify the graph or network parameters, but with limited flexibility and precision boundaries. In this work, we propose Unifews, a joint sparsification technique to unify graph and weight matrix operations and enhance GNN learning efficiency. The Unifews design enables adaptive compression across GNN layers with progressively increased sparsity, and is applicable to a variety of architectures with on-the-fly simplification. Theoretically, we establish a novel framework to characterize sparsified GNN learning in view of the graph optimization process, showing that Unifews effectively approximates the learning objective with bounded error and reduced computational overhead. Extensive experiments demonstrate that Unifews achieves efficiency improvements with comparable or better accuracy, including 10-20x matrix operation reduction and up to 100x acceleration on graphs up to billion-edge scale.
Poster
Nicolas Lell · Ansgar Scherp
[ East Exhibition Hall A-B ]
Abstract
Shallow node embeddings like node2vec (N2V) can be used for nodes without features or to supplement existing features with structure-based information. Embedding methods like N2V are limited in their application on new nodes, which restricts them to the transductive setting where the entire graph, including the test nodes, is available during training. We propose inductive node2vec (iN2V), which combines a post-hoc procedure to compute embeddings for nodes unseen during training and modifications to the original N2V training procedure to prepare the embeddings for this post-hoc procedure. We conduct experiments on several benchmark datasets and demonstrate that iN2V is an effective approach to bringing transductive embeddings to an inductive setting. Using iN2V embeddings improves node classification by 1 point on average, with up to 6 points of improvement depending on the dataset and the number of unseen nodes. Our iN2V is a plug-in approach to create new or enrich existing embeddings. It can also be combined with other embedding methods, making it a versatile approach for inductive node representation learning. Code to reproduce the results is available at https://github.com/Foisunt/iN2V.
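The post-hoc step can be as simple as placing an unseen node at the mean of its already-embedded neighbors (iN2V additionally adapts N2V training so the embeddings behave well under such a step). A minimal sketch with hypothetical names:

```python
import numpy as np

def posthoc_embedding(neighbors, emb: dict, dim: int):
    # emb maps training nodes to their learned vectors; unseen nodes
    # with no embedded neighbor fall back to the zero vector.
    vecs = [emb[v] for v in neighbors if v in emb]
    return np.mean(vecs, axis=0) if vecs else np.zeros(dim)
```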
Poster
Hao Li · Hao Wan · Yuzhou Chen · Dongsheng Ye · Yulia Gel · Hao Jiang
[ East Exhibition Hall A-B ]
Abstract
Dynamic graphs evolve continuously, presenting challenges for traditional graph learning due to their changing structures and temporal dependencies. Recent advancements have shown potential in addressing these challenges by developing suitable meta-learning-based dynamic graph neural network models. However, most meta-learning approaches for dynamic graphs rely on fixed weight update parameters, neglecting the essential intrinsic complex high-order topological information of dynamically evolving graphs. We have designed Dowker Zigzag Persistence (DZP), an efficient and stable dynamic graph persistent homology representation method based on Dowker complex and zigzag persistence, to capture the high-order features of dynamic graphs. Armed with the DZP ideas, we propose TMetaNet, a new meta-learning parameter update model based on dynamic topological features. By utilizing the distances between high-order topological features, TMetaNet enables more effective adaptation across snapshots. Experiments on real-world datasets demonstrate TMetaNet's state-of-the-art performance and resilience to graph noise, illustrating its high potential for meta-learning and dynamic graph analysis. Our code is available at https://github.com/Lihaogx/TMetaNet.
Poster
Dooho Lee · Myeong Kong · Sagad Hamid · Cheonwoo Lee · Jaemin Yoo
[ East Exhibition Hall A-B ]
Abstract
We revisit DropEdge, a data augmentation technique for GNNs which randomly removes edges to expose diverse graph structures during training. While DropEdge is a promising approach to effectively reducing overfitting on specific connections in the graph, we observe that its potential performance gain in supervised learning tasks is significantly limited. To understand why, we provide a theoretical analysis showing that the limited performance of DropEdge comes from a fundamental limitation that exists in many GNN architectures. Based on this analysis, we propose **Aggregation Buffer**, a parameter block specifically designed to improve the robustness of GNNs by addressing the limitation of DropEdge. Our method is compatible with any GNN model, and shows consistent performance improvements on multiple datasets. Moreover, our method effectively addresses well-known problems such as degree bias or structural disparity as a unifying solution. Code and datasets are available at https://github.com/dooho00/agg-buffer.
Poster
Louis Airale · Antonio Longa · Mattia Rigon · Andrea Passerini · Roberto Passerone
[ East Exhibition Hall A-B ]
Abstract
Graph transformers extend global self-attention to graph-structured data, achieving notable success in graph learning. Recently, random walk structural encoding (RWSE) has been found to further enhance their predictive power by encoding both structural and positional information into the edge representation. However, RWSE cannot always distinguish between edges that belong to different local graph patterns, which reduces its ability to capture the full structural complexity of graphs. This work introduces Simple Path Structural Encoding (SPSE), a novel method that utilizes simple path counts for edge encoding. We show theoretically and experimentally that SPSE overcomes the limitations of RWSE, providing a richer representation of graph structures, particularly in capturing local cyclic patterns. To make SPSE computationally tractable, we propose an efficient approximate algorithm for simple path counting. SPSE demonstrates significant performance improvements over RWSE on various benchmarks, including molecular and long-range graph datasets, achieving statistically significant gains in discriminative tasks. These results pose SPSE as a powerful edge encoding alternative for enhancing the expressivity of graph transformers.
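Exact simple-path counting, which SPSE approximates for tractability, is a bounded depth-first search; its worst-case cost grows exponentially with the length cap, which is precisely why an approximation is needed. A reference implementation of the exact count:

```python
import networkx as nx

def simple_path_counts(G: nx.Graph, u, v, max_len: int = 4):
    """counts[k] = number of simple paths with k edges from u to v."""
    counts = [0] * (max_len + 1)

    def dfs(node, length, visited):
        if node == v and length > 0:
            counts[length] += 1   # a simple path ending at v must stop here
            return
        if length == max_len:
            return
        for w in G.neighbors(node):
            if w not in visited:
                dfs(w, length + 1, visited | {w})

    dfs(u, 0, {u})
    return counts
```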
Poster
Lele Fu · Bowen Deng · Sheng Huang · Tianchi Liao · Shirui Pan · Chuan Chen
[ East Exhibition Hall A-B ]
Abstract
Federated graph learning (FGL) aims to collaboratively train a global graph neural network (GNN) on multiple private graphs while preserving local data privacy. Besides the common cases of data heterogeneity in conventional federated learning, FGL faces the unique challenge of topology heterogeneity. Most existing FGL methods alleviate the negative impact of heterogeneity by introducing global signals. However, such ways of creating increments may not be effective and can significantly increase the computational load. In light of this, we propose FedATH, an FGL method for Alleviating Topology Heterogeneity from a causal perspective. Inspired by causal theory, we argue that not all edges in a topology are necessary for the training objective; less topology information may make more sense. With the aid of an edge evaluator, the local graphs are divided into causal and biased subgraphs. A dual-GNN architecture is used to encode the two subgraphs into corresponding representations. Thus, the causal representations are drawn closer to the training objective while the biased representations are pulled away from it. Further, the Hilbert-Schmidt Independence Criterion is employed to strengthen the separability of the two subgraphs. Extensive experiments on six real-world graph datasets are conducted to demonstrate the superiority of the proposed FedATH over …
Poster
Zeyu Fang · Ming Gu · Sheng Zhou · Jiawei Chen · Qiaoyu Tan · Haishuai Wang · Jiajun Bu
[ East Exhibition Hall A-B ]
Abstract
Unsupervised Anomaly Detection (UAD) plays a crucial role in identifying abnormal patterns within data without labeled examples, holding significant practical implications across various domains. Although the individual contributions of representation learning and clustering to anomaly detection are well-established, their interdependencies remain under-explored due to the absence of a unified theoretical framework. Consequently, their collective potential to enhance anomaly detection performance remains largely untapped. To bridge this gap, in this paper, we propose a novel probabilistic mixture model for anomaly detection to establish a theoretical connection among representation learning, clustering, and anomaly detection. By maximizing a novel anomaly-aware data likelihood, representation learning and clustering can effectively reduce the adverse impact of anomalous data and collaboratively benefit anomaly detection. Meanwhile, a theoretically substantiated anomaly score is naturally derived from this framework. Lastly, drawing inspiration from gravitational analysis in physics, we have devised an improved anomaly score that more effectively harnesses the combined power of representation learning and clustering. Extensive experiments, involving 17 baseline methods across 30 diverse datasets, validate the effectiveness and generalization capability of the proposed method, surpassing state-of-the-art methods.
Poster
Kevin Rojas · Yuchen Zhu · Sichen Zhu · Felix Ye · Molei Tao
[ East Exhibition Hall A-B ]
Abstract
Diffusion models have demonstrated remarkable performance in generating unimodal data across various tasks, including image, video, and text generation. On the contrary, the joint generation of multimodal data through diffusion models is still in the early stages of exploration. Existing approaches heavily rely on external preprocessing protocols, such as tokenizers and variational autoencoders, to harmonize varied data representations into a unified, unimodal format. This process heavily demands the high accuracy of encoders and decoders, which can be problematic for applications with limited data. To lift this restriction, we propose a novel framework for building multimodal diffusion models on arbitrary state spaces, enabling native generation of coupled data across different modalities. By introducing an innovative decoupled noise schedule for each modality, we enable both unconditional and modality-conditioned generation within a single model simultaneously. We empirically validate our approach for text-image generation and mixed-type tabular data synthesis, demonstrating that it achieves competitive performance.
Poster
Gianluigi Silvestri · Luca Ambrogioni · Chieh-Hsin Lai · Yuhta Takida · Yuki Mitsufuji
[ East Exhibition Hall A-B ]
Abstract
Consistency Training (CT) has recently emerged as a strong alternative to diffusion models for image generation. However, non-distillation CT often suffers from high variance and instability, motivating ongoing research into its training dynamics. We propose Variational Consistency Training (VCT), a flexible and effective framework compatible with various forward kernels, including those in flow matching. Its key innovation is a learned noise-data coupling scheme inspired by Variational Autoencoders, where a data-dependent encoder models noise emission. This enables VCT to adaptively learn noise-to-data pairings, reducing training variance relative to the fixed, unsorted pairings in classical CT. Experiments on multiple image datasets demonstrate significant improvements: our method surpasses baselines, achieves state-of-the-art FID among non-distillation CT approaches on CIFAR-10, and matches SoTA performance on ImageNet 64x64 with only two sampling steps. Code is available at https://github.com/sony/vct.
Poster
Cheng Jin · Zhenyu Xiao · Chutao Liu · Yuantao Gu
[ East Exhibition Hall A-B ]
Abstract
Classifier-free guidance (CFG) has emerged as a pivotal advancement in text-to-image latent diffusion models, establishing itself as a cornerstone technique for achieving high-quality image synthesis. However, under high guidance weights, where text-image alignment is significantly enhanced, CFG also leads to pronounced color distortions in the generated images. We identify that these distortions stem from the amplification of sample norms in the latent space. We present a theoretical framework that elucidates the mechanisms of norm amplification and anomalous diffusion phenomena induced by classifier-free guidance. Leveraging our theoretical insights and the latent space structure, we propose an Angle Domain Guidance (ADG) algorithm. ADG constrains magnitude variations while optimizing angular alignment, thereby mitigating color distortions while preserving the enhanced text-image alignment achieved at higher guidance weights. Experimental results demonstrate that ADG significantly outperforms existing methods, generating images that not only maintain superior text alignment but also exhibit improved color fidelity and better alignment with human perceptual preferences.
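A hedged sketch of the norm-versus-angle decomposition the abstract alludes to: take the usual classifier-free guidance direction for its angular effect, but rescale the result so its per-sample norm does not grow with the guidance weight. The paper's actual update rule differs in detail:

```python
import torch

def angle_constrained_guidance(eps_cond, eps_uncond, w):
    guided = eps_uncond + w * (eps_cond - eps_uncond)   # standard CFG
    # Rescale to the conditional prediction's norm so that a high w changes
    # direction (alignment) without amplifying latent magnitudes.
    target = eps_cond.flatten(1).norm(dim=1, keepdim=True)
    g = guided.flatten(1)
    g = g * (target / g.norm(dim=1, keepdim=True).clamp_min(1e-8))
    return g.view_as(guided)
```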
Poster
Ruchika Chavhan · Abhinav Mehrotra · Malcolm Chadwick · Alberto Gil Couto Pimentel Ramos · Luca Morreale · Mehdi Noroozi · Sourav Bhattacharya
[ East Exhibition Hall A-B ]
Abstract
Text-to-image synthesis has witnessed remarkable advancements in recent years. Many attempts have been made to adopt text-to-image models to support multiple tasks. However, existing approaches typically require resource-intensive re-training or additional parameters to accommodate for the new tasks, which makes the model inefficient for on-device deployment. We propose *Multi-Task Upcycling* (MTU), a simple yet effective recipe that extends the capabilities of a pre-trained text-to-image diffusion model to support a variety of image-to-image generation tasks. MTU replaces Feed-Forward Network (FFN) layers in the diffusion model with smaller FFNs, referred to as *experts*, and combines them with a dynamic routing mechanism. To the best of our knowledge, MTU is the first multi-task diffusion modeling approach that seamlessly blends multi-tasking with on-device compatibility, by mitigating the issue of parameter inflation. We show that the performance of MTU is on par with the single-task fine-tuned diffusion models across several tasks including *image editing, super-resolution*, and *inpainting*, while maintaining similar latency and computational load (GFLOPs) as the single-task fine-tuned models.
Poster
Xiancheng Sun · Senmao Ma · Shengxi Li · Mai Xu · Jingyuan Xia · Lai Jiang · Xin Deng · Jiali Wang
[ East Exhibition Hall A-B ]
Abstract
Panoramic image outpainting plays a pivotal role in immersive content generation, allowing for seamless restoration and completion of panoramic content. Given that the majority of generative outpainting solutions operate on planar images, existing methods for panoramic images address the sphere nature through soft regularisation during end-to-end learning, which still fails to fully exploit the spherical content. In this paper, we make the first attempt to impose the sphere nature in the design of the diffusion model, such that the panoramic format is intrinsically ensured during the learning procedure, named the spherical-nested diffusion (SpND) model. This is achieved by employing spherical noise in the diffusion process to address the structural prior, together with a newly proposed spherical deformable convolution (SDC) module to intrinsically learn the panoramic knowledge. Upon this, the proposed method is effectively integrated into a pre-trained diffusion model, outperforming existing state-of-the-art methods for panoramic image outpainting. In particular, our SpND method reduces the FID values by more than 50\% against the state-of-the-art PanoDiffusion method. Codes are publicly available at \url{https://github.com/chronos123/SpND}.
Poster
Sucheng Ren · Qihang Yu · Ju He · Xiaohui Shen · Alan Yuille · Liang-Chieh Chen
[ East Exhibition Hall A-B ]
Abstract
Autoregressive (AR) modeling has achieved remarkable success in natural language processing by enabling models to generate text with coherence and contextual understanding through next token prediction. Recently, in image generation, VAR proposes scale-wise autoregressive modeling, which extends the next token prediction to the next scale prediction, preserving the 2D structure of images. However, VAR encounters two primary challenges: (1) its complex and rigid scale design limits generalization in next scale prediction, and (2) the generator’s dependence on a discrete tokenizer with the same complex scale structure restricts modularity and flexibility in updating the tokenizer. To address these limitations, we introduce FlowAR, a general next scale prediction method featuring a streamlined scale design, where each subsequent scale is simply double the previous one. This eliminates the need for VAR’s intricate multi-scale residual tokenizer and enables the use of any off-the-shelf Variational AutoEncoder (VAE). Our simplified design enhances generalization in next scale prediction and facilitates the integration of Flow Matching for high-quality image synthesis. We validate the effectiveness of FlowAR on the challenging ImageNet-256 benchmark, demonstrating superior generation performance compared to previous methods. Code is available at \href{https://github.com/OliverRensu/FlowAR}{https://github.com/OliverRensu/FlowAR}.
Poster
Zhiwei Tang · Jiangweizhi Peng · Jiasheng Tang · Mingyi Hong · Fan Wang · Tsung-Hui Chang
[ East Exhibition Hall A-B ]
Abstract
In this work, we focus on the alignment problem of diffusion models with a continuous reward function, which represents specific objectives for downstream tasks, such as increasing darkness or improving the aesthetics of images. The central goal of the alignment problem is to adjust the distribution learned by diffusion models such that the generated samples maximize the target reward function. We propose a novel alignment approach, named Direct Noise Optimization (DNO), that optimizes the injected noise during the sampling process of diffusion models. By design, DNO operates at inference-time, and thus is tuning-free and prompt-agnostic, with the alignment occurring in an online fashion during generation. We rigorously study the theoretical properties of DNO and also propose variants to deal with non-differentiable reward functions. Furthermore, we identify that naive implementation of DNO occasionally suffers from the out-of-distribution reward hacking problem, where optimized samples have high rewards but are no longer in the support of the pretrained distribution. To remedy this issue, we leverage classical high-dimensional statistics theory to derive an effective probability regularization technique. We conduct extensive experiments on several important reward functions and demonstrate that the proposed DNO approach can achieve state-of-the-art reward scores within a reasonable time budget for generation.
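The core of DNO fits in a short loop: treat the injected noise as the optimization variable and ascend the reward through a differentiable sampler. This sketch omits the paper's probability regularization against reward hacking, and `sampler`/`reward_fn` are assumed callables:

```python
import torch

def direct_noise_optimization(sampler, reward_fn, shape, steps=50, lr=0.01):
    noise = torch.randn(shape, requires_grad=True)
    opt = torch.optim.Adam([noise], lr=lr)
    for _ in range(steps):
        sample = sampler(noise)       # differentiable denoising trajectory
        loss = -reward_fn(sample)     # gradient ascent on the reward
        opt.zero_grad()
        loss.backward()
        opt.step()
    return noise.detach()
```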
Spotlight Poster
Kaiwen Zheng · Yongxin Chen · Huayu Chen · Guande He · Ming-Yu Liu · Jun Zhu · Qinsheng Zhang
[ East Exhibition Hall A-B ]
Abstract
While likelihood-based generative models, particularly diffusion and autoregressive models, have achieved remarkable fidelity in visual generation, the maximum likelihood estimation (MLE) objective, which minimizes the forward KL divergence, inherently suffers from a mode-covering tendency that limits the generation quality under limited model capacity. In this work, we propose Direct Discriminative Optimization (DDO) as a unified framework that integrates likelihood-based generative training and GAN-type discrimination to bypass this fundamental constraint by exploiting reverse KL and self-generated negative signals. Our key insight is to parameterize a discriminator implicitly using the likelihood ratio between a learnable target model and a fixed reference model, drawing parallels with the philosophy of Direct Preference Optimization (DPO). Unlike GANs, this parameterization eliminates the need for joint training of generator and discriminator networks, allowing for direct, efficient, and effective finetuning of a well-trained model to its full potential beyond the limits of MLE. DDO can be performed iteratively in a self-play manner for progressive model refinement, with each round requiring less than 1\% of pretraining epochs. Our experiments demonstrate the effectiveness of DDO by significantly advancing the previous SOTA diffusion model EDM, reducing FID scores from 1.79/1.58/1.96 to new records of 1.30/0.97/1.26 on CIFAR-10/ImageNet-64/ImageNet 512$\times$512 datasets without any …
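The implicit discriminator can be written down directly: the log-likelihood ratio between the learnable model and the frozen reference serves as the discriminator logit, pushed up on real data and down on self-generated samples. A DPO-style sketch with an assumed temperature `beta`; the weighting details may differ from the paper's:

```python
import torch.nn.functional as F

def ddo_loss(logp_model_real, logp_ref_real,
             logp_model_fake, logp_ref_fake, beta=1.0):
    d_real = beta * (logp_model_real - logp_ref_real)
    d_fake = beta * (logp_model_fake - logp_ref_fake)
    # Binary discrimination loss on the implicit logit d(x).
    return -(F.logsigmoid(d_real).mean() + F.logsigmoid(-d_fake).mean())
```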
Poster
Dohoon Lee · Jaehyun Park · Hyunwoo Kim · Kyogu Lee
[ East Exhibition Hall A-B ]
Abstract
Flow and diffusion models have demonstrated strong performance and training stability across various tasks but lack two critical properties of simulation-based methods: freedom of dimensionality and adaptability to different inference trajectories. To address this limitation, we propose the Multidimensional Adaptive Coefficient (MAC), a plug-in module for flow and diffusion models that extends conventional unidimensional coefficients to multidimensional ones and enables inference trajectory-wise adaptation. MAC is trained via simulation-based feedback through adversarial refinement. Empirical results across diverse frameworks and datasets demonstrate that MAC enhances generative quality with high training efficiency. Consequently, our work offers a new perspective on inference trajectory optimality, encouraging future research to move beyond vector field design and to leverage training-efficient, simulation-based optimization.
Spotlight Poster
Hila Chefer · Uriel Singer · Amit Zohar · Yuval Kirstain · Adam Polyak · Yaniv Taigman · Lior Wolf · Shelly Sheynin
[ East Exhibition Hall A-B ]
Abstract
Despite tremendous recent progress, generative video models still struggle to capture real-world motion, dynamics, and physics. We show that this limitation arises from the conventional pixel reconstruction objective, which biases models toward appearance fidelity at the expense of motion coherence. To address this, we introduce **VideoJAM**, a novel framework that instills an effective motion prior into video generators by encouraging the model to learn *a joint appearance-motion representation*. VideoJAM is composed of two complementary units. During training, we extend the objective to predict both the generated pixels and their corresponding motion from a single learned representation. During inference, we introduce **Inner-Guidance**, a mechanism that steers the generation toward coherent motion by leveraging the model's own evolving motion prediction as a dynamic guidance signal. Notably, our framework can be applied to any video model with minimal adaptations, requiring no modifications to the training data or scaling of the model. VideoJAM achieves state-of-the-art performance in motion coherence, surpassing highly competitive proprietary models while also enhancing the perceived visual quality of the generations. These findings emphasize that appearance and motion can be complementary and, when effectively integrated, enhance both the visual quality and the coherence of video generation.
Poster
Anant Khandelwal
[ East Exhibition Hall A-B ]
Abstract
Animating clipart images with seamless motion while maintaining visual fidelity and temporal coherence presents significant challenges. Existing methods, such as AniClipart, effectively model spatial deformations but often fail to ensure smooth temporal transitions, resulting in artifacts like abrupt motions and geometric distortions. Similarly, text-to-video (T2V) and image-to-video (I2V) models struggle to handle clipart due to the mismatch in statistical properties between natural video and clipart styles. This paper introduces FlexiClip, a novel approach designed to overcome these limitations by addressing the intertwined challenges of temporal consistency and geometric integrity. FlexiClip extends traditional Bézier curve-based trajectory modeling with key innovations: temporal Jacobians to correct motion dynamics incrementally, continuous-time modeling via probability flow ODEs (pfODEs) to mitigate temporal noise, and a flow matching loss inspired by GFlowNet principles to optimize smooth motion transitions. These enhancements ensure coherent animations across complex scenarios involving rapid movements and non-rigid deformations. Extensive experiments validate the effectiveness of FlexiClip in generating animations that are not only smooth and natural but also structurally consistent across diverse clipart types, including humans and animals. By integrating spatial and temporal modeling with pre-trained video diffusion models, FlexiClip sets a new standard for high-quality clipart animation, offering robust performance across a wide …
Poster
Yin Lu · Xuening Zhu · Tong He · David Wipf
[ East Exhibition Hall A-B ]
Abstract
Is there really much more to say about sparse autoencoders (SAEs)? Autoencoders in general, and SAEs in particular, represent deep architectures that are capable of modeling low-dimensional latent structure in data. Such structure could reflect, among other things, correlation patterns in large language model activations, or complex natural image manifolds. And yet despite the wide-ranging applicability, there have been relatively few changes to SAEs beyond the original recipe from decades ago, namely, standard deep encoder/decoder layers trained with a classical/deterministic sparse regularizer applied within the latent space. One possible exception is the variational autoencoder (VAE), which adopts a stochastic encoder module capable of producing sparse representations when applied to manifold data. In this work we formalize underappreciated weaknesses with both canonical SAEs, as well as analogous VAEs applied to similar tasks, and propose a hybrid alternative model that circumvents these prior limitations. In terms of theoretical support, we prove that global minima of our proposed model recover certain forms of structured data spread across a union of manifolds. Meanwhile, empirical evaluations on synthetic and real-world datasets substantiate the efficacy of our approach in accurately estimating underlying manifold dimensions and producing sparser latent representations without compromising reconstruction error. In general, we …
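For concreteness, the "original recipe" the abstract refers to, a deterministic deep encoder/decoder with a classical sparse regularizer in the latent space, looks roughly like this minimal baseline (dimensions and penalty weight are illustrative):

```python
# Canonical sparse autoencoder (SAE) baseline; the paper's hybrid replaces this.
import torch
import torch.nn as nn

class SparseAutoencoder(nn.Module):
    def __init__(self, d_in=512, d_latent=2048):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(d_in, d_latent), nn.ReLU())
        self.dec = nn.Linear(d_latent, d_in)

    def loss(self, x, l1=1e-3):
        z = self.enc(x)
        recon = self.dec(z)
        # Reconstruction error plus a deterministic L1 sparsity penalty.
        return ((recon - x) ** 2).mean() + l1 * z.abs().mean()

sae = SparseAutoencoder()
print(sae.loss(torch.randn(32, 512)))
```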
Poster
Jonggeon Park · Giung Nam · Hyunsu Kim · Jongmin Yoon · Juho Lee
[ East Exhibition Hall A-B ]
Abstract
Neural network ensembles have proven effective in improving performance across a range of tasks; however, their high computational cost limits their applicability in resource-constrained environments or for large models. Ensemble distillation, the process of transferring knowledge from an ensemble teacher to a smaller student model, offers a promising solution to this challenge. The key is to ensure that the student model is both cost-efficient and achieves performance comparable to the ensemble teacher. With this in mind, we propose a novel ensemble distribution distillation method, which leverages flow matching to effectively transfer the diversity from the ensemble teacher to the student model. Our extensive experiments demonstrate the effectiveness of our proposed method compared to existing ensemble distillation approaches.
Poster
Min Zhao · Guande He · Yixiao Chen · Hongzhou Zhu · Chongxuan Li · Jun Zhu
[ East Exhibition Hall A-B ]
Abstract
Recent advancements in video generation have enabled models to synthesize high-quality, minute-long videos. However, generating even longer videos with temporal coherence remains a major challenge, and existing length extrapolation methods lead to temporal repetition or motion deceleration. In this work, we systematically analyze the role of frequency components in positional embeddings and identify an intrinsic frequency that primarily governs extrapolation behavior. Based on this insight, we propose RIFLEx, a minimal yet effective approach that reduces the intrinsic frequency to suppress repetition while preserving motion consistency, without requiring any additional modifications. RIFLEx offers a true free lunch—achieving high-quality $2\times$ extrapolation on state-of-the-art video diffusion transformers in a completely training-free manner. Moreover, it enhances quality and enables $3\times$ extrapolation by minimal fine-tuning without long videos.
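A schematic of the idea, under the assumption that temporal position is encoded with RoPE-style frequencies: locate the component whose period roughly matches the training length (the heuristic here is an assumption) and lower it so that no full cycle, and hence no repetition, occurs within the extrapolated length.

```python
# Illustrative sketch of intrinsic-frequency reduction; not the paper's exact rule.
import numpy as np

def riflex_freqs(dim=64, base=10000.0, train_len=128, extrap=2):
    freqs = base ** (-np.arange(0, dim, 2) / dim)   # standard RoPE frequencies
    periods = 2 * np.pi / freqs
    k = np.argmin(np.abs(periods - train_len))      # "intrinsic" component (assumed heuristic)
    freqs[k] /= extrap                              # its period now spans extrap * train_len
    return freqs

print(riflex_freqs()[:4])
```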
Poster
Dimitri von Rütte · Janis Fluri · Yuhui Ding · Antonio Orvieto · Bernhard Schölkopf · Thomas Hofmann
[ East Exhibition Hall A-B ]
Abstract
While state-of-the-art language models achieve impressive results through next-token prediction, they have inherent limitations such as the inability to revise already generated tokens. This has prompted exploration of alternative approaches such as discrete diffusion. However, masked diffusion, which has emerged as a popular choice due to its simplicity and effectiveness, reintroduces this inability to revise words. To overcome this, we generalize masked diffusion, deriving a new family of general interpolating discrete diffusion (GIDD) models that offers greater flexibility in the design of the noising processes. Leveraging a novel diffusion ELBO, we achieve compute-matched state-of-the-art performance in diffusion language modeling. Exploiting GIDD's flexibility, we explore a hybrid approach combining masking and uniform noise, leading to improved sample quality and unlocking the ability for the model to correct its own mistakes, an area where autoregressive models have notoriously struggled. Code: https://github.com/dvruette/gidd/
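A minimal sketch of the hybrid noising the abstract describes, mixing masking with uniform noise; this is a generic forward corruption, not the paper's exact GIDD schedule, and `p_mask` is an illustrative parameter.

```python
# Each corrupted position becomes [MASK] with prob. p_mask, else a random token.
import torch

def hybrid_corrupt(tokens, t, vocab_size, mask_id, p_mask=0.8):
    corrupt = torch.rand_like(tokens, dtype=torch.float) < t    # noise level t in [0, 1]
    use_mask = torch.rand_like(tokens, dtype=torch.float) < p_mask
    uniform = torch.randint_like(tokens, vocab_size)
    noisy = torch.where(use_mask, torch.full_like(tokens, mask_id), uniform)
    return torch.where(corrupt, noisy, tokens)

x = torch.randint(0, 100, (2, 16))
print(hybrid_corrupt(x, t=0.5, vocab_size=100, mask_id=100))
```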
Poster
Wonkwang Lee · Jongwon Jeong · Taehong Moon · Hyeon-Jong Kim · Jaehyeon Kim · Gunhee Kim · Byeong-Uk Lee
[ East Exhibition Hall A-B ]
Abstract
Motion synthesis for diverse object categories holds great potential for 3D content creation but remains underexplored due to two key challenges: (1) the lack of comprehensive motion datasets that include a wide range of high-quality motions and annotations, and (2) the absence of methods capable of handling heterogeneous skeletal templates from diverse objects. To address these challenges, we contribute the following: First, we augment the Truebones Zoo dataset—a high-quality animal motion dataset covering over 70 species—by annotating it with detailed text descriptions, making it suitable for text-based motion synthesis. Second, we introduce rig augmentation techniques that generate diverse motion data while preserving consistent dynamics, enabling models to adapt to various skeletal configurations. Finally, we redesign existing motion diffusion models to dynamically adapt to arbitrary skeletal templates, enabling motion synthesis for a diverse range of objects with varying structures. Experiments show that our method learns to generate high-fidelity motions from textual descriptions for diverse and even unseen objects, setting a strong foundation for motion synthesis across diverse object categories and skeletal templates. Qualitative results are available at this [link](https://t2m4lvo.github.io).
Poster
Aditya Taparia · Som Sagar · Ransalu Senanayake
[ East Exhibition Hall A-B ]
Abstract
Understanding the inner representation of a neural network helps users improve models. Concept-based methods have become a popular choice for explaining deep neural networks post-hoc because, unlike most other explainable AI techniques, they can be used to test high-level visual "concepts" that are not directly related to feature attributes. For instance, the concept of "stripes" is important to classify an image as a zebra. Concept-based explanation methods, however, require practitioners to guess and manually collect multiple candidate concept image sets, making the process labor-intensive and prone to overlooking important concepts. Addressing this limitation, in this paper, we frame concept image set creation as an image generation problem. However, since naively using a standard generative model does not result in meaningful concepts, we devise a reinforcement learning-based preference optimization (RLPO) algorithm that fine-tunes a vision-language generative model from approximate textual descriptions of concepts. Through a series of experiments, we demonstrate our method's ability to efficiently and reliably articulate diverse concepts that are otherwise challenging to craft manually.
Poster
RISHI JINKA · Venkata Sai Mothish Gonugunta · Deepak N. Subramani
[ East Exhibition Hall A-B ]
Abstract
Time-series forecasting finds application across domains such as finance, climate science, and energy systems. We introduce the Conditional Diffusion with Nonlinear Data Transformation Model (CN-Diff), a generative framework that employs novel nonlinear transformations and learnable conditions in the forward process for time series forecasting. A new loss formulation for training is proposed, along with a detailed derivation of both the forward and reverse processes. The new additions improve the diffusion model's capacity to capture complex time series patterns, thus simplifying the reverse process. Our novel condition facilitates learning an efficient prior distribution, which also reduces the gap between the true negative log-likelihood and its variational approximation. CN-Diff is shown to perform better than other leading time series models on nine real-world datasets. Ablation studies are conducted to elucidate the role of each component of CN-Diff.
Poster
Sahil Goyal · Debapriya Tula · Gagan Jain · Pradeep Shenoy · Prateek Jain · Sujoy Paul
[ East Exhibition Hall A-B ]
Abstract
Recent advances in visual generation have made significant strides in producing content of exceptional quality. However, most methods suffer from a fundamental problem - a bottleneck in inference computational efficiency. Most of these algorithms involve multiple passes over a transformer model to generate tokens or denoise inputs, while the model size is kept constant throughout all iterations, which makes generation computationally expensive. In this work, we aim to address this issue primarily through two key ideas - (a) not all parts of the generation process need equal compute, and we design a decode-time model scaling schedule to utilize compute effectively, and (b) we can cache and reuse some of the intermediate computation. Combining these two ideas leads to using smaller models to process more tokens while large models process fewer tokens. These different-sized models do not increase the parameter size, as they share parameters. We rigorously experiment with ImageNet 256$\times$256, UCF101, and Kinetics600 to showcase the efficacy of the proposed method for image/video generation and frame prediction. Our experiments show that with almost $3\times$ less compute than the baseline, our model obtains competitive performance.
Poster
Jaehyeon Kim · Taehong Moon · Keon Lee · Jaewoong Cho
[ East Exhibition Hall A-B ]
Abstract
We introduce ResGen, an efficient Residual Vector Quantization (RVQ)-based generative model for high-fidelity generation with fast sampling. RVQ improves data fidelity by increasing the number of quantization steps, referred to as depth, but deeper quantization typically increases inference steps in generative models. To address this, ResGen directly predicts the vector embedding of collective tokens rather than individual ones, ensuring that inference steps remain independent of RVQ depth. Additionally, we formulate token masking and multi-token prediction within a probabilistic framework using discrete diffusion and variational inference. We validate the efficacy and generalizability of the proposed method on two challenging tasks across different modalities: conditional image generation on ImageNet 256×256 and zero-shot text-to-speech synthesis. Experimental results demonstrate that ResGen outperforms autoregressive counterparts in both tasks, delivering superior performance without compromising sampling speed. Furthermore, as we scale the depth of RVQ, our generative models exhibit enhanced generation fidelity or faster sampling speeds compared to similarly sized baseline models.
Spotlight Poster
Hao Chen · Yujin Han · Fangyi Chen · Xiang Li · Yidong Wang · Jindong Wang · Ze Wang · Zicheng Liu · Difan Zou · Bhiksha Raj
[ East Exhibition Hall A-B ]
Abstract
Recent advances in latent diffusion models have demonstrated their effectiveness for high-resolution image synthesis. However, the properties of the latent space from the tokenizer for better learning and generation of diffusion models remain under-explored. Theoretically and empirically, we find that improved generation quality is closely tied to latent distributions with better structure, such as those with fewer Gaussian mixture modes and more discriminative features. Motivated by these insights, we propose MAETok, an autoencoder (AE) leveraging mask modeling to learn a semantically rich latent space while maintaining reconstruction fidelity. Extensive experiments validate our analysis, demonstrating that the variational form of autoencoders is not necessary, and a discriminative latent space from an AE alone enables state-of-the-art performance on ImageNet generation using only 128 tokens. MAETok achieves significant practical improvements, enabling a gFID of 1.69 with 76× faster training and 31× higher inference throughput for 512×512 generation. Our findings show that the structure of the latent space, rather than variational constraints, is crucial for effective diffusion models. Code and trained models will be released.
Poster
Yongxiang Tang · Yanhua Cheng · Xiaocheng Liu · chenchen Jiao · Yanxiang Zeng · Ning Luo · Pengjia Yuan · Xialong Liu · Peng Jiang
[ East Exhibition Hall A-B ]
Abstract
In many machine learning tasks, it is often necessary for the relationship between input and output variables to be monotonic, including both strictly monotonic and implicitly monotonic relationships. Traditional methods for maintaining monotonicity mainly rely on construction or regularization techniques, whereas this paper shows that the issue of strictly monotonic probability can be viewed as a partial order between an observable revenue variable and a latent cost variable. This perspective enables us to reformulate the monotonicity challenge into modeling the latent cost variable. To tackle this, we introduce a generative network for the latent cost variable, termed the Generative Cost Model (**GCM**), which inherently addresses the strictly monotonic problem, and propose the Implicit Generative Cost Model (**IGCM**) to address the implicitly monotonic problem. We further validate our approach with a numerical simulation of quantile regression and conduct multiple experiments on public datasets, showing that our method significantly outperforms existing monotonic modeling techniques. The code for our experiments can be found at [https://github.com/tyxaaron/GCM](https://github.com/tyxaaron/GCM).
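A sketch of why the latent-cost view gives monotonicity by construction: if the prediction is the probability that a generated cost falls below the observed revenue $r$, it is non-decreasing in $r$ regardless of the generator. The cost generator and the smooth-indicator temperature below are illustrative assumptions, not the paper's architecture.

```python
# Monotone-by-construction probability via a generated latent cost.
import torch
import torch.nn as nn

class GenerativeCostModel(nn.Module):
    def __init__(self, d_ctx=16, n_samples=64, tau=0.1):
        super().__init__()
        self.gen = nn.Sequential(nn.Linear(d_ctx + 1, 32), nn.ReLU(), nn.Linear(32, 1))
        self.n, self.tau = n_samples, tau

    def forward(self, ctx, r):
        # Sample latent costs c = gen(ctx, eps); estimate P(c <= r) smoothly.
        eps = torch.randn(ctx.size(0), self.n, 1)
        c = self.gen(torch.cat([ctx.unsqueeze(1).expand(-1, self.n, -1), eps], -1))
        return torch.sigmoid((r.view(-1, 1, 1) - c) / self.tau).mean(dim=(1, 2))

m = GenerativeCostModel()
print(m(torch.randn(4, 16), torch.tensor([0.1, 0.5, 1.0, 2.0])))
```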
Poster
Peijia Qin · Jianguo Zhang
[ East Exhibition Hall A-B ]
Abstract
Deep neural networks incorporating discrete latent variables have shown significant potential in sequence modeling. A notable approach is to leverage vector quantization (VQ) to generate discrete representations within a codebook. However, its discrete nature prevents the use of standard backpropagation, which has led to challenges in efficient codebook training. In this work, we introduce **Meta-Quantization (MQ)**, a novel vector quantization training framework inspired by meta-learning. Our method separates the optimization of the codebook and the auto-encoder into two levels. Furthermore, we introduce a hyper-net to replace the embedding-parameterized codebook, enabling the codebook to be dynamically generated based on feedback from the auto-encoder. Different from previous VQ objectives, our innovation results in a meta-objective that makes the codebook training task-aware. We validate the effectiveness of MQ with VQVAE and VQGAN architectures on image reconstruction and generation tasks. Experimental results showcase the superior generative performance of MQ, underscoring its potential as a robust alternative to existing VQ methods.
Poster
Zhuowei Li · Haizhou Shi · Yunhe Gao · Di Liu · Zhenting Wang · Yuxiao Chen · Ting Liu · Long Zhao · Hao Wang · Dimitris Metaxas
[ East Exhibition Hall A-B ]
Abstract
Large Vision-Language Models (LVLMs) can reason effectively over both textual and visual inputs, but they tend to hallucinate syntactically coherent yet visually ungrounded contents. In this paper, we investigate the internal dynamics of hallucination by examining the token logit rankings throughout the generation process, revealing three key patterns in how LVLMs process information: (1) *gradual visual information loss* -- visually grounded tokens gradually become less favored throughout generation, (2) *early excitation* -- semantically meaningful tokens achieve peak activation in layers earlier than the final layer, and (3) *hidden genuine information* -- visually grounded tokens that are not ultimately decoded still retain relatively high rankings at inference. Based on these insights, we propose **VISTA** (**V**isual **I**nformation **S**teering with **T**oken-logit **A**ugmentation), a training-free inference-time intervention framework that reduces hallucination while promoting genuine information. VISTA works by combining two complementary approaches: reinforcing visual information in activation space and leveraging early-layer activations to promote semantically meaningful decoding. Compared to existing methods, VISTA requires no external supervision and is applicable to various decoding strategies. Extensive experiments show that VISTA on average reduces hallucination by about 40% on the evaluated open-ended generation task, and it consistently outperforms existing methods on four benchmarks across four architectures under three …
Poster
Yike Yuan · Ziyu Wang · Zihao Huang · Defa Zhu · Xun Zhou · Jingyi Yu · Qiyang Min
[ East Exhibition Hall A-B ]
Abstract
Diffusion models have emerged as a mainstream framework in visual generation. Building upon this success, the integration of Mixture of Experts (MoE) methods has shown promise in enhancing model scalability and performance. In this paper, we introduce Race-DiT, a novel MoE model for diffusion transformers with a flexible routing strategy, Expert Race. By allowing tokens and experts to compete together and select the top candidates, the model learns to dynamically assign experts to critical tokens. Additionally, we propose per-layer regularization to address challenges in shallow-layer learning, and a router similarity loss to prevent mode collapse, ensuring better expert utilization. Extensive experiments on ImageNet validate the effectiveness of our approach, showcasing significant performance gains alongside promising scaling properties.
Poster
Wenke Huang · Jian Liang · Zekun Shi · Didi Zhu · Guancheng Wan · He Li · Bo Du · Dacheng Tao · Mang Ye
[ East Exhibition Hall A-B ]
Abstract
Multimodal Large Language Model (MLLM) has demonstrated strong generalization capabilities across diverse distributions and tasks, largely due to extensive pre-training datasets. Fine-tuning MLLM has become a common practice to improve performance on specific downstream tasks. However, during fine-tuning, MLLM often faces the risk of forgetting knowledge acquired during pre-training, which can result in a decline in generalization abilities. To balance the trade-off between generalization and specialization, we propose measuring the parameter importance for both pre-trained and fine-tuning distributions, based on frozen pre-trained weight magnitude and accumulated fine-tuning gradient values. We further apply an importance-aware weight allocation strategy, selectively updating relatively important parameters for downstream tasks. We conduct empirical evaluations on both image captioning and visual question-answering tasks using various MLLM architectures. The comprehensive experimental analysis demonstrates the effectiveness of the proposed solution, highlighting the efficiency of the crucial modules in enhancing downstream specialization performance while mitigating generalization degradation in MLLM Fine-Tuning.
Poster
Lang Feng · Weihao Tan · Zhiyi Lyu · Longtao Zheng · Haiyang Xu · Ming Yan · Fei Huang · Bo An
[ East Exhibition Hall A-B ]
Abstract
Online fine-tuning vision-language model (VLM) agents with reinforcement learning (RL) has shown promise for equipping agents with multi-step, goal-oriented capabilities in dynamic environments. However, their open-ended textual action space and non-end-to-end nature of action generation present significant challenges to effective online exploration in RL, e.g., explosion of the exploration space. We propose a novel online fine-tuning method, Counterfactual Soft Reinforcement Learning (CoSo), better suited to the textual output space of VLM agents. Compared to prior methods that assign uniform uncertainty to all tokens, CoSo leverages counterfactual reasoning to dynamically assess the causal influence of individual tokens on post-processed actions. By prioritizing the exploration of action-critical tokens while reducing the impact of semantically redundant or low-impact tokens, CoSo enables a more targeted and efficient online rollout process. We provide theoretical analysis proving CoSo's convergence and policy improvement guarantees, and extensive empirical evaluations supporting CoSo's effectiveness. Our results across a diverse set of agent tasks, including Android device control, card gaming, and embodied AI, highlight its remarkable ability to enhance exploration efficiency and deliver consistent performance gains. The code is available at https://github.com/langfengQ/CoSo.
Poster
Xianhang Li · Haoqin Tu · Mude Hui · Zeyu Wang · Bingchen Zhao · Junfei Xiao · Sucheng Ren · Jieru Mei · Qing Liu · Huangjie Zheng · Yuyin Zhou · Cihang Xie
[ East Exhibition Hall A-B ]
Abstract
Web-crawled image-text pairs are inherently noisy. Prior studies demonstrate that semantically aligning and enriching textual descriptions of these pairs can significantly enhance model training across various vision-language tasks, particularly text-to-image generation. However, large-scale investigations in this area remain predominantly closed-source. Our paper aims to bridge this community effort, leveraging the powerful and $\textit{open-sourced}$ LLaMA-3, a GPT-4 level LLM. Our recaptioning pipeline is simple: first, we fine-tune a LLaMA-3-8B powered LLaVA-1.5 and then employ it to recaption ~1.3 billion images from the DataComp-1B dataset. Our empirical results confirm that this enhanced dataset, Recap-DataComp-1B, offers substantial benefits in training advanced vision-language models. For discriminative models like CLIP, we observe an average of 3.1% enhanced zero-shot performance across four cross-modal retrieval tasks using a mixed set of the original and our captions. For generative models like text-to-image Diffusion Transformers, the generated images exhibit a significant improvement in alignment with users' text instructions, especially in following complex queries. Our project page is https://www.haqtu.me/Recap-Datacomp-1B/.
Poster
Dongliang Guo · Mengxuan Hu · Zihan Guan · Thomas Hartvigsen · Sheng Li
[ East Exhibition Hall A-B ]
Abstract
Large multi-modal models inevitably decay over time as facts update and previously learned information becomes outdated. Traditional approaches such as fine-tuning are often impractical for updating these models due to their size and complexity. Instead, direct knowledge editing within the models presents a more viable solution. Current model editing techniques, however, typically overlook the unique influence ranges of different facts, leading to compromised model performance in terms of both generality and locality. To address this issue, we introduce the concept of the generality-locality trade-off in multi-modal model editing. We develop a new model editing dataset named OKEDIT, specifically designed to effectively evaluate this trade-off. Building on this foundation, we propose \textbf{BalancEdit}, a novel method for balanced model editing that dynamically achieves an optimal balance between generality and locality. BalancEdit utilizes a unique mechanism that generates both positive and negative samples for each fact to accurately determine its influence scope and incorporates these insights into the model's latent space using a discrete, localized codebook of edits, without modifying the underlying model weights. To our knowledge, this is the first approach explicitly addressing the generality-locality trade-off in multi-modal model editing. Our comprehensive results confirm the effectiveness of BalancEdit, demonstrating minimal trade-offs while …
Poster
Mikołaj Małkiński · Szymon Pawlonka · Jacek Mańdziuk
[ East Exhibition Hall A-B ]
Abstract
Abstract visual reasoning (AVR) involves discovering shared concepts across images through analogy, akin to solving IQ test problems. Bongard Problems (BPs) remain a key challenge in AVR, requiring both visual reasoning and verbal description. We investigate whether multimodal large language models (MLLMs) can solve BPs by formulating a set of diverse MLLM-suited solution strategies and testing $4$ proprietary and $4$ open-access models on $3$ BP datasets featuring synthetic (classic BPs) and real-world (Bongard HOI and Bongard-OpenWorld) images. Despite some successes on real-world datasets, MLLMs struggle with synthetic BPs. To explore this gap, we introduce Bongard-RWR, a dataset representing synthetic BP concepts using real-world images. Our findings suggest that weak MLLM performance on classical BPs is not due to the domain specificity, but rather comes from their general AVR limitations. Code and dataset are available at: https://github.com/pavonism/bongard-rwr
Poster
Antonia Wüst · Tim Woydt · Lukas Helff · Inga Ibs · Wolfgang Stammer · Devendra Dhami · Constantin Rothkopf · Kristian Kersting
[ East Exhibition Hall A-B ]
Abstract
Recently, newly developed Vision-Language Models (VLMs), such as OpenAI's o1, have emerged, seemingly demonstrating advanced reasoning capabilities across text and image modalities. However, the depth of these advances in language-guided perception and abstract reasoning remains underexplored, and it is unclear whether these models can truly live up to their ambitious promises. To assess the progress and identify shortcomings, we enter the wonderland of Bongard problems, a set of classic visual reasoning puzzles that require human-like abilities of pattern recognition and abstract reasoning. With our extensive evaluation setup, we show that while VLMs occasionally succeed in identifying discriminative concepts and solving some of the problems, they frequently falter. Surprisingly, even elementary concepts that may seem trivial to humans, such as simple spirals, pose significant challenges. Moreover, when explicitly asked to recognize ground truth concepts, they continue to falter, suggesting not only a lack of understanding of these elementary visual concepts but also an inability to generalize to unseen concepts. We compare the results of VLMs to human performance and observe that a significant gap remains between human visual reasoning capabilities and machine cognition.
Poster
Yanbo Wang · Xiyuan Wang · Quan Gan · Minjie Wang · Qibin Yang · David Wipf · Muhan Zhang
[ East Exhibition Hall A-B ]
Abstract
We introduce Griffin, the first attempt at a foundation model designed specifically for Relational Databases (RDBs). Unlike previous smaller models focused on single RDB tasks, Griffin unifies the data encoder and task decoder to handle diverse tasks. Additionally, we enhance the architecture by incorporating a cross-attention module and a novel aggregator. Griffin utilizes pretraining on both single-table and RDB datasets, employing advanced encoders for categorical, numerical, and metadata features, along with innovative components such as cross-attention modules and enhanced message-passing neural networks (MPNNs) to capture the complexities of relational data. Evaluated on large-scale, heterogeneous, and temporal graphs extracted from RDBs across various domains (spanning over 150 million nodes), Griffin demonstrates superior or comparable performance to individually trained models, excels in low-data scenarios, and shows strong transferability with similarity and diversity in pretraining across new datasets and tasks, highlighting its potential as a universally applicable foundation model for RDBs. Code available at https://github.com/yanxwb/Griffin.
Poster
Weimin Wu · Teng-Yun Hsiao · Jerry Yao-Chieh Hu · Wenxin Zhang · Han Liu
[ East Exhibition Hall A-B ]
Abstract
We provide an exactly solvable example for interpreting In-Context Learning (ICL) with one-layer attention models as conditional retrieval of dense associative memory models. Our main contribution is to interpret ICL as memory reshaping in the modern Hopfield model from a conditional memory set (in-context examples). Specifically, we show that the in-context sequential examples induce an effective reshaping of the energy landscape of a Hopfield model. We integrate this in-context memory reshaping phenomenon into the existing Bayesian model averaging view of ICL [Zhang et al., AISTATS 2025] via the established equivalence between the modern Hopfield model and transformer attention. Under this unique perspective, we not only characterize how in-context examples shape predictions in the Gaussian linear regression case, but also recover the known $\epsilon$-stability generalization bound of the ICL for the one-layer attention model. We also give explanations for three key behaviors of ICL and validate them through experiments.
Spotlight Poster
Yu Sun · Xinhao Li · Karan Dalal · Jiarui Xu · Arjun Vikram · Genghan Zhang · Yann Dubois · Xinlei Chen · Xiaolong Wang · Sanmi Koyejo · Tatsunori Hashimoto · Carlos Guestrin
[ East Exhibition Hall A-B ]
Abstract
Self-attention performs well in long context but has quadratic complexity. Existing RNN layers have linear complexity, but their performance in long context is limited by the expressive power of their hidden states. We present a practical framework for instantiating sequence modeling layers with linear complexity and expressive hidden states. The key idea is to make the hidden state a machine learning model itself, and the update rule a step of self-supervised learning. Since the hidden state is updated by training even on test sequences, our layers are called Test-Time Training (TTT) layers. We consider two instantiations: TTT-Linear and TTT-MLP, whose hidden state is a linear model and a two-layer MLP respectively. We evaluate our instantiations at the scale of 125M to 1.3B parameters, comparing with a strong Transformer and Mamba, a modern RNN. Similar to Transformer, TTT-Linear and TTT-MLP can keep reducing perplexity by conditioning on more tokens, while Mamba cannot after 16k context. TTT-MLP still faces challenges in memory I/O, but shows larger potential in long context, pointing to a promising direction for future research.
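A minimal TTT-Linear-style layer sketched from the abstract: the hidden state is itself a linear model, updated at every token by one gradient step on a self-supervised reconstruction loss, even at test time. The corruption used as the self-supervised task below is an assumption for illustration.

```python
# Sketch of a Test-Time Training (TTT) layer with a linear-model hidden state.
import torch

def ttt_linear(tokens, lr=0.1):
    d = tokens.size(-1)
    W = torch.zeros(d, d)                          # hidden state = a linear model
    outputs = []
    for x in tokens:                               # scan over the sequence
        x_corrupt = 0.5 * x                        # assumed self-supervised corruption
        err = x_corrupt @ W - x                    # reconstruction error
        W = W - lr * torch.outer(x_corrupt, err)   # one gradient step = the update rule
        outputs.append(x @ W)                      # output uses the updated model
    return torch.stack(outputs)

print(ttt_linear(torch.randn(10, 8)).shape)
```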
Poster
Wanjin Feng · Xingyu Gao · Wenqian Du · Hailong Shi · Peilin Zhao · Pengcheng Wu · Chunyan Miao
[ East Exhibition Hall A-B ]
Abstract
Spiking Neural Networks (SNNs) often suffer from high time complexity $O(T)$ due to the sequential processing of $T$ spikes, making training computationally expensive. In this paper, we propose a novel Fixed-point Parallel Training (FPT) method to accelerate SNN training without modifying the network architecture or introducing additional assumptions. FPT reduces the time complexity to $O(K)$, where $K$ is a small constant (usually $K=3$), by using a fixed-point iteration form of Leaky Integrate-and-Fire (LIF) neurons for all $T$ timesteps. We provide a theoretical convergence analysis of FPT and demonstrate that existing parallel spiking neurons can be viewed as special cases of our approach. Experimental results show that FPT effectively simulates the dynamics of original LIF neurons, significantly reducing computational time without sacrificing accuracy. This makes FPT a scalable and efficient solution for real-world applications, particularly for long-duration simulations.
Poster
Łukasz Struski · Michal Bednarczyk · Igor Podolak · Jacek Tabor
[ East Exhibition Hall A-B ]
Abstract
We present a novel technique for constructing differentiable order-type operations, including soft ranking, soft top-k selection, and soft permutations. Our approach leverages an efficient closed-form formula for the inverse of the function LapSum, defined as the sum of Laplace distributions. This formulation ensures low computational and memory complexity in selecting the highest activations, enabling losses and gradients to be computed in $O(n \log n)$ time. Through extensive experiments, we demonstrate that our method outperforms state-of-the-art techniques for high-dimensional vectors and large $k$ values. Furthermore, we provide efficient implementations for both CPU and CUDA environments, underscoring the practicality and scalability of our method for large-scale ranking and differentiable ordering problems.
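A sketch of soft top-k selection in the spirit of the abstract: choose a threshold so that the Laplace-CDF soft indicators sum to $k$, then return those indicators as differentiable membership weights. The paper derives a closed-form inverse of LapSum; simple bisection is substituted here purely for illustration.

```python
# Illustrative soft top-k with Laplace CDFs (bisection stands in for the
# paper's closed-form threshold).
import torch

def laplace_cdf(t):
    return torch.where(t < 0, 0.5 * torch.exp(t), 1 - 0.5 * torch.exp(-t))

def soft_topk(x, k, scale=0.1, iters=50):
    lo, hi = x.min() - 10 * scale, x.max() + 10 * scale
    for _ in range(iters):                   # bisection on the threshold b
        b = (lo + hi) / 2
        mass = laplace_cdf((x - b) / scale).sum()
        lo, hi = (b, hi) if mass > k else (lo, b)
    return laplace_cdf((x - b) / scale)      # soft membership weights in [0, 1]

w = soft_topk(torch.tensor([3.0, 1.0, 2.5, 0.2]), k=2)
print(w, w.sum())
```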
Poster
Zhangchi Zhao · Jun Shu · Deyu Meng · Zongben Xu
[ East Exhibition Hall A-B ]
Abstract
Inspired by the Kolmogorov-Arnold representation theorem, KANs offer a novel framework for function approximation by replacing traditional neural network weights with learnable univariate functions. This design demonstrates significant potential as an efficient and interpretable alternative to traditional MLPs. However, KANs are characterized by a substantially larger number of trainable parameters, leading to challenges in memory efficiency and higher training costs compared to MLPs. To address this limitation, we propose to generate weights for KANs via a smaller meta-learner, called MetaKANs. By training KANs and MetaKANs in an end-to-end differentiable manner, MetaKANs achieve comparable or even superior performance while significantly reducing the number of trainable parameters and maintaining promising interpretability. Extensive experiments on diverse benchmark tasks, including symbolic regression, partial differential equation solving, and image classification, demonstrate the effectiveness of MetaKANs in improving parameter efficiency and memory usage. The proposed method provides an alternative technique for training KANs that allows for greater scalability and extensibility, and narrows the training-cost gap with MLPs noted in the original KAN paper. Our code is available at \url{https://github.com/Murphyzc/MetaKAN}.
Poster
Yuanzhe Hu · Kinshuk Goel · Vlad Killiakov · Yaoqing Yang
[ East Exhibition Hall A-B ]
Abstract
Diagnosing deep neural networks (DNNs) through the eigenspectrum of weight matrices has been an active area of research in recent years. At a high level, eigenspectrum analysis of DNNs involves measuring the heavytailness of the empirical spectral densities (ESD) of weight matrices. It provides insight into how well a model is trained and can guide decisions on assigning better layer-wise training hyperparameters. In this paper, we address a challenge associated with such eigenspectrum methods: the impact of the aspect ratio of weight matrices on estimated heavytailness metrics. We demonstrate that matrices of varying sizes (and aspect ratios) introduce a non-negligible bias in estimating heavytailness metrics, leading to inaccurate model diagnosis and layer-wise hyperparameter assignment. To overcome this challenge, we propose FARMS (Fixed-Aspect-Ratio Matrix Subsampling), a method that normalizes the weight matrices by subsampling submatrices with a fixed aspect ratio. Instead of measuring the heavytailness of the original ESD, we measure the average ESD of these subsampled submatrices. We show that measuring the heavytailness of these submatrices with the fixed aspect ratio can effectively mitigate the aspect ratio bias. We validate our approach across various optimization techniques and application domains that involve eigenspectrum analysis of weights, including image classification in computer …
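A sketch of the FARMS idea described above: instead of the ESD of the full weight matrix, average the ESDs of row-subsampled submatrices with a fixed aspect ratio, then estimate a heavy-tail exponent. The Hill estimator below is one common tail estimator, used here for illustration rather than as the paper's exact metric.

```python
# Fixed-aspect-ratio subsampling before eigenspectrum analysis.
import numpy as np

def farms_alpha(W, aspect=1.0, n_sub=10, tail=0.1, seed=0):
    rng = np.random.default_rng(seed)
    m, n = W.shape
    rows = int(aspect * n)                       # fix rows/cols = aspect
    eigs = []
    for _ in range(n_sub):
        idx = rng.choice(m, size=min(rows, m), replace=False)
        S = W[idx]
        eigs.append(np.linalg.svd(S, compute_uv=False) ** 2)  # ESD of S S^T
    lam = np.sort(np.concatenate(eigs))
    k = max(2, int(tail * len(lam)))             # use the largest eigenvalues
    top = lam[-k:]
    return 1 + k / np.sum(np.log(top / top[0]))  # Hill estimate of the tail exponent

print(farms_alpha(np.random.default_rng(1).standard_normal((300, 100))))
```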
Poster
Quan Xiao · Hui Yuan · A F M Saif · Gaowen Liu · Ramana Kompella · Mengdi Wang · Tianyi Chen
[ East Exhibition Hall A-B ]
Abstract
Diffusion models, which iteratively denoise data samples to synthesize high-quality outputs, have achieved empirical success across domains. However, optimizing these models for downstream tasks often involves nested bilevel structures, such as tuning hyperparameters for fine-tuning tasks or noise schedules in training dynamics, where traditional bilevel methods fail due to the infinite-dimensional probability space and prohibitive sampling costs. We formalize this challenge as a generative bilevel optimization problem and address two key scenarios: (1) fine-tuning pre-trained models via an inference-only lower-level solver paired with a sample-efficient gradient estimator for the upper level, and (2) training diffusion model from scratch with noise schedule optimization by reparameterizing the lower-level problem and designing a computationally tractable gradient estimator. Our first-order bilevel framework overcomes the incompatibility of conventional bilevel methods with diffusion processes, offering theoretical grounding and computational practicality. Experiments demonstrate that our method outperforms existing fine-tuning and hyperparameter search baselines.
Spotlight Poster
Thomas Pethick · Wanyun Xie · Kimon Antonakopoulos · Zhenyu Zhu · Antonio Silveti-Falls · Volkan Cevher
[ East Exhibition Hall A-B ]
Abstract
In this work, we study optimization methods that leverage the linear minimization oracle (LMO) over a norm-ball. We propose a new stochastic family of algorithms that uses the LMO to adapt to the geometry of the problem and, perhaps surprisingly, show that they can be applied to unconstrained problems. The resulting update rule unifies several existing optimization methods under a single framework. Furthermore, we propose an explicit choice of norm for deep architectures, which, as a side benefit, leads to the transferability of hyperparameters across model sizes. Experimentally, we demonstrate significant speedups on nanoGPT training without any reliance on Adam. The proposed method is memory-efficient, requiring only one set of model weights and one set of gradients, which can be stored in half-precision.
Poster
Matteo Saponati · Pascal J. Sager · Pau Vilimelis Aceituno · Thilo Stadelmann · Benjamin F. Grewe
[ East Exhibition Hall A-B ]
Abstract
Self-attention is essential to Transformer architectures, yet how information is embedded in the self-attention matrices and how different objective functions impact this process remains unclear. We present a mathematical framework to analyze self-attention matrices by deriving the structures governing their weight updates. Using this framework, we demonstrate that bidirectional training induces symmetry in the weight matrices, while autoregressive training results in directionality and column dominance. Our theoretical findings are validated across multiple Transformer models — including ModernBERT, GPT, LLaMA3, and Mistral — and input modalities like text, vision, and audio. Finally, we apply these insights by showing that symmetric initialization improves the performance of encoder-only models on language tasks. This mathematical analysis offers a novel theoretical perspective on how information is embedded through self-attention, thereby improving the interpretability of Transformer models.
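A sketch of the symmetry diagnostic the abstract suggests: score how symmetric the combined attention matrix $M = W_Q W_K^\top$ is, where 1 means fully symmetric and 0 fully antisymmetric (random matrices land near 0.5, since $\|M\|_F^2$ splits orthogonally into symmetric and antisymmetric parts). The symmetric initialization mentioned above corresponds to starting with $W_Q = W_K$.

```python
# Symmetry score of the combined query-key matrix (illustrative diagnostic).
import torch

def symmetry_score(W_q, W_k):
    M = W_q @ W_k.T
    sym = 0.5 * (M + M.T)
    return (sym.norm() ** 2 / M.norm() ** 2).item()

W = torch.randn(64, 64)
print(symmetry_score(W, W))                                      # W_Q = W_K -> 1.0
print(symmetry_score(torch.randn(64, 64), torch.randn(64, 64)))  # random -> ~0.5
```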
Poster
Aaditya Singh · Ted Moskovitz · Sara Dragutinović · Felix Hill · Stephanie Chan · Andrew Saxe
[ East Exhibition Hall A-B ]
Abstract
In-context learning (ICL) is a powerful ability that emerges in transformer models, enabling them to learn from context without weight updates. Recent work has established emergent ICL as a transient phenomenon that can sometimes disappear after long training times. In this work, we sought a mechanistic understanding of these transient dynamics. Firstly, we find that—after the disappearance of ICL—the asymptotic strategy is a remarkable hybrid between in-weights and in-context learning, which we term “context-constrained in-weights learning” (CIWL). CIWL is in competition with ICL, and eventually replaces it as the dominant strategy of the model (thus leading to ICL transience). However, we also find that the two competing strategies actually share sub-circuits, which gives rise to cooperative dynamics as well. For example, in our setup, ICL is unable to emerge quickly on its own, and can only be enabled through the simultaneous slow development of asymptotic CIWL. CIWL thus both cooperates and competes with ICL, a phenomenon we term “strategy coopetition”. We propose a minimal mathematical model that reproduces these key dynamics and interactions. Informed by this model, we were able to identify a setup where ICL is truly emergent and persistent.
Poster
Yuan Tian · Tianyi Zhang
[ East Exhibition Hall A-B ]
Abstract
Recent advances in large language models (LLMs) have transformed software development by automatically generating code from natural language. Yet challenges remain in generating fully correct code that aligns with user intent. Our study reveals that LLMs tend to pay less attention to user prompts as more code tokens are generated. We hypothesize that this attention dilution issue is an important reason for code generation errors. To mitigate this issue, we propose ***S**elective **P**rompt **A**nchoring* (SPA) to guide code LLMs to pay more attention to user intent when generating code. We evaluate SPA using six base LLMs across six benchmarks. Our results demonstrate that SPA enhances Pass@1 by up to 12.9%, consistently outperforming SOTA code generation methods in all settings. Our code is available at https://github.com/magic-YuanTian/Selective-Prompt-Anchoring.
Poster
Kevin Xu · Issei Sato
[ East Exhibition Hall A-B ]
Abstract
Looped Transformers provide advantages in parameter efficiency, computational capabilities, and generalization for reasoning tasks. However, their expressive power regarding function approximation remains underexplored. In this paper, we establish the approximation rate of Looped Transformers by defining the modulus of continuity for sequence-to-sequence functions. This reveals a limitation specific to the looped architecture. That is, the analysis prompts the incorporation of scaling parameters for each loop, conditioned on timestep encoding. Experiments validate the theoretical results, showing that increasing the number of loops enhances performance, with further gains achieved through the timestep encoding.
Poster
Aditya Desai · Shuo Yang · Alejandro Cuadron · Matei Zaharia · Joseph E Gonzalez · Ion Stoica
[ East Exhibition Hall A-B ]
Abstract
Leveraging long contexts is crucial for advanced AI systems, but attention computation poses a scalability challenge. While scaled dot-product attention (SDPA) exhibits token sparsity, i.e., only a few pivotal tokens contribute significantly to the output, exploiting this sparsity remains challenging. Existing methods either suffer from quality degradation or require substantial additional resources. We show that identifying pivotal tokens is a Maximum Inner Product Search (MIPS) problem. However, existing MIPS solutions are not well-suited for SDPA, as they are not GPU-friendly and often underperform due to the separated query and key distributions. This paper introduces HashAttention, framing pivotal token identification as a recommendation problem. Given a query, HashAttention encodes keys and queries in Hamming space, capturing the required semantic similarity, using learned mapping functions. HashAttention efficiently identifies pivotal tokens for a given query using bitwise operations and computes attention using only these tokens, improving the overall attention efficiency. Trained on generic data, HashAttention reduces tokens used by up to $16\times$ with minimal quality loss, requiring only 32 bits of auxiliary memory per token. Sparsity can be further improved to $32\times$ through task-specific fine-tuning. On A100 GPU, at $32\times$ sparsity, incorporating HashAttention reduces attention latency by up to $4.3\times$ in GPT-FAST and $2.54\times$ …
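A sketch of the recipe described above: learned maps send keys and queries to sign bits in Hamming space, the pivotal keys for a query are those at the smallest Hamming distance, and attention runs over only those tokens. The linear maps and bit width below are illustrative; a real kernel would pack bits into integers and use popcount.

```python
# Illustrative HashAttention-style sparse attention for a single query.
import torch
import torch.nn.functional as F

def hash_attention(q, K, V, map_q, map_k, top=8):
    qb = (map_q(q) > 0).float()                  # query bits, shape (bits,)
    Kb = (map_k(K) > 0).float()                  # key bits, shape (n, bits)
    hamming = (qb * (1 - Kb) + (1 - qb) * Kb).sum(-1)
    idx = hamming.topk(top, largest=False).indices       # pivotal tokens
    w = F.softmax(q @ K[idx].T / K.size(-1) ** 0.5, dim=-1)
    return w @ V[idx]

d, bits, n = 32, 16, 128
mq, mk = torch.nn.Linear(d, bits), torch.nn.Linear(d, bits)
out = hash_attention(torch.randn(d), torch.randn(n, d), torch.randn(n, d), mq, mk)
print(out.shape)
```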
Poster
Antoine Gonon · Léon Zheng · Pascal Carrivain · TUNG LE
[ East Exhibition Hall A-B ]
Abstract
Kronecker-sparse (KS) matrices—whose supports are Kronecker products of identity and all-ones blocks—underpin the structure of Butterfly and Monarch matrices and offer the promise of more efficient models. However, existing GPU kernels for KS matrix multiplication suffer from high data movement costs, with up to 50% of time spent on memory-bound tensor permutations. We propose a fused, output-stationary GPU kernel that eliminates these overheads, reducing global memory traffic threefold. Across 600 KS patterns, our kernel achieves a median FP32 speedup of 1.4x and lowers energy consumption by 15%. A simple heuristic based on KS pattern parameters predicts when our method outperforms existing ones. We release all code at [github.com/PascalCarrivain/ksmm](https://github.com/PascalCarrivain/ksmm), including a PyTorch-compatible *KSLinear* layer, and demonstrate in FP32 end-to-end latency reductions of up to 22% in ViT-S/16 and 16% in GPT-2 medium.
Poster
Yucheng Xie · Fu Feng · Ruixiao Shi · Jing Wang · Yong Rui · Xin Geng
[ East Exhibition Hall A-B ]
Abstract
Pre-trained models have become the preferred backbone due to the increasing complexity of model parameters. However, traditional pre-trained models often face deployment challenges due to their fixed sizes, and are prone to negative transfer when discrepancies arise between training tasks and target tasks. To address this, we propose **KIND**, a novel pre-training method designed to construct decomposable models. KIND integrates knowledge by incorporating Singular Value Decomposition (SVD) as a structural constraint, with each basic component represented as a combination of a column vector, singular value, and row vector from the $U$, $\Sigma$, and $V^\top$ matrices. These components are categorized into **learngenes** for encapsulating class-agnostic knowledge and **tailors** for capturing class-specific knowledge, with knowledge diversion facilitated by a class gate mechanism during training. Extensive experiments demonstrate that models pre-trained with KIND can be decomposed into learngenes and tailors, which can be adaptively recombined for diverse resource-constrained deployments. Moreover, for tasks with large domain shifts, transferring only learngenes with task-agnostic knowledge, when combined with randomly initialized tailors, effectively mitigates domain shifts. Code will be made available at https://github.com/Te4P0t/KIND.
Poster
Abhishek Tyagi · Arjun Iyer · William Renninger · Christopher Kanan · Yuhao Zhu
[ East Exhibition Hall A-B ]
Abstract
Recent advances in Dynamic Sparse Training (DST) have pushed the frontier of sparse neural network training in structured and unstructured contexts, matching dense-model performance while drastically reducing parameter counts to facilitate model scaling. However, unstructured sparsity often fails to translate into practical speedups on modern hardware. To address this shortcoming, we propose DynaDiag, a novel structured sparse-to-sparse DST method that performs on par with unstructured sparsity. DynaDiag enforces a diagonal sparsity pattern throughout training and preserves sparse computation in forward and backward passes. We further leverage the diagonal structure to accelerate computation via a custom CUDA kernel, rendering the method hardware-friendly. Empirical evaluations on diverse neural architectures demonstrate that our method maintains accuracy on par with unstructured counterparts while benefiting from tangible computational gains. Notably, with 90\% sparse linear layers in ViTs, we observe up to a 3.13x speedup in online inference without sacrificing model performance and a 1.59x speedup in training on a GPU compared to equivalent unstructured layers.
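A sketch of a diagonal-sparse linear layer in the spirit of the abstract: the weight matrix is restricted to a few (wrapped) diagonals, so parameters and compute scale with $k \cdot d$ instead of $d^2$. The offsets are fixed here for illustration; a DST method would update which diagonals are active during training.

```python
# Illustrative diagonal-sparse linear layer (no d x d matmul needed).
import torch
import torch.nn as nn

class DiagLinear(nn.Module):
    def __init__(self, d, offsets=(0, 1, 7)):
        super().__init__()
        self.offsets = offsets
        self.diags = nn.Parameter(torch.randn(len(offsets), d) / d ** 0.5)

    def forward(self, x):
        # y[i] = sum over offsets o of diag_o[i] * x[(i + o) mod d]
        out = 0
        for w, o in zip(self.diags, self.offsets):
            out = out + w * torch.roll(x, shifts=-o, dims=-1)
        return out

layer = DiagLinear(16)
print(layer(torch.randn(4, 16)).shape)
```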
Poster
Kaixuan Zhang · Hu Wang · Minxian Li · Mingwu Ren · Mao Ye · Xiatian Zhu
[ East Exhibition Hall A-B ]
Abstract
High Dynamic Range Novel View Synthesis (HDR-NVS) aims to establish a 3D scene HDR model from Low Dynamic Range (LDR) imagery. Typically, multiple-exposure LDR images are employed to capture a wider range of brightness levels in a scene, as a single LDR image cannot represent both the brightest and darkest regions simultaneously. While effective, this multiple-exposure HDR-NVS approach has significant limitations, including susceptibility to motion artifacts (e.g., ghosting and blurring) and high capture and storage costs. To overcome these challenges, we introduce, for the first time, the single-exposure HDR-NVS problem, where only single-exposure LDR images are available during training. We further introduce a novel approach, Mono-HDR-3D, featuring two dedicated modules formulated by LDR image formation principles, one for converting LDR colors to HDR counterparts, and the other for transforming HDR images to LDR format so that unsupervised learning is enabled in a closed loop. Designed as a meta-algorithm, our approach can be seamlessly integrated with existing NVS models. Extensive experiments show that Mono-HDR-3D significantly outperforms previous methods. Source code is released at https://github.com/prinasi/Mono-HDR-3D.
Poster
Sidak Pal Singh · Hossein Mobahi · Atish Agarwala · Yann Nicolas Dauphin
[ East Exhibition Hall A-B ]
Abstract
Curvature regularization techniques like Sharpness Aware Minimization (SAM) have shown great promise in improving generalization on vision tasks. However, we find that SAM performs poorly in domains like natural language processing (NLP), often degrading performance, even with twice the compute budget. We investigate the discrepancy across domains and find that in the NLP setting, SAM is dominated by regularization of the logit statistics, instead of improving the geometry of the function itself. We use this observation to develop an alternative algorithm we call Functional SAM, which regularizes curvature only through modification of the statistics of the overall function implemented by the neural network, and avoids spurious minimization through logit manipulation. Furthermore, we argue that preconditioning the SAM perturbation also prevents spurious minimization, and when combined with Functional SAM, it gives further improvements. Our proposed algorithms show improved performance over AdamW and SAM baselines when trained for an equal number of steps, in both fixed-length and Chinchilla-style training settings, at various model scales (including billion-parameter scale). On the whole, our work highlights the importance of more precise characterizations of sharpness in broadening the applicability of curvature regularization to large language models (LLMs).
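For reference, the standard SAM step that the abstract builds on: ascend along the normalized gradient to a nearby "sharp" point, then descend using the gradient computed there. The paper's Functional SAM changes what the perturbation regularizes (function-space curvature rather than logit statistics); that modification is not reproduced here, and `loss_fn` is an assumed callable that evaluates the loss on a batch.

```python
# Baseline SAM update (two forward/backward passes per step).
import torch

def sam_step(model, loss_fn, opt, rho=0.05):
    opt.zero_grad()
    loss_fn(model).backward()
    grads = [p.grad.clone() for p in model.parameters()]
    norm = torch.sqrt(sum(g.pow(2).sum() for g in grads))
    with torch.no_grad():                        # ascend to the perturbed point
        for p, g in zip(model.parameters(), grads):
            p.add_(rho * g / (norm + 1e-12))
    opt.zero_grad()
    loss_fn(model).backward()                    # gradient at the perturbed weights
    with torch.no_grad():                        # undo the perturbation, then step
        for p, g in zip(model.parameters(), grads):
            p.sub_(rho * g / (norm + 1e-12))
    opt.step()
```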
Poster
Guoqiang Zhang · John Lewis · W. Bastiaan Kleijn
[ East Exhibition Hall A-B ]
Abstract
In this work we present the BDIA-transformer, which is an exact bit-level reversible transformer that uses an unchanged standard architecture for inference. The basic idea is to first treat each transformer block as the Euler integration approximation for solving an ordinary differential equation (ODE) and then incorporate the technique of bidirectional integration approximation (BDIA) (originally designed for diffusion inversion) into the neural architecture, together with activation quantization to make it exactly bit-level reversible. In the training process, we let a hyper-parameter $\gamma$ in BDIA-transformer randomly take one of the two values $\{0.5, -0.5\}$ per training sample per transformer block for averaging every two consecutive integration approximations. As a result, BDIA-transformer can be viewed as training an ensemble of ODE solvers parameterized by a set of binary random variables, which regularizes the model and results in improved validation accuracy. Lightweight side information is required to be stored in the forward process to account for binary quantization loss to enable exact bit-level reversibility. In the inference procedure, the expectation $\mathbb{E}(\gamma)=0$ is taken to make the resulting architecture identical to transformer up to activation quantization. Our experiments in natural language generation, image classification, and language translation show that BDIA-transformers outperform their conventional counterparts …
Poster
Anvith Thudi · Evianne Rovers · Yangjun Ruan · Tristan Thrush · Chris Maddison
[ East Exhibition Hall A-B ]
Abstract
Modern machine learning pipelines are increasingly combining and mixing data from diverse and disparate sources, e.g., pre-training large language models. Yet, finding the optimal data mixture is a challenging and open problem. We formalize this data mixing problem as a bi-level objective: the best mixture is the one that would lead to the best model for a downstream objective. Unfortunately, this objective is generally intractable. In this paper, we make the observation that the bi-level data mixing objective becomes convex as our model class becomes larger. We develop and study a gradient-based approach for optimizing this convex objective, which we call MixMin, and test it on language modeling and chemistry tasks. MixMin was the only method that uniformly improved the data mixture in all our experiments. With MixMin, we improved the data mixture using less than 0.2% additional compute for a pythia-$410M$ model trained on $8.2B$ tokens, resulting in a 1-5% relative improvement in negative log likelihood on PIQA, ARC Easy, SciQ, and OpenWebMath. Crucially, we found that MixMin mixtures for smaller models improved training of larger models, suggesting that MixMin mixtures may be scale-invariant. When mixing bioassay data to train an XGBoost model, we saw improvements to average precision scores …
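One plausible reading of the convex objective described above: with per-source proxy models fixed, find simplex weights minimizing the downstream negative log-likelihood of the mixture of their predictive distributions, which is convex in the weights. The setup below (per-source probabilities of the correct downstream label) is an illustrative assumption, not the paper's exact pipeline.

```python
# Gradient-based optimization of mixture weights on the simplex.
import torch

def mixmin_weights(probs, steps=200, lr=0.5):
    # probs: (n_sources, n_examples) of P_source(correct downstream label)
    logits = torch.zeros(probs.size(0), requires_grad=True)
    opt = torch.optim.Adam([logits], lr=lr)
    for _ in range(steps):
        w = torch.softmax(logits, dim=0)         # mixture weights on the simplex
        nll = -torch.log((w[:, None] * probs).sum(0)).mean()
        opt.zero_grad(); nll.backward(); opt.step()
    return torch.softmax(logits, dim=0).detach()

print(mixmin_weights(torch.rand(3, 100)))
```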
Poster
Aojun Lu · Hangjie Yuan · Tao Feng · Yanan Sun
[ East Exhibition Hall A-B ]
Abstract
The quest for Continual Learning (CL) seeks to empower neural networks with the ability to learn and adapt incrementally. Central to this pursuit is addressing the stability-plasticity dilemma, which involves striking a balance between two conflicting objectives: preserving previously learned knowledge and acquiring new knowledge. While numerous CL methods aim to achieve this trade-off, they often overlook the impact of network architecture on stability and plasticity, restricting the trade-off to the parameter level. In this paper, we delve into the conflict between stability and plasticity at the architectural level. We reveal that under an equal parameter constraint, deeper networks exhibit better plasticity, while wider networks are characterized by superior stability. To address this architectural-level dilemma, we introduce a novel framework denoted Dual-Arch, which serves as a plug-in component for CL. This framework leverages the complementary strengths of two distinct and independent networks: one dedicated to plasticity and the other to stability. Each network is designed with a specialized and lightweight architecture, tailored to its respective objective. Extensive experiments demonstrate that Dual-Arch enhances the performance of existing CL methods while being up to 87% more compact in terms of parameters.
Poster
Alessandro Favero · Antonio Sclocchi · Francesco Cagnetta · Pascal Frossard · Matthieu Wyart
[ East Exhibition Hall A-B ]
Abstract
Natural data is often organized as a hierarchical composition of features. How many samples do generative models need in order to learn the composition rules, so as to produce a combinatorially large number of novel data? What signal in the data is exploited to learn those rules? We investigate these questions in the context of diffusion models both theoretically and empirically. Theoretically, we consider a simple probabilistic context-free grammar - a tree-like graphical model used to represent the hierarchical and compositional structure of data such as language and images. We demonstrate that diffusion models learn the grammar's composition rules with the sample complexity required for clustering features with statistically similar context, a process similar to the word2vec algorithm. However, this clustering emerges hierarchically: higher-level features associated with longer contexts require more data to be identified. This mechanism leads to a sample complexity that scales polynomially with the context size. As a result, diffusion models trained on an intermediate dataset size generate data coherent up to a certain scale, but lacking global coherence. We test these predictions across different domains and find remarkable agreement: both generated texts and images achieve progressively larger coherence lengths as the training time or dataset …
Poster
Xu Wang · Yan Hu · Wenyu Du · Reynold Cheng · Benyou Wang · Difan Zou
[ East Exhibition Hall A-B ]
Abstract
Fine-tuning significantly improves the performance of Large Language Models (LLMs), yet its underlying mechanisms remain poorly understood. This paper aims to provide an in-depth interpretation of the fine-tuning process through circuit analysis, a popular tool in *Mechanistic Interpretability (MI)*. Unlike previous studies (Prakash et al. 2024, Chhabra et al. 2024) that focus on tasks where pre-trained models already perform well, we develop a set of mathematical tasks where fine-tuning yields substantial performance gains, bringing the setup closer to real-world scenarios. In our experiments, we identify circuits at various checkpoints during fine-tuning and examine the interplay between circuit analysis, fine-tuning methods, and task complexities. First, we find that while circuits maintain high node similarity before and after fine-tuning, their edges undergo significant changes, contrasting with previous work (Prakash et al. 2024, Chhabra et al. 2024) that reported only small circuit additions after fine-tuning. Based on these observations, we develop a **circuit-aware Low-Rank Adaptation (LoRA)** method that assigns ranks to layers according to edge changes in the circuits. Experimental results demonstrate that our circuit-based LoRA achieves an average improvement of 2.46% over standard LoRA with comparable parameter sizes. Furthermore, we explore how combining circuits from subtasks can enhance fine-tuning in compositional tasks, …
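The rank-assignment step lends itself to a small sketch. The exact allocation rule is not given in the abstract, so the proportional scheme below is only an assumed illustration of assigning LoRA ranks according to per-layer circuit edge changes.

```python
import numpy as np

def allocate_lora_ranks(edge_changes, rank_budget, r_min=1):
    """edge_changes[i]: measured circuit-edge change for layer i during fine-tuning.
    Distribute a total rank budget proportionally (illustrative rule, not the paper's)."""
    w = np.asarray(edge_changes, dtype=float)
    w = w / w.sum()
    return np.maximum(r_min, np.rint(w * rank_budget)).astype(int)

print(allocate_lora_ranks([0.1, 0.5, 2.4], rank_budget=32))  # -> [1 5 26]
```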
Spotlight Poster
Shikai Qiu · Lechao Xiao · Andrew Wilson · Jeffrey Pennington · Atish Agarwala
[ East Exhibition Hall A-B ]
Abstract
Understanding neural network training dynamics at scale is an important open problem. Although realistic model architectures, optimizers, and data interact in complex ways that make predictive theory challenging, we show that compute-optimally trained models exhibit remarkably precise collective regularities. Specifically, loss curves from models of varying sizes collapse onto a single universal curve when training compute and loss are normalized to unity at the end of training. With learning rate decay, discrepancies between normalized curves fall below the noise floor of individual models' loss curves across random seeds, yielding an exceptionally tight collapse we term "supercollapse." We observe supercollapse across learning rate schedules, datasets, and architectures, including transformers trained on next-token prediction. This collapse breaks down when hyperparameters are scaled suboptimally, providing a practical indicator of proper scaling. We explain these phenomena by connecting collapse to the power-law structure in typical neural scaling laws, and analyzing a simple but effective model of SGD noise dynamics that accurately captures how learning rate schedules deform loss curves away from power laws while preserving universality, and why learning rate decay suppresses variance to enable supercollapse.
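The normalization behind the collapse is simple to restate: rescale each model's compute and loss so both equal one at the end of training, then compare curves at matched normalized compute. A minimal sketch (variable names are ours):

```python
import numpy as np

def normalize_curve(compute, loss):
    """Rescale so training compute and loss are 1 at the end of training."""
    compute, loss = np.asarray(compute, float), np.asarray(loss, float)
    return compute / compute[-1], loss / loss[-1]

def collapse_dispersion(curves):
    """Std across models of normalized loss at matched normalized compute;
    'supercollapse' means this falls below per-model seed noise."""
    grid = np.linspace(0.01, 1.0, 100)
    interp = [np.interp(grid, *normalize_curve(c, l)) for c, l in curves]
    return np.std(interp, axis=0).mean()
```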
Poster
Xi Wang · Laurence Aitchison
[ East Exhibition Hall A-B ]
Abstract
The scaling of the optimal AdamW weight decay hyperparameter with model and dataset size is critical as we seek to build larger models, but is poorly understood. We show that weights learned by AdamW can be understood as an exponential moving average (EMA) of recent updates. This gives critical insights for how to set the weight decay in AdamW, and how the weight decay should scale with model and dataset size. In particular, the key hyperparameter for an exponential moving average is the EMA timescale. Intuitively, the EMA timescale can be understood as the number of recent iterations the EMA averages over. We find that the optimal timescale, measured in epochs, is roughly constant as we change model and dataset size. Moreover, given a learning rate, there is a one-to-one mapping from the EMA timescale to the weight decay hyperparameter. Thus, if the optimal EMA timescale is constant, that implies that as the dataset size increases, the optimal weight decay should fall and as the model size increases, the optimal weight decay should increase (if we follow the muP recommendation for scaling the learning rate). We validate these scaling rules on ResNet-18 and Vision Transformers trained on CIFAR-10 and ImageNet, …
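Taking the paper's EMA view at face value, the decay factor per AdamW step is (1 − η λ), so the timescale is roughly 1/(η λ) iterations. A short sketch of the implied scaling rule (the helper name is ours):

```python
def weight_decay_for_timescale(tau_epochs, lr, dataset_size, batch_size):
    """Pick AdamW weight decay so the EMA timescale is tau_epochs epochs,
    assuming timescale ~ 1 / (lr * weight_decay) iterations."""
    iters_per_epoch = dataset_size / batch_size
    return 1.0 / (lr * tau_epochs * iters_per_epoch)

# A constant optimal timescale implies the optimal weight decay falls as the
# dataset grows (all else fixed):
print(weight_decay_for_timescale(10, 1e-3, 50_000, 128))   # ~0.256
print(weight_decay_for_timescale(10, 1e-3, 100_000, 128))  # ~0.128
```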
Poster
Maksim Zhdanov · Max Welling · Jan-Willem van de Meent
[ East Exhibition Hall A-B ]
Abstract
Large-scale physical systems defined on irregular grids pose significant scalability challenges for deep learning methods, especially in the presence of long-range interactions and multi-scale coupling. Traditional approaches that compute all pairwise interactions, such as attention, become computationally prohibitive as they scale quadratically with the number of nodes. We present Erwin, a hierarchical transformer inspired by methods from computational many-body physics, which combines the efficiency of tree-based algorithms with the expressivity of attention mechanisms. Erwin employs ball tree partitioning to organize computation, which enables linear-time attention by processing nodes in parallel within local neighborhoods of fixed size. Through progressive coarsening and refinement of the ball tree structure, complemented by a novel cross-ball interaction mechanism, it captures both fine-grained local details and global features. We demonstrate Erwin's effectiveness across multiple domains, including cosmology, molecular dynamics, and particle fluid dynamics, where it consistently outperforms baseline methods both in accuracy and computational efficiency.
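The core computational idea, ball-tree partitioning followed by full attention inside fixed-size local neighborhoods, can be sketched briefly. Erwin itself adds progressive coarsening and cross-ball interaction; the sketch below covers only the linear-time local attention and assumes the number of points is a multiple of the leaf size.

```python
import torch
import torch.nn.functional as F

def ball_tree_perm(pos, leaf_size):
    """Permutation grouping points into spatially local balls via median splits."""
    def split(ids):
        if len(ids) <= leaf_size:
            return [ids]
        axis = int((pos[ids].max(0).values - pos[ids].min(0).values).argmax())
        order = ids[pos[ids, axis].argsort()]
        mid = len(order) // 2
        return split(order[:mid]) + split(order[mid:])
    return torch.cat(split(torch.arange(len(pos))))

def ball_attention(x, pos, leaf_size):
    """Full attention within each ball: cost is linear in the number of points."""
    perm = ball_tree_perm(pos, leaf_size)
    xb = x[perm].view(-1, leaf_size, x.shape[-1])     # [n_balls, leaf, dim]
    out = F.scaled_dot_product_attention(xb, xb, xb)  # per-ball attention
    y = torch.empty_like(x)
    y[perm] = out.reshape(-1, x.shape[-1])
    return y

x, pos = torch.randn(1024, 64), torch.rand(1024, 3)
y = ball_attention(x, pos, leaf_size=32)
```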
Poster
Qiang Chen · Zhongze Wu · Xiu Su · Xi Lin · Zhe Qu · Shan You · Shuo Yang · Chang Xu
[ East Exhibition Hall A-B ]
Abstract
Group fairness based on adversarial training has gained significant attention on graph data, which is typically implemented by masking sensitive attributes to generate fair feature views. However, existing models suffer from training instability due to the uncertainty of the generated masks and the trade-off between fairness and utility. In this work, we propose a stable fair Graph Neural Network (SFG) to maintain training stability while preserving accuracy and fairness performance. Specifically, we first theoretically derive a tight upper Lipschitz bound to control the stability of existing adversarial-based models and employ a stochastic projected subgradient algorithm to constrain the bound, which operates in a block-coordinate manner. Additionally, we construct the uncertainty set to train the model, which can prevent unstable training by dropping some overfitting nodes caused by chasing fairness. Extensive experiments conducted on three real-world datasets demonstrate that SFG is stable and outperforms other state-of-the-art adversarial-based methods in terms of both fairness and utility performance. Codes are available at https://github.com/sh-qiangchen/SFG.
Poster
Chuang Liu · Hongyan Xu · Yichao Cao · Xiu Su · Zhe Qu · Tianfa Li · Shan An · Haogang Zhu
[ East Exhibition Hall A-B ]
Abstract
Medical imaging faces significant challenges in single-domain generalization (SDG) due to the diversity of imaging devices and the variability among data collection centers. To address these challenges, we propose \textbf{TinyMIG}, a framework designed to transfer generalization capabilities from vision foundation models to medical imaging SDG. TinyMIG aims to enable lightweight specialized models to mimic the strong generalization capabilities of foundation models in terms of both global feature distribution and local fine-grained details during training. Specifically, for global feature distribution, we propose a Global Distribution Consistency Learning strategy that mimics the prior distributions of the foundation model layer by layer. For local fine-grained details, we further design a Localized Representation Alignment method, which promotes semantic alignment and generalization distillation between the specialized model and the foundation model. These mechanisms collectively enable the specialized model to achieve robust performance in diverse medical imaging scenarios. Extensive experiments on large-scale benchmarks demonstrate that TinyMIG, with extremely low computational cost, significantly outperforms state-of-the-art models, showcasing its superior SDG capabilities. All the code and model weights will be publicly available.
Poster
Minh Vu · Geigh Zollicoffer · Huy Mai · Ben Nebgen · Boian S Alexandrov · Manish Bhattarai
[ East Exhibition Hall A-B ]
Abstract
Multimodal Machine Learning systems, particularly those aligning text and image data like CLIP/BLIP models, have become increasingly prevalent, yet remain susceptible to adversarial attacks. While substantial research has addressed adversarial robustness in unimodal contexts, defense strategies for multimodal systems are underexplored. This work investigates the topological signatures that arise between image and text embeddings and shows how adversarial attacks disrupt their alignment, introducing distinctive signatures. We specifically leverage persistent homology and introduce two novel Topological-Contrastive losses based on Total Persistence and Multi-scale kernel methods to analyze the topological signatures introduced by adversarial perturbations. We observe a pattern of monotonic changes in the proposed topological losses emerging in a wide range of attacks on image-text alignments, as more adversarial samples are introduced in the data. By designing an algorithm to back-propagate these signatures to input samples, we are able to integrate these signatures into Maximum Mean Discrepancy tests, creating a novel class of tests that leverage topological signatures for better adversarial detection.
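Of the two losses, the Total Persistence variant is the simplest to restate: it aggregates lifetimes of topological features in a persistence diagram. A minimal sketch of that quantity (computing the diagrams themselves, e.g. with ripser or gudhi, is omitted; the monotone behavior under increasing adversarial contamination is the signature the paper exploits):

```python
def total_persistence(diagram, p=2):
    """diagram: iterable of (birth, death) pairs from persistent homology.
    Sums lifetimes (death - birth)^p, skipping infinite bars."""
    return sum((d - b) ** p for b, d in diagram if d != float("inf"))

clean = [(0.0, 0.8), (0.1, 0.3)]
perturbed = [(0.0, 1.4), (0.1, 0.9), (0.2, 0.6)]
print(total_persistence(perturbed) - total_persistence(clean))
```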
Poster
Shiwei Li · Xiandi Luo · Haozhao Wang · Xing Tang · Shijie Xu · weihongluo · Yuhua Li · xiuqiang He · Ruixuan Li
[ East Exhibition Hall A-B ]
Abstract
To improve the training efficiency of federated learning (FL), previous research has employed low-rank decomposition techniques to reduce communication overhead. In this paper, we seek to enhance the performance of these low-rank decomposition methods. Specifically, we focus on three key issues related to decomposition in FL: what to decompose, how to decompose, and how to aggregate. Subsequently, we introduce three novel techniques: Model Update Decomposition (MUD), Block-wise Kronecker Decomposition (BKD), and Aggregation-Aware Decomposition (AAD), each targeting a specific issue. These techniques are complementary and can be applied simultaneously to achieve optimal performance. Additionally, we provide a rigorous theoretical analysis to ensure the convergence of the proposed MUD. Extensive experimental results show that our approach achieves faster convergence and superior accuracy compared to relevant baseline methods. The code is available at https://github.com/Leopold1423/fedmud-icml25.
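Of the three techniques, Model Update Decomposition is the easiest to sketch: the model update, rather than the weight matrix, is communicated in factored form. The paper's factors are trained directly; the truncated SVD below is a purely illustrative stand-in for the compression step.

```python
import torch

def compress_update(delta_w, rank):
    """Client side: factor the model update (not the weights) as A @ B."""
    U, S, Vh = torch.linalg.svd(delta_w, full_matrices=False)
    return U[:, :rank] * S[:rank], Vh[:rank]   # A: [m, r], B: [r, n]

def apply_update(w, A, B):
    """Server side: reconstruct and apply the low-rank update."""
    return w + A @ B

w, delta = torch.randn(256, 128), 0.01 * torch.randn(256, 128)
A, B = compress_update(delta, rank=8)   # (256 + 128) * 8 values sent vs 256 * 128
w_new = apply_update(w, A, B)
```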
Poster
Guopeng Lin · Ruisheng Zhou · Shuyu Chen · Weili Han · Jin Tan · Wenjing Fang · Lei Wang · Tao Wei
[ East Exhibition Hall A-B ]
Abstract
K-nearest neighbors (KNN) classification plays a significant role in various applications due to its interpretability. The accuracy of KNN classification relies heavily on large amounts of high-quality data, which are often distributed among different parties and contain sensitive information. Dozens of privacy-preserving frameworks have been proposed for performing KNN classification with data from different parties while preserving data privacy. However, existing privacy-preserving frameworks for KNN classification demonstrate communication inefficiency in the online phase due to two main issues: (1) They suffer from huge communication size for secure Euclidean square distance computations. (2) They require numerous communication rounds to select the $k$ nearest neighbors. In this paper, we present $\texttt{Kona}$, an efficient privacy-preserving framework for KNN classification. We resolve the above communication issues by (1) designing novel Euclidean triples, which eliminate the online communication for secure Euclidean square distance computations, (2) proposing a divide-and-conquer bubble protocol, which significantly reduces communication rounds for selecting the $k$ nearest neighbors. Experimental results on eight real-world datasets demonstrate that $\texttt{Kona}$ significantly outperforms the state-of-the-art framework by $1.1\times \sim 3121.2\times$ in communication size, $19.1\times \sim 5783.2\times$ in communication rounds, and $1.1\times \sim 232.6\times$ in runtime.
Poster
Hao-Zhe Tan · Zhi Zhou · Yu-Feng Li · Lan-Zhe Guo
[ East Exhibition Hall A-B ]
Abstract
Pre-trained Vision-Language Models (VLMs) are becoming increasingly popular across various visual tasks, and several open-sourced VLM variants have been released. However, selecting the best-performing pre-trained VLM for a specific downstream task is challenging since no single VLM can achieve promising performance on all downstream tasks, and evaluating all available VLMs is impossible due to time and data limitations. To address this problem, this paper proposes a novel paradigm to select and reuse VLM for downstream tasks, called **M**odel **L**abel **L**earning (**MLL**). The proposal contains three key modules: *model labeling*, which assigns labels to each VLM to describe their specialty and utility; *model selection*, which matches the requirements of the target task with model labels; and *model reuse*, which applies selected VLMs to the target task in an ensemble manner. The proposal is highly computationally efficient and growable since the model labeling process is completed independently of target tasks, and its capability can grow with the number of candidate VLMs. We also introduce a new benchmark for evaluating VLM selection methods, including 49 VLMs and 17 target task datasets. Experimental results clearly demonstrate the effectiveness of the proposed method for selecting and reusing VLMs.
Poster
Shiba Biswal · Karthik Elamvazhuthi · Rishi Sonthalia
[ East Exhibition Hall A-B ]
Abstract
This paper investigates the use of transformers to approximate the mean-field dynamics of interacting particle systems exhibiting collective behavior. Such systems are fundamental in modeling phenomena across physics, biology, and engineering, including opinion formation, biological networks, and swarm robotics. The key characteristic of these systems is that the particles are indistinguishable, leading to permutation-equivariant dynamics. First, we empirically demonstrate that transformers are well-suited for approximating a variety of mean field models, including the Cucker-Smale model for flocking and milling, and the mean-field system for training two-layer neural networks. We support our numerical experiments with mathematical theory. Specifically, we prove that if a finite-dimensional transformer effectively approximates the finite-dimensional vector field governing the particle system, then the $L_\infty$ distance between the \textit{expected transformer} and the infinite-dimensional mean-field vector field can be bounded by a function of the number of particles observed during training. Leveraging this result, we establish theoretical bounds on the distance between the true mean-field dynamics and those obtained using the transformer.
Poster
Tao Feng · Wei Li · Didi Zhu · Hangjie Yuan · Wendi Zheng · Dan Zhang · Jie Tang
[ East Exhibition Hall A-B ]
Abstract
Backpropagation provides a generalized configuration for overcoming catastrophic forgetting. Optimizers such as SGD and Adam are commonly used for weight updates in continual learning and continual pre-training. However, access to gradient information is not always feasible in practice due to black-box APIs, hardware constraints, or non-differentiable systems, a challenge we refer to as the gradient bans. To bridge this gap, we introduce ZeroFlow, the first benchmark designed to evaluate gradient-free optimization algorithms for overcoming forgetting. ZeroFlow examines a suite of forward pass-based methods across various algorithms, forgetting scenarios, and datasets. Our results show that forward passes alone can be sufficient to mitigate forgetting. We uncover novel optimization principles that highlight the potential of forward pass-based methods in mitigating forgetting, managing task conflicts, and reducing memory demands. Additionally, we propose new enhancements that further improve forgetting resistance using only forward passes. This work provides essential tools and insights to advance the development of forward-pass-based methods for continual learning.
Poster
Edoardo Urettini · Antonio Carta
[ East Exhibition Hall A-B ]
Abstract
Online Continual Learning (OCL) models continuously adapt to nonstationary data streams, usually without task information. These settings are complex, and many traditional CL methods fail, while online methods (mainly replay-based) suffer from instabilities after the task shift. To address this issue, we formalize replay-based OCL as a second-order online joint optimization with explicit KL-divergence constraints on replay data. We propose Online Curvature-Aware Replay (OCAR) to solve the problem: a method that leverages second-order information of the loss using a K-FAC approximation of the Fisher Information Matrix (FIM) to precondition the gradient. The FIM acts as a stabilizer to prevent forgetting while also accelerating the optimization in non-interfering directions. We show how to adapt the estimation of the FIM to a continual setting, stabilizing second-order optimization for non-iid data and uncovering the role of Tikhonov damping in the stability-plasticity tradeoff. Empirical results show that OCAR outperforms state-of-the-art methods in continual metrics, achieving higher average accuracy throughout the training process on three different benchmarks.
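For a single linear layer, the K-FAC preconditioning step can be sketched directly, assuming the standard factorization F ≈ A ⊗ G with A the second moment of layer inputs and G that of output gradients; the damping term below is the Tikhonov term whose role the abstract highlights.

```python
import torch

def kfac_precondition(grad_w, acts, grads_out, damping=1e-3):
    """Approximate F^{-1} grad via Kronecker factors: A^{-1} @ grad @ G^{-1}.
    grad_w: [d_in, d_out]; acts: [n, d_in]; grads_out: [n, d_out]."""
    n = acts.shape[0]
    A = acts.T @ acts / n + damping * torch.eye(acts.shape[1])
    G = grads_out.T @ grads_out / n + damping * torch.eye(grads_out.shape[1])
    return torch.linalg.solve(A, grad_w) @ torch.linalg.inv(G)
```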
Poster
Yupeng Qiu · Han Fang · Ee-Chien Chang
[ East Exhibition Hall A-B ]
Abstract
Deep learning-based watermarking models play a crucial role in copyright protection across various applications. However, many high-performance models are limited in practical deployment due to their large number of parameters. Meanwhile, the robustness and invisibility performance of existing lightweight models are unsatisfactory. This presents a pressing need for a watermarking model that combines lightweight capacity with satisfactory performance. Our research identifies a key reason that limits the performance of existing watermarking frameworks: a mismatch between commonly used decoding losses (e.g., mean squared error and binary cross-entropy loss) and the actual decoding goal, leading to parameter redundancy. We propose two innovative solutions: (1) Decoding-oriented surrogate loss (DO), which redesigns the loss function to mitigate the influence of decoding-irrelevant optimization directions; and (2) Detachable projection head (PH), which incorporates a detachable redundant module during training to handle these irrelevant directions and is discarded during inference. Additionally, we propose a novel watermarking framework comprising five submodules, allowing for independent parameter reduction in each component. Our proposed model achieves better efficiency, invisibility, and robustness while utilizing only 2.2\% of the parameters compared to the state-of-the-art frameworks. By improving efficiency while maintaining robust copyright protection, our model is well suited for practical applications in resource-constrained …
Poster
Nghiem Diep · Huy Nguyen · Chau Nguyen · Minh Le · Duy Nguyen · Daniel Sonntag · Mathias Niepert · Nhat Ho
[ East Exhibition Hall A-B ]
Abstract
LLaMA-Adapter has recently emerged as an efficient fine-tuning technique for LLaMA models, leveraging zero-initialized attention to stabilize training and enhance performance. However, despite its empirical success, the theoretical foundations of zero-initialized attention remain largely unexplored. In this paper, we provide a rigorous theoretical analysis, establishing a connection between zero-initialized attention and mixture-of-expert models. We prove that both linear and non-linear prompts, along with gating functions, can be optimally estimated, with non-linear prompts offering greater flexibility for future applications. Empirically, we validate our findings on the open LLM benchmarks, demonstrating that non-linear prompts outperform linear ones. Notably, even with limited training data, both prompt types consistently surpass vanilla attention, highlighting the robustness and adaptability of zero-initialized attention.
Poster
Lexiang Hu · Yisen Wang · Zhouchen Lin
[ East Exhibition Hall A-B ]
Abstract
Kolmogorov-Arnold Networks (KANs) have seen great success in scientific domains thanks to spline activation functions, becoming an alternative to Multi-Layer Perceptrons (MLPs). However, spline functions may not respect symmetry in tasks, which is crucial prior knowledge in machine learning. In this paper, we propose Equivariant Kolmogorov-Arnold Networks (EKAN), a method for incorporating arbitrary matrix group equivariance into KANs, aiming to broaden their applicability to more fields. We first construct gated spline basis functions, which form the EKAN layer together with equivariant linear weights, and then define a lift layer to align the input space of EKAN with the feature space of the dataset, thereby building the entire EKAN architecture. Compared with baseline models, EKAN achieves higher accuracy with smaller datasets or fewer parameters on symmetry-related tasks, such as particle scattering and the three-body problem, often reducing test MSE by several orders of magnitude. Even in non-symbolic formula scenarios, such as top quark tagging with three jet constituents, EKAN achieves comparable results with state-of-the-art equivariant architectures using fewer than $40\%$ of the parameters, while KANs do not outperform MLPs as expected. Code and data are available at [https://github.com/hulx2002/EKAN](https://github.com/hulx2002/EKAN).
Poster
Haotian Ni · Yake Wei · Hang Liu · Gong Chen · Chong Peng · Hao Lin · Di Hu
[ East Exhibition Hall A-B ]
Abstract
Multimodal learning faces challenges in effectively fusing information from diverse modalities, especially when modality quality varies across samples. Dynamic fusion strategies, such as the attention mechanism in Transformers, aim to address this challenge by adaptively emphasizing modalities based on the characteristics of input data. However, through extensive, carefully designed experiments, we surprisingly observed that the dynamic adaptability of widely used self-attention models diminishes: the model tends to prefer one modality regardless of data characteristics. This bias triggers a self-reinforcing cycle that progressively overemphasizes the favored modality, widening the distribution gap in attention keys across modalities and deactivating the attention mechanism's dynamic properties. To revive adaptability, we propose a simple yet effective method, Rolling Query (RollingQ), which balances attention allocation by rotating the query to break the self-reinforcing cycle and mitigate the key distribution gap. Extensive experiments on various multimodal scenarios validate the effectiveness of RollingQ, and the restoration of cooperation dynamics is pivotal for enhancing the broader capabilities of widely deployed multimodal Transformers. The source code is available at https://github.com/GeWu-Lab/RollingQ_ICML2025.
Spotlight Poster
Miriam Doh · Benedikt Höltgen · Piera Riccio · Nuria Oliver
[ East Exhibition Hall A-B ]
Abstract
This position paper critiques the reliance on rigid racial taxonomies in machine learning, exposing their U.S.-centric nature and lack of global applicability—particularly in Europe, where race categories are not commonly used. These classifications oversimplify racial identity, erasing the experiences of mixed-race individuals and reinforcing outdated essentialist views that contradict the social construction of race. We suggest research agendas in machine learning that move beyond categorical variables to better address discrimination and social inequality.
Oral Poster
Andrew C. Cullen · Paul MONTAGUE · Sarah Erfani · Benjamin Rubinstein
[ East Exhibition Hall A-B ]
Abstract
While certified robustness is widely promoted as a solution to adversarial examples in Artificial Intelligence systems, significant challenges remain before these techniques can be meaningfully deployed in real-world applications. We identify critical gaps in current research, including the paradox of detection without distinction, the lack of clear criteria for practitioners to evaluate certification schemes, and the potential security risks arising from users' expectations surrounding "guaranteed" robustness claims. This position paper is a call to arms for the certification research community, proposing concrete steps to address these fundamental challenges and advance the field toward practical applicability.
Poster
Ming Jin · Hyunin Lee
[ East Exhibition Hall A-B ]
Abstract
This position paper contends that modern AI research must adopt an antifragile perspective on safety---one in which the system's capacity to handle rare or out-of-distribution (OOD) events adapts and expands over repeated exposures. Conventional static benchmarks and single-shot robustness tests overlook the reality that environments evolve and that models, if left unchallenged, can drift into maladaptation (e.g., reward hacking, over-optimization, or atrophy of broader capabilities). We argue that an antifragile approach, which emphasizes leveraging current uncertainties to better prepare for potentially greater, more unpredictable uncertainties in the future rather than striving to rapidly reduce them, is pivotal for the long-term reliability of open-ended ML systems. In this position paper, we first identify key limitations of static testing, including scenario diversity, reward hacking, and over-alignment. We then explore the potential of dynamic, antifragile solutions to manage rare events. Crucially, we advocate for a fundamental recalibration of the methods used to measure, benchmark, and continually improve AI safety over the long term, complementing existing robustness approaches by providing ethical and practical guidelines towards fostering an antifragile AI safety community.
Poster
Simone Drago · Marco Mussi · Alberto Maria Metelli
[ East Exhibition Hall A-B ]
Abstract
Mainstream research in theoretical RL is currently focused on designing online learning algorithms with regret bounds that match the corresponding regret lower bound up to multiplicative constants (and, sometimes, logarithmic terms). In this position paper, we constructively question this trend, arguing that algorithms should be designed to at least minimize the amount of unnecessary exploration, and we highlight the significant role constants play in algorithms' actual performances. This trend also exacerbates the misalignment between theoretical researchers and practitioners. As an emblematic example, we consider the case of regret minimization in finite-horizon tabular MDPs. Starting from the well-known UCBVI algorithm, we improve the bonus terms and the corresponding regret analysis. Additionally, we compare our version of UCBVI with both its original version and the state-of-the-art MVP algorithm. Our empirical validation successfully demonstrates how improving the multiplicative constants has significant positive effects on the actual empirical performances of the algorithm under analysis. This raises the question of whether ignoring constants when assessing whether algorithms match the lower bound is the proper approach.
Poster
David A. Danhofer · Davide DAscenzo · Rafael Dubach · Tomaso A Poggio
[ East Exhibition Hall A-B ]
Abstract
Overparametrized Deep Neural Networks (DNNs) have demonstrated remarkable success in a wide variety of domains too high-dimensional for classical shallow networks subject to the curse of dimensionality. However, open questions remain about the fundamental principles that govern the learning dynamics of DNNs. In this position paper we argue that it is the ability of DNNs to exploit the compositionally sparse structure of the target function that drives their success. As such, DNNs can leverage the property that most practically relevant functions can be composed from a small set of constituent functions, each of which relies only on a low-dimensional subset of all inputs. We show that this property is shared by all efficiently Turing-computable functions and is therefore highly likely present in all current learning problems. While some promising theoretical insights on questions concerned with approximation and generalization exist in the setting of compositionally sparse functions, several important questions on the learnability and optimization of DNNs remain. Completing the picture of the role of compositional sparsity in deep learning is essential to a comprehensive theory of artificial—and even general—intelligence.
Poster
Golnaz Mesbahi · Parham Mohammad Panahi · Olya Mastikhina · Steven Tang · Martha White · Adam White
[ East Exhibition Hall A-B ]
Abstract
In continual RL we want agents capable of never-ending learning, and yet our evaluation methodologies do not reflect this. The standard practice in RL is to assume unfettered access to the deployment environment for the full lifetime of the agent. For example, agent designers select the best performing hyperparameters in Atari by testing each for 200 million frames and then reporting results on 200 million frames. In this position paper, we argue and demonstrate the pitfalls of this inappropriate empirical methodology: lifetime tuning. We provide empirical evidence to support our position by testing DQN and SAC across several continuing and non-stationary environments, with two main findings: (1) lifetime tuning does not allow us to identify algorithms that work well for continual learning---all algorithms equally succeed; (2) recently developed continual RL algorithms outperform standard non-continual algorithms when tuning is limited to a fraction of the agent's lifetime. The goal of this paper is to provide an explanation for why recent progress in continual RL has been mixed and motivate the development of empirical practices that better match the goals of continual RL.
Poster
Yuhe Guo · Huayi Tang · Jiahong Ma · Hongteng Xu · Zhewei Wei
[ East Exhibition Hall A-B ]
Abstract
Spectral graph learning builds upon two foundations: the graph Fourier basis as its theoretical cornerstone, with polynomial approximation to enable practical implementation. While this framework has led to numerous successful designs, we argue that its effectiveness might stem from mechanisms different from its theoretical foundations. In this paper, we identify two fundamental issues that challenge our current understanding: (1) The graph Fourier basis $\mathbf{U}$ (eigenvectors of the normalized graph Laplacian) faces too many questions to truly serve its intended role, particularly in preserving the semantic properties of Fourier analysis; (2) The limitations preventing expressive filters are not merely practical constraints, but fundamental barriers that naturally protect stability and generalization. Importantly, the two issues entangle with each other. The second has obscured the first: the natural avoidance of complex filters has prevented us from fully confronting the questions about $\mathbf{U}$'s role as a Fourier basis. This observation leads to our position: the effectiveness of spectral GNNs relies less on the graph Fourier basis than originally conceived, or, in other words, **spectral GNNs might not be so spectral**. The position leads us to at least two potential research interests: to incorporate a more semantically meaningful graph dictionary other than $\mathbf{U}$, and to re-examine the theoretical role …
Spotlight Poster
Andy Zhang · Kevin Klyman · Yifan Mai · Yoav Levine · Yian Zhang · Rishi Bommasani · Percy Liang
[ East Exhibition Hall A-B ]
Abstract
Language models are extensively evaluated, but correctly interpreting evaluation results requires knowledge of train-test overlap, which refers to the extent to which the language model is trained on the very data it is being tested on. The public currently lacks adequate information about train-test overlap: most models have no public train-test overlap statistics, and third parties cannot directly measure train-test overlap since they do not have access to the training data. To make this clear, we document the practices of 30 models, finding that just 9 models report train-test overlap: 4 models release training data under open-source licenses, enabling the community to directly measure train-test overlap, and 5 models publish their train-test overlap methodology and statistics. By engaging with language model developers, we provide novel information about train-test overlap for three additional models. Overall, this position paper argues that language model developers should publish train-test overlap statistics and/or training data whenever they report evaluation results on public test sets. We hope our work increases transparency into train-test overlap to increase the community-wide trust in model evaluations.
Oral Poster
Alan Jeffares · Mihaela van der Schaar
[ East Exhibition Hall A-B ]
Abstract
Developing a better understanding of surprising or counterintuitive phenomena has constituted a significant portion of deep learning research in recent years. These include double descent, grokking, and the lottery ticket hypothesis -- among many others. Works in this area often develop *ad hoc hypotheses* attempting to explain these observed phenomena on an isolated, case-by-case basis. This position paper asserts that, in many prominent cases, there is little evidence to suggest that these phenomena appear in real-world applications and these efforts may be inefficient in driving progress in the broader field. Consequently, we argue against viewing them as isolated puzzles that require bespoke resolutions or explanations. However, despite this, we suggest that deep learning phenomena *do* still offer research value by providing unique settings in which we can refine our *broad explanatory theories* of more general deep learning principles. This position is reinforced by analyzing the research outcomes of several prominent examples of these phenomena from the recent literature. We revisit the current norms in the research community in approaching these problems and propose practical recommendations for future research, aiming to ensure that progress on deep learning phenomena is well aligned with the ultimate pragmatic goal of progress in the broader …
Oral Poster
Bruno Mlodozeniec · David Krueger · Richard E Turner
[ East Exhibition Hall A-B ]
Abstract
Causal inference is a key research area in machine learning, yet confusion reigns over the tools needed to tackle it. There are prevalent claims in the machine learning literature that you need a bespoke causal framework or notation to answer causal questions. In this paper, we make it clear that you can answer any causal inference question within the realm of probabilistic modelling and inference, without causal-specific tools or notation. Through concrete examples, we demonstrate how causal questions can be tackled by writing down the probability of everything. We argue for the advantages of the generality of the probabilistic modelling lens, when compared to bespoke causal frameworks. Lastly, we reinterpret causal tools as emerging from standard probabilistic modelling and inference, elucidating their necessity and utility.
Poster
Yunke Wang · Yanxi Li · Chang Xu
[ East Exhibition Hall A-B ]
Abstract
AI Scaling has traditionally been synonymous with Scaling Up, which builds larger and more powerful models. However, the growing demand for efficiency, adaptability, and collaboration across diverse applications necessitates a broader perspective. This position paper presents a holistic framework for AI scaling, encompassing Scaling Up, Scaling Down, and Scaling Out. It argues that while Scaling Up of models faces inherent bottlenecks, the future trajectory of AI scaling lies in Scaling Down and Scaling Out. These paradigms address critical technical and societal challenges, such as reducing carbon footprint, ensuring equitable access, and enhancing cross-domain collaboration. We explore transformative applications in healthcare, smart manufacturing, and content creation, demonstrating how AI Scaling can enable breakthroughs in efficiency, personalization, and global connectivity. Additionally, we highlight key challenges, including balancing model complexity with interpretability, managing resource constraints, and fostering ethical development. By synthesizing these approaches, we propose a unified roadmap that redefines the future of AI research and application, paving the way for advancements toward Artificial General Intelligence (AGI).
Poster
Alex Gu · Naman Jain · Wen-Ding Li · Manish Shetty Molahalli · Kevin Ellis · Koushik Sen · Armando Solar-Lezama
[ East Exhibition Hall A-B ]
Abstract
AI for software engineering has made remarkable progress, becoming a notable success within generative AI. Despite this, achieving fully automated software engineering is still a significant challenge, requiring research efforts across both academia and industry. In this position paper, our goal is threefold. First, we provide a taxonomy of measures and tasks to categorize work towards AI software engineering. Second, we outline the key bottlenecks permeating today's approaches. Finally, we highlight promising paths towards making progress on these bottlenecks to guide future research in this rapidly maturing field.
Poster
Elliot Meyerson · Xin Qiu
[ East Exhibition Hall A-B ]
Abstract
Decomposing hard problems into subproblems often makes them easier and more efficient to solve. With the high cost of running LLMs at scale, there is an increasing effort to decompose systems into sets of LLM-based agents, each of whom can be delegated sub-tasks. However, this decomposition (even when automated) is often intuitive, e.g., based on how a human might assign roles to members of a human team. How close are these role decompositions to optimal? This position paper argues that asymptotic analysis with LLM primitives is needed to reason about the efficiency of such problem decompositions, and that insights from such analysis will unlock opportunities for scaling such systems. By treating the LLM forward pass as the atomic unit of computational cost, one can separate out the (often opaque) inner workings of a particular LLM from the inherent efficiency of how a set of LLMs are orchestrated to solve hard problems. In other words, if we want to scale the deployment of LLMs to the limit, instead of anthropomorphizing LLMs, asymptotic analysis with LLM primitives should be used to reason about and develop more powerful decompositions of large problems into LLM agents.
Poster
Feiran Li · Qianqian Xu · Shilong Bao · Zhiyong Yang · Xiaochun Cao · Qingming Huang
[ East Exhibition Hall A-B ]
Abstract
Concept erasing has recently emerged as an effective paradigm to prevent text-to-image diffusion models from generating visually undesirable or even harmful content. However, current removal methods heavily rely on manually crafted text prompts, making it challenging to achieve a high erasure (**efficacy**) while minimizing the impact on other benign concepts (**usability**), as illustrated in Fig.1. In this paper, we attribute the limitations to the inherent gap between the text and image modalities, which makes it hard to transfer the intricately entangled concept knowledge from text prompts to the image generation process. To address this, we propose a novel solution by directly integrating visual supervision into the erasure process, introducing the first text-image Collaborative Concept Erasing (**Co-Erasing**) framework. Specifically, Co-Erasing describes the concept jointly by text prompts and the corresponding undesirable images induced by the prompts, and then reduces the generating probability of the target concept through negative guidance. This approach effectively bypasses the knowledge gap between text and image, significantly enhancing erasure efficacy. Additionally, we design a text-guided image concept refinement strategy that directs the model to focus on visual features most relevant to the specified text concept, minimizing disruption to other benign concepts. Finally, comprehensive experiments suggest that Co-Erasing …
Poster
Chhavi Yadav · Evan Laufer · Dan Boneh · Kamalika Chaudhuri
[ East Exhibition Hall A-B ]
Abstract
In principle, explanations are intended as a way to increase trust in machine learning models and are often mandated by regulations. However, many circumstances where these are demanded are adversarial in nature, meaning the involved parties have misaligned interests and are incentivized to manipulate explanations for their purpose. As a result, explainability methods fail to be operational in such settings despite the demand. In this paper, we take a step towards operationalizing explanations in adversarial scenarios with Zero-Knowledge Proofs (ZKPs), a cryptographic primitive. Specifically, we explore ZKP-amenable versions of the popular explainability algorithm LIME and evaluate their performance on Neural Networks and Random Forests. Our code is publicly available at: \url{https://github.com/emlaufer/ExpProof}.
Poster
Yuwei Niu · Shuo He · Qi Wei · Zongyu Wu · Feng Liu · Lei Feng
[ East Exhibition Hall A-B ]
Abstract
While multimodal contrastive learning methods (e.g., CLIP) can achieve impressive zero-shot classification performance, recent research has revealed that these methods are vulnerable to backdoor attacks. To defend against backdoor attacks on CLIP, existing defense methods focus on either the pre-training stage or the fine-tuning stage, which would unfortunately cause high computational costs due to numerous parameter updates and are not applicable in black-box settings. In this paper, we provide the first attempt at a computationally efficient backdoor detection method to defend against backdoored CLIP in the inference stage. We empirically find that the visual representations of backdoored images are insensitive to benign and malignant changes in class description texts. Motivated by this observation, we propose BDetCLIP, a novel test-time backdoor detection method based on contrastive prompting. Specifically, we first prompt a language model (e.g., GPT-4) to produce class-related description texts (benign) and class-perturbed random texts (malignant) by specially designed instructions. Then, the distribution difference in cosine similarity between images and the two types of class description texts can be used as the criterion to detect backdoor samples. Extensive experiments validate that our proposed BDetCLIP is superior to state-of-the-art backdoor detection methods, in terms of both effectiveness and efficiency.
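The detection criterion itself is lightweight enough to sketch; the embeddings and decision threshold below are placeholders, and prompt generation via GPT-4 is omitted.

```python
import torch
import torch.nn.functional as F

def bdetclip_score(img_emb, benign_text_embs, malignant_text_embs):
    """Gap in mean cosine similarity between class-related (benign) and
    class-perturbed (malignant) prompts. Backdoored images are insensitive to
    this change, so an unusually small gap flags a suspected backdoor sample."""
    img = F.normalize(img_emb, dim=-1)
    sim_benign = (img @ F.normalize(benign_text_embs, dim=-1).T).mean()
    sim_malignant = (img @ F.normalize(malignant_text_embs, dim=-1).T).mean()
    return (sim_benign - sim_malignant).item()

# Flag as backdoored when the score falls below a threshold calibrated on clean data.
```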
Poster
Edward Chang
[ East Exhibition Hall A-B ]
Abstract
This paper introduces a checks-and-balances framework for ethical alignment of Large Language Models (LLMs), inspired by three-branch governmental systems. It implements three independent yet interacting components: LLMs as the executive branch for knowledge generation, DIKE as the legislative branch establishing ethical guardrails, and ERIS as the judicial branch for contextual interpretation. Beyond structural separation, we address a fundamental challenge: regulating emotion to shape behaviors. Drawing from psychological theories where managing emotional responses prevents harmful behaviors, we develop a self-supervised learning pipeline that maps emotions to linguistic behaviors, enabling precise behavioral modulation through emotional conditioning. By integrating this approach with adversarial testing, our framework demonstrates how DIKE and ERIS direct linguistic behaviors toward ethical outcomes while preserving independence throughout knowledge generation, ethical oversight, and contextual interpretation.
Spotlight Poster
Ali Ebrahimpour-Boroojeny · Hari Sundaram · Varun Chandrasekaran
[ East Exhibition Hall A-B ]
Abstract
Machine unlearning, where users can request the deletion of a forget dataset, is becoming increasingly important because of numerous privacy regulations. Initial works on "exact'' unlearning (e.g., retraining) incur large computational overheads. However, while computationally inexpensive, "approximate'' methods have fallen short of reaching the effectiveness of exact unlearning: models produced fail to obtain comparable accuracy and prediction confidence on both the forget and test (i.e., unseen) dataset. Exploiting this observation, we propose a new unlearning method, Adversarial Machine UNlearning (AMUN), that outperforms prior state-of-the-art (SOTA) methods for image classification. AMUN lowers the confidence of the model on the forget samples by fine-tuning the model on their corresponding adversarial examples. Adversarial examples naturally belong to the distribution imposed by the model on the input space; fine-tuning the model on the adversarial examples closest to the corresponding forget samples (a) localizes the changes to the decision boundary of the model around each forget sample and (b) avoids drastic changes to the global behavior of the model, thereby preserving the model's accuracy on test samples. Using AMUN for unlearning a random 10% of CIFAR-10 samples, we observe that even SOTA membership inference attacks cannot do better than random guessing.
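A minimal sketch of the two ingredients follows: finding adversarial neighbors of forget samples with PGD, then fine-tuning on them with the labels the model itself assigns. Hyperparameters are illustrative, not the paper's.

```python
import torch
import torch.nn.functional as F

def pgd_adversarial(model, x, y, eps=8/255, alpha=2/255, steps=10):
    """Adversarial examples close to the forget samples x (true labels y)."""
    delta = torch.zeros_like(x, requires_grad=True)
    for _ in range(steps):
        F.cross_entropy(model(x + delta), y).backward()
        with torch.no_grad():
            delta += alpha * delta.grad.sign()
            delta.clamp_(-eps, eps)
        delta.grad.zero_()
    return (x + delta).detach()

def amun_step(model, optimizer, x_forget, y_forget):
    """One unlearning step: fine-tune on adversarial neighbors, labeled by the
    model's own predictions, to lower confidence on the forget samples."""
    x_adv = pgd_adversarial(model, x_forget, y_forget)
    y_adv = model(x_adv).argmax(dim=1).detach()
    optimizer.zero_grad()
    F.cross_entropy(model(x_adv), y_adv).backward()
    optimizer.step()
```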
Poster
Jun-Peng Jiang · Tao Zhou · De-Chuan Zhan · Han-Jia Ye
[ East Exhibition Hall A-B ]
Abstract
Multimodal Large Language Models (MLLMs) for tabular understanding have made significant progress in tasks such as financial report analysis and public data tests. However, our comprehensive analysis shows that these models are still limited in certain simple scenarios, particularly when handling compositional conditions in QA. Further investigation reveals that the poor performance can be attributed to two main challenges: the visual encoder's inability to accurately recognize the content of a row, and the model's tendency to overlook conditions in the question. To address these, we introduce a new Compositional Condition Tabular Understanding method, called {\sc CoCoTab}. Specifically, to capture the structural relationships within tables, we enhance the visual encoder with additional row and column patches. Moreover, we introduce conditional tokens between the visual patches and query embeddings, ensuring the model focuses on relevant parts of the table according to the conditions specified in the query. Additionally, we introduce the Massive Multimodal Tabular Understanding (MMTU) benchmark, which comprehensively assesses the full capabilities of MLLMs in tabular understanding. Our proposed method achieves state-of-the-art performance on both existing tabular understanding benchmarks and MMTU. Our code is available at \url{https://github.com/LAMDA-Tabular/MMTU}.
Poster
Zhaorun Chen · Mintong Kang · Bo Li
[ East Exhibition Hall A-B ]
Abstract
Autonomous agents powered by foundation models have seen widespread adoption across various real-world applications. However, they remain highly vulnerable to malicious instructions and attacks, which can result in severe consequences such as privacy breaches and financial losses. More critically, existing guardrails for LLMs are not applicable due to the complex and dynamic nature of agents. To tackle these challenges, we propose ShieldAgent, the first guardrail agent designed to enforce explicit safety policy compliance for the action trajectory of other protected agents through logical reasoning. Specifically, ShieldAgent first constructs a safety policy model by extracting verifiable rules from policy documents and structuring them into a set of action-based probabilistic rule circuits. Given the action trajectory of the protected agent, ShieldAgent retrieves relevant rule circuits and generates a shielding plan, leveraging its comprehensive tool library and executable code for formal verification. In addition, given the lack of guardrail benchmarks for agents, we introduce ShieldAgent-Bench, a dataset with 3K safety-related pairs of agent instructions and action trajectories, collected via SOTA attacks across 6 web environments and 7 risk categories. Experiments show that ShieldAgent achieves SOTA on ShieldAgent-Bench and three existing benchmarks, outperforming prior methods by 11.3% on average with a high recall of …
Poster
Andy Dong · Wei-Ning Chen · Ayfer Ozgur
[ East Exhibition Hall A-B ]
Abstract
We study how inherent randomness in the training process—where each sample (or client in federated learning) contributes only to a randomly selected portion of training—can be leveraged for privacy amplification. This includes (1) data partitioning, where a sample participates in only a subset of training iterations, and (2) model partitioning, where a sample updates only a subset of the model parameters. We apply our framework to model parallelism in federated learning, where each client updates a randomly selected subnetwork to reduce memory and computational overhead, and show that existing methods, e.g. model splitting or dropout, provide a significant privacy amplification gain not captured by previous privacy analysis techniques. Additionally, we introduce balanced iteration subsampling, a new data partitioning method where each sample (or client) participates in a fixed number of training iterations. We show that in certain regimes, this method yields stronger privacy amplification than Poisson (i.i.d.) sampling of data (or clients). Our results demonstrate that randomness in the training process, which is structured rather than i.i.d. and interacts with data in complex ways, can be systematically leveraged for nontrivial privacy amplification.
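Balanced iteration subsampling is simple to state concretely: instead of including each client independently at every iteration (Poisson sampling), each client is assigned to exactly k of the T iterations, chosen uniformly at random. A minimal sketch:

```python
import numpy as np

def balanced_iteration_subsampling(n_clients, n_iters, k, seed=0):
    """Each client participates in exactly k of n_iters training iterations."""
    rng = np.random.default_rng(seed)
    schedule = [[] for _ in range(n_iters)]
    for client in range(n_clients):
        for t in rng.choice(n_iters, size=k, replace=False):
            schedule[t].append(client)
    return schedule  # schedule[t]: clients participating at iteration t

schedule = balanced_iteration_subsampling(n_clients=100, n_iters=50, k=5)
```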
Poster
Jakob Burkhardt · Hannah Keller · Claudio Orlandi · Chris Schwiegelshohn
[ East Exhibition Hall A-B ]
Abstract
We introduce the *linear-transformation model*, a distributed model of differentially private data analysis. Clients have access to a trusted platform capable of applying a public matrix to their inputs. Such computations can be securely distributed across multiple servers using simple and efficient secure multiparty computation techniques. The linear-transformation model serves as an intermediate model between the highly expressive *central model* and the minimal *local model*. In the central model, clients have access to a trusted platform capable of applying any function to their inputs. However, this expressiveness comes at a cost, as it is often expensive to distribute such computations, leading to the central model typically being implemented by a single trusted server. In contrast, the local model assumes no trusted platform, which forces clients to add significant noise to their data. The linear-transformation model avoids the single point of failure for privacy present in the central model, while also mitigating the high noise required in the local model. We demonstrate that linear transformations are very useful for differential privacy, allowing for the computation of linear sketches of input data. These sketches largely preserve utility for tasks such as private low-rank approximation and private ridge regression, while introducing only minimal …
Poster
Zeming Wei · Yiwen Guo · Yisen Wang
[ East Exhibition Hall A-B ]
Abstract
Adversarial training (AT) has been considered one of the most effective methods for making deep neural networks robust against adversarial attacks, while the training mechanisms and dynamics of AT remain open research problems. In this paper, we present a novel perspective on studying AT through the lens of class-wise feature attribution. Specifically, we identify the impact of a key family of features on AT that are shared by multiple classes, which we call cross-class features. These features are typically useful for robust classification, which we offer theoretical evidence to illustrate through a synthetic data model. Through systematic studies across multiple model architectures and settings, we find that during the initial stage of AT, the model tends to learn more cross-class features until the best robustness checkpoint. As AT further squeezes the training robust loss and causes robust overfitting, the model tends to make decisions based on more class-specific features. Based on these discoveries, we further provide a unified view of two existing properties of AT, including the advantage of soft-label training and robust overfitting. Overall, these insights refine the current understanding of AT mechanisms and provide new perspectives on studying them. Our code is available at https://github.com/PKU-ML/Cross-Class-Features-AT.
Poster
Yi Yu · Song Xia · SIYUAN YANG · Chenqi KONG · Wenhan Yang · Shijian Lu · Yap-peng Tan · Alex Kot
[ East Exhibition Hall A-B ]
Abstract
Most existing unlearnable strategies focus on preventing unauthorized users from training single-task learning (STL) models with personal data. Nevertheless, the paradigm has recently shifted towards multi-task data and multi-task learning (MTL), targeting generalist and foundation models that can handle multiple tasks simultaneously. Despite their growing importance, MTL data and models have been largely neglected while pursuing unlearnable strategies. This paper presents MTL-UE, the first unified framework for generating unlearnable examples for multi-task data and MTL models. Instead of optimizing perturbations for each sample, we design a generator-based structure that introduces label priors and class-wise feature embeddings, which leads to much better attacking performance. In addition, MTL-UE incorporates intra-task and inter-task embedding regularization to increase inter-class separation and suppress intra-class variance, which greatly enhances attack robustness. Furthermore, MTL-UE is versatile, with good support for dense prediction tasks in MTL. It is also plug-and-play, allowing integration of existing surrogate-dependent unlearnable methods with little adaptation. Extensive experiments show that MTL-UE achieves superior attacking performance consistently across 4 MTL datasets, 3 base UE methods, 5 model backbones, and 5 MTL task-weighting strategies. Code is available at https://github.com/yuyi-sd/MTL-UE.
Spotlight Poster
Kristina Nikolić · Luze Sun · Jie Zhang · Florian Tramer
[ East Exhibition Hall A-B ]
Abstract
Jailbreak attacks bypass the guardrails of large language models to produce harmful outputs. In this paper, we ask whether the model outputs produced by existing jailbreaks are actually *useful*. For example, when jailbreaking a model to give instructions for building a bomb, does the jailbreak yield good instructions? Since the utility of most unsafe answers (e.g., bomb instructions) is hard to evaluate rigorously, we build new jailbreak evaluation sets with known ground truth answers, by aligning models to refuse questions related to benign and easy-to-evaluate topics (e.g., biology or math). Our evaluation of eight representative jailbreaks across five utility benchmarks reveals a consistent drop in model utility in jailbroken responses, which we term the *jailbreak tax*. For example, while all jailbreaks we tested bypass guardrails in models aligned to refuse to answer math, this comes at the expense of a drop of up to 92% in accuracy. Overall, our work proposes jailbreak utility as a new important metric in AI safety, and introduces benchmarks to evaluate existing and future jailbreaks. We make the benchmark available at https://github.com/ethz-spylab/jailbreak-tax
Poster
Lukas Fluri · Leon Lang · Alessandro Abate · Patrick Forré · David Krueger · Joar Skalse
[ East Exhibition Hall A-B ]
Abstract
In reinforcement learning, specifying reward functions that capture the intended task can be very challenging. Reward learning aims to address this issue by *learning* the reward function. However, a learned reward model may have low error on the data distribution and yet subsequently produce a policy with large regret. We say that such a reward model has an *error-regret mismatch*. The main source of error-regret mismatch is the distributional shift that commonly occurs during policy optimization. In this paper, we mathematically show that a sufficiently low expected test error of the reward model guarantees low worst-case regret, but that for any *fixed* expected test error, there exist realistic data distributions that allow error-regret mismatch to occur. We then show that similar problems persist even when using policy regularization techniques commonly employed in methods such as RLHF. We hope our results stimulate the theoretical and empirical study of improved methods to learn reward models, and of better ways to reliably measure their quality.
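In symbols, the mismatch can be stated roughly as follows (our notation, not necessarily the paper's):

```latex
% Error-regret mismatch: a reward model \hat{R} can be accurate on the data
% distribution D yet induce a high-regret policy after optimization.
\[
  \underbrace{\mathbb{E}_{(s,a)\sim D}\big[\,|\hat{R}(s,a) - R(s,a)|\,\big] \le \varepsilon}_{\text{low expected test error}}
  \qquad\text{while}\qquad
  \underbrace{J_R(\pi^*_{R}) - J_R(\pi^*_{\hat{R}})}_{\text{regret under the true reward } R}
  \ \text{can remain large,}
\]
\[
  \text{where } \pi^*_{\hat{R}} \in \arg\max_{\pi} J_{\hat{R}}(\pi)
  \text{ and } J_R(\pi) \text{ is the expected return of } \pi \text{ under } R.
\]
```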
Poster
Hengrui Lou · Zunlei Feng · Jinsong Geng · Erteng Liu · Jie Lei · Lechao Cheng · Jie Song · Mingli Song · Yijun Bei
[ East Exhibition Hall A-B ]
Abstract
With the rise of AIGC technologies, particularly diffusion models, generating highly realistic fake images that can deceive human visual perception has become feasible. Consequently, various forgery detection methods have emerged. However, existing methods treat the generation process of fake images as either a black box or an auxiliary tool, offering limited insight into its underlying mechanisms. In this paper, we propose Spatio-Temporal Distribution Fitting Deviation (STD-FD) for AIGC forgery detection, which explores the generative process in detail. By decomposing and reconstructing data within generative diffusion models, initial experiments reveal temporal distribution fitting deviations during the image reconstruction process. These deviations are captured through reconstruction noise maps for each spatial semantic unit, derived via a super-resolution algorithm. Critical discriminative patterns, termed DFactors, are identified through statistical modeling of these deviations. Extensive experiments show that STD-FD effectively captures distribution patterns in AIGC-generated data, demonstrating strong robustness and generalizability while outperforming state-of-the-art (SOTA) methods on major datasets. The source code is available at [this link](https://github.com/HengruiLou/STDFD).
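Purely to illustrate the deviation-map idea, here is a toy per-patch deviation computation. The actual pipeline works on spatial semantic units, applies a super-resolution step, and fits DFactors statistically; none of that is reproduced here:

```python
import numpy as np

def reconstruction_deviation_map(image: np.ndarray, recon: np.ndarray,
                                 patch: int = 8) -> np.ndarray:
    """Toy stand-in for a reconstruction noise map: mean absolute deviation
    between an image and its diffusion-model reconstruction, pooled over
    non-overlapping square patches.
    """
    h, w = image.shape[:2]
    noise = np.abs(image.astype(np.float32) - recon.astype(np.float32))
    if noise.ndim == 3:                       # average over color channels
        noise = noise.mean(axis=-1)
    hp, wp = h // patch, w // patch           # crop to a whole number of patches
    cropped = noise[:hp * patch, :wp * patch]
    return cropped.reshape(hp, patch, wp, patch).mean(axis=(1, 3))
```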
Poster
Arun Ganesh · Ryan McKenna · Hugh B McMahan · Adam Smith · Fan Wu
[ East Exhibition Hall A-B ]
Abstract
We initiate a study of algorithms for model training with user-level differential privacy (DP) in which each example may be attributed to multiple users, a setting we call the multi-attribution model. We first provide a carefully chosen definition of user-level DP under the multi-attribution model. Training in this model is facilitated by solving the contribution bounding problem, i.e., selecting a subset of the dataset in which each user is associated with a limited number of examples. We propose a greedy baseline algorithm for the contribution bounding problem. We then empirically study this algorithm on a synthetic logistic regression task and a transformer training task, including variants of the baseline that optimize the chosen subset using different techniques and criteria. We find that the baseline algorithm remains competitive with its variants in most settings, and we build a better understanding of the practical importance of a bias-variance tradeoff inherent in solutions to the contribution bounding problem.
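One simple greedy realization of contribution bounding, under our own assumptions about the data layout (the paper's baseline and its variants may differ in ordering and selection criteria):

```python
from collections import defaultdict

def greedy_contribution_bounding(examples, max_per_user: int):
    """Scan the dataset once and keep an example only if none of its
    attributed users has already hit the per-user budget.

    examples: iterable of (example_id, user_ids) pairs, where user_ids is the
        set of users the example is attributed to (the multi-attribution model).
    """
    counts = defaultdict(int)
    kept = []
    for example_id, user_ids in examples:
        if all(counts[u] < max_per_user for u in user_ids):
            kept.append(example_id)
            for u in user_ids:
                counts[u] += 1
    return kept

# With budget 1, the second example (sharing user "a") is dropped:
# greedy_contribution_bounding([(0, {"a"}), (1, {"a", "b"}), (2, {"b"})], 1)
# -> [0, 2]
```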
Poster
Jonathan Scott · Christoph Lampert · David Saulpic
[ East Exhibition Hall A-B ]
Abstract
Clustering is a cornerstone of data analysis, particularly suited to identifying coherent subgroups or substructures in unlabeled data, which is now generated continuously and in large amounts. However, traditional clustering methods are often not applicable, because data are increasingly produced and stored in a distributed way, e.g., on edge devices, and privacy concerns prevent them from being transferred to a central server. To address this challenge, we present FedDP-KMeans, a new algorithm for $k$-means clustering that is fully federated as well as differentially private. Our approach leverages (potentially small and out-of-distribution) server-side data to overcome the primary challenge for differentially private clustering methods: the need for a good initialization. Combining our initialization with a simple federated DP-Lloyd's algorithm, we obtain an algorithm that achieves excellent results on synthetic and real-world benchmark tasks. We also provide a theoretical analysis of our method, with bounds on the convergence speed and cluster-identification success.
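To make the DP-Lloyd's component concrete, a minimal single-step sketch under our own assumptions follows: Gaussian noise on per-cluster sums and counts, with the noise calibration to $(\varepsilon, \delta)$ and the federated secure aggregation left to the caller:

```python
import numpy as np

def dp_lloyd_step(points, centers, sigma, rng=None):
    """One differentially private Lloyd's update (Gaussian mechanism sketch).

    points: (n, d) array; centers: (k, d) array; sigma: noise scale, which
    must be calibrated to the privacy budget and the data's norm bound.
    """
    rng = rng or np.random.default_rng()
    k, d = centers.shape
    # assign each point to its nearest center
    assign = np.argmin(((points[:, None, :] - centers[None]) ** 2).sum(-1), axis=1)
    new_centers = np.empty_like(centers)
    for j in range(k):
        members = points[assign == j]
        noisy_sum = members.sum(axis=0) + rng.normal(0, sigma, size=d)
        noisy_count = max(len(members) + rng.normal(0, sigma), 1.0)
        new_centers[j] = noisy_sum / noisy_count
    return new_centers
```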
Spotlight Poster
Jianqing Zhang · Yang Liu · Jie Fu · Yang Hua · Tianyuan Zou · Jian Cao · Qiang Yang
[ East Exhibition Hall A-B ]
Abstract
The rise of generative APIs has fueled interest in privacy-preserving synthetic data generation. While the Private Evolution (PE) algorithm generates differentially private (DP) synthetic images using diffusion model APIs, it struggles with few-shot private data due to the limitations of its DP-protected similarity voting approach. In practice, the few-shot private data challenge is particularly prevalent in specialized domains like healthcare and industry. To address it, we propose a novel API-assisted algorithm, Private Contrastive Evolution (PCEvolve), which iteratively mines the inherent inter-class contrastive relationships in few-shot private data, beyond individual data points, and seamlessly integrates them into an adapted Exponential Mechanism (EM) to optimize utility under DP in an evolution loop. We conduct extensive experiments on four specialized datasets, demonstrating that PCEvolve outperforms PE and other API-assisted baselines. These results highlight the potential of leveraging API access with private data for quality evaluation, enabling the generation of high-quality DP synthetic images and paving the way for more accessible and effective privacy-preserving generative API applications. Our code is available at https://github.com/TsingZ0/PCEvolve.
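For reference, the textbook Exponential Mechanism that PCEvolve adapts can be sketched as below; the contrastive scoring the paper plugs into it is not shown:

```python
import numpy as np

def exponential_mechanism(scores, epsilon, sensitivity=1.0, rng=None):
    """Sample a candidate index with probability proportional to
    exp(epsilon * score / (2 * sensitivity)) -- the standard EM primitive.
    """
    rng = rng or np.random.default_rng()
    scores = np.asarray(scores, dtype=np.float64)
    logits = epsilon * scores / (2.0 * sensitivity)
    logits -= logits.max()                 # subtract max for numerical stability
    probs = np.exp(logits)
    probs /= probs.sum()
    return rng.choice(len(scores), p=probs)
```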
Poster
Zhengyi Li · Yue Guan · Kang Yang · Yu Feng · Ning Liu · Yu Yu · Jingwen Leng · Minyi Guo
[ East Exhibition Hall A-B ]
Abstract
The wide deployment of the generative pre-trained transformer (GPT) has raised privacy concerns for both clients and servers. While cryptographic primitives can be employed for secure GPT inference to protect the privacy of both parties, they introduce considerable performance overhead. To accelerate secure inference, this study proposes a public decoding and secure verification approach that utilizes public GPT models, motivated by the observation that securely decoding one token and securely decoding multiple tokens incur similar latency. The client uses the public model to generate a set of tokens, which are then securely verified by the private model for acceptance. The efficiency of our approach depends on the acceptance ratio of tokens proposed by the public model, which we improve in two ways: (1) a private sampling protocol optimized for cryptographic primitives and (2) model alignment using knowledge distillation. Our approach improves the efficiency of secure decoding while maintaining the same level of privacy and generation quality as standard secure decoding. Experiments demonstrate a $2.1\times \sim 6.0\times$ speedup compared to standard decoding across three pairs of public-private models and different network conditions.
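The decode-then-verify loop has the same shape as speculative decoding; a plain-Python sketch under stated assumptions follows. In the actual protocol the private model's check runs under cryptographic primitives and verifies the whole draft in roughly the latency of one token; here verification is unrolled token-by-token for clarity, both models are assumed to expose a hypothetical `greedy_next(tokens) -> token` interface, and the paper's private sampling protocol is replaced by greedy decoding:

```python
def public_draft_secure_verify(public_model, private_model, prefix, k=4, max_new=64):
    """Draft k tokens cheaply with the public model, then accept the longest
    prefix that the private model agrees with; fall back to one privately
    decoded token on a mismatch."""
    tokens = list(prefix)
    while len(tokens) < len(prefix) + max_new:
        draft = []
        for _ in range(k):                       # client: cheap public drafting
            draft.append(public_model.greedy_next(tokens + draft))
        accepted = 0
        for t in draft:                          # server: accept matching prefix
            if private_model.greedy_next(tokens) == t:
                tokens.append(t)
                accepted += 1
            else:
                break
        if accepted < len(draft):                # fall back to one secure token
            tokens.append(private_model.greedy_next(tokens))
    return tokens
```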
Poster
David Geissbühler · Hatef Otroshi Shahreza · Sébastien Marcel
[ East Exhibition Hall A-B ]
Abstract
Face recognition models are trained on large-scale datasets that raise privacy and ethical concerns. Lately, the use of synthetic data to complement or replace genuine data for training face recognition models has been proposed. While promising results have been obtained, it remains unclear whether generative models can yield sufficiently diverse data for such tasks. In this work, we introduce a new method, inspired by the physical motion of soft particles subjected to stochastic Brownian forces, that allows us to sample identity distributions in a latent space under various constraints. We introduce three complementary algorithms, called Langevin, Dispersion, and DisCo, aimed at generating large synthetic face datasets. With these in hand, we generate several face datasets and benchmark them by training face recognition models, showing that data generated with our method exceeds the performance of previous GAN-based datasets and achieves competitive performance with state-of-the-art diffusion-based synthetic datasets. While diffusion models have been shown to memorize training data, we prevent leakage in our new synthetic datasets, paving the way for more responsible synthetic datasets. Project page: https://www.idiap.ch/paper/synthetics-disco
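A toy numerical sketch of the particle picture, with our own illustrative forces and constants rather than the paper's Langevin, Dispersion, or DisCo algorithms: identities (rows of `z`) diffuse under Brownian noise while a pairwise repulsion term spreads them over the latent sphere.

```python
import numpy as np

def langevin_disperse(z, steps=200, step_size=1e-2, repulsion=1.0, rng=None):
    """Langevin-style update with soft pairwise repulsion on the unit sphere.

    z: (n, d) array of identity latents, assumed roughly unit-norm.
    """
    rng = rng or np.random.default_rng()
    z = z.copy()
    for _ in range(steps):
        diff = z[:, None, :] - z[None, :, :]              # pairwise differences
        dist2 = (diff ** 2).sum(-1, keepdims=True) + 1e-8  # avoid divide-by-zero
        force = repulsion * (diff / dist2).sum(axis=1)    # soft repulsion force
        noise = rng.normal(0, np.sqrt(2 * step_size), size=z.shape)
        z += step_size * force + noise                    # Langevin-style step
        z /= np.linalg.norm(z, axis=1, keepdims=True)     # project to the sphere
    return z
```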