ICML 2021 Tuesday 07/20

Timezone: Europe/Vienna

Social

LatinX in AI Social

Maria Luisa Santiago · Miguel Alonso Jr · William Berrios

2:00 AM - 4:00 AM

In January 2018, leaders from academia and industry in Artificial Intelligence, Education, Research, Finance, Community and Social Impact Nonprofits banded together to launch a group focused on “Creating Opportunity for LatinX in AI.”

Artificial Intelligence has the potential to displace workers from marginalized populations, including those of LatinX origin. AI is already perpetuating social bias and prejudice because LatinX professionals are underrepresented in the AI industry. Machine learning algorithms can encode a discriminatory bias during training on real-world data in which underrepresented groups are not properly characterized or represented. A question quickly emerges: how can we make sure Machine Learning does not discriminate against people from minority groups because of the color of their skin, gender, ethnicity, or historically unbalanced power structures in society?

Moreover, because the tech industry does not represent the entire population, underrepresented populations in computing such as Hispanics, women, African-Americans, and Native Americans have limited control over the direction of machine learning breakthroughs. As an ethnicity, the LatinX population is an interesting case study for this research, as its members span all skin tones and have a wide regional distribution across the world.

In this session, we claim that it is our responsibility to advance machine learning by increasing the presence of members of our minority group who are able to build solutions and algorithms, moving the field in a direction in which AI is used to solve problems in our communities while bias and unfairness are addressed. As the number of Hispanic and LatinX-identifying AI practitioners increases, it is also imperative that we are able to share our work at international AI and Machine Learning conferences, which yield new opportunities for collaboration, funding, and job prospects we would not otherwise have.

Affinity Poster Session

LatinX in AI, Queer in AI, WiML - Joint Poster Session

3:00 AM - 5:00 AM

Gathertown link coming soon.

Tutorial

Privacy in learning: Basics and the interplay

Huishuai Zhang · Wei Chen

5:00 AM - 8:00 AM

In the real world, more and more customers view privacy as a concern when using an AI service, especially when the customer content consists of sensitive data. Recent research demonstrates that large language models such as GPT-2 can memorize content, which can then be extracted by an adversary. This poses a high privacy risk in deployed scenarios when models are trained on customer data. Differential privacy is widely recognized as a gold standard of privacy protection due to its mathematical rigor. To alleviate privacy concerns in machine learning, many research works have studied machine learning with differential privacy guarantees. It is time to clarify the challenges and opportunities of learning with differential privacy. In this tutorial, we first describe the potential privacy risks in machine learning models and introduce the background of differential privacy, then present the popular approaches to guaranteeing differential privacy in machine learning. In the rest of the tutorial, we highlight the interplay between learning and privacy. In the second section, we show how to utilize properties of learning to improve the utility of private learning, focusing on recent advances that exploit the correlation across data points and the low-rank property of deep learning models. In the third section, we present the other direction of research, i.e., using the tools of differential privacy to tackle the classical generalization problem, and we also present concrete scenarios in which ideas from differential privacy are used to resist attacks in machine learning.

Tutorial

Self-Attention for Computer Vision

Aravind Srinivas · Prajit Ramachandran · Ashish Vaswani

5:00 AM - 7:45 AM

This tutorial covers the application of self-attention mechanisms in computer vision. Self-attention has been widely adopted in NLP, with the fully attentional Transformer model having largely replaced RNNs and now being used in state-of-the-art language understanding models like GPT, BERT, XLNet, T5, ELECTRA, and Meena. As a result, there has been tremendous interest in studying whether self-attention can have a similarly big and far-reaching impact in computer vision. However, vision tasks have different properties compared to language tasks, so a lot of research has been devoted to exploring the best way to apply self-attention to visual models. This tutorial will cover many of the different applications of self-attention in vision in order to give the viewer a broad and precise understanding of this subfield.

Oral

Auto-ML and Optimization

2:00 PM - 3:00 PM

7 Events in this session

BORE: Bayesian Optimization by Density-Ratio Estimation

Louis Chi-Chun Tiao · Aaron Klein · Matthias W Seeger · Edwin V Bonilla · Cedric Archambeau · Fabio Ramos

AutoSampling: Search for Effective Data Sampling Schedules

Ming Sun · Haoxuan Dou · Baopu Li · Junjie Yan · Wanli Ouyang · Lei Cui

HardCoRe-NAS: Hard Constrained diffeRentiable Neural Architecture Search

Niv Nayman · Yonathan Aflalo · Asaf Noy · Lihi Zelnik

Bias-Robust Bayesian Optimization via Dueling Bandits

Johannes Kirschner · Andreas Krause

Zeroth-Order Non-Convex Learning via Hierarchical Dual Averaging

Amélie Héliou · Matthieu Martin · Panayotis Mertikopoulos · Thibaud J Rahier

Sparsifying Networks via Subdifferential Inclusion

Sagar Verma · Jean-Christophe Pesquet

Q&A

Oral

Deep Reinforcement Learning 2

2:00 PM - 3:00 PM

7 Events in this session

Deeply-Debiased Off-Policy Interval Estimation

Chengchun Shi · Runzhe Wan · Victor Chernozhukov · Rui Song

Offline Contextual Bandits with Overparameterized Models

David Brandfonbrener · William Whitney · Rajesh Ranganath · Joan Bruna

Demonstration-Conditioned Reinforcement Learning for Few-Shot Imitation

Christopher Dance · Perez Julien · Théo Cachet

A New Representation of Successor Features for Transfer across Dissimilar Environments

Majid Abdolshah · Hung Le · Thommen Karimpanal George · Sunil Gupta · Santu Rana · Svetha Venkatesh

Preferential Temporal Difference Learning

Nishanth Anand · Doina Precup

On the Optimality of Batch Policy Optimization Algorithms

Chenjun Xiao · Yifan Wu · Jincheng Mei · Bo Dai · Tor Lattimore · Lihong Li · Csaba Szepesvari · Dale Schuurmans

Q&A

Oral

Optimal Transport

2:00 PM - 3:00 PM

8 Events in this session

Scalable Computations of Wasserstein Barycenter via Input Convex Neural Networks

Jiaojiao Fan · Amirhossein Taghvaei · Yongxin Chen

Outlier-Robust Optimal Transport

Debarghya Mukherjee · Aritra Guha · Justin Solomon · Yuekai Sun · Mikhail Yurochkin

Dataset Dynamics via Gradient Flows in Probability Space

David Alvarez-Melis · Nicolo Fusi

Sliced Iterative Normalizing Flows

Biwei Dai · Uros Seljak

Low-Rank Sinkhorn Factorization

Meyer Scetbon · Marco Cuturi · Gabriel Peyré

Unbalanced minibatch Optimal Transport; applications to Domain Adaptation

Kilian Fatras · Thibault Séjourné · Rémi Flamary · Nicolas Courty

Making transport more robust and interpretable by moving data through a small number of anchor points

Chi-Heng Lin · Mehdi Azabou · Eva Dyer

Q&A

Oral

Deep Learning Applications

2:00 PM - 3:00 PM

8 Events in this session

Attention is not all you need: pure attention loses rank doubly exponentially with depth

Yihe Dong · Jean-Baptiste Cordonnier · Andreas Loukas

Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation

Sam Devlin · Raluca Georgescu · Ida Momennejad · Jaroslaw Rzepecki · Evelyn Zuniga · Gavin Costello · Guy Leroy · Ali Shaw · Katja Hofmann

Efficient Generative Modelling of Protein Structure Fragments using a Deep Markov Model

Christian Thygesen · Christian Skjødt Steenmans · Ahmad Salim Al-Sibahi · Lys Sanz Moreta · Anders Bundgård Sørensen · Thomas Hamelryck

Exploiting structured data for learning contagious diseases under incomplete testing

Maggie Makar · Lauren R West · David C Hooper · Eric Horvitz · Erica Shenoy · John Guttag

Strategic Classification Made Practical

Sagi Levanon · Nir Rosenfeld

Large-Margin Contrastive Learning with Distance Polarization Regularizer

Shuo Chen · Gang Niu · Chen Gong · Jun Li · Jian Yang · Masashi Sugiyama

SPADE: A Spectral Method for Black-Box Adversarial Robustness Evaluation

Wuxinlin Cheng · Chenhui Deng · Zhiqiang Zhao · Yaohui Cai · Zhiru Zhang · Zhuo Feng

Q&A

Oral

Reinforcement Learning (Multi-agent)

2:00 PM - 3:00 PM

7 Events in this session

Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot

Joel Z Leibo · Edgar Duenez-Guzman · Alexander Vezhnevets · John Agapiou · Peter Sunehag · Raphael Koster · Jayd Matyas · Charles Beattie · Igor Mordatch · Thore Graepel

UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning

Tarun Gupta · Anuj Mahajan · Bei Peng · Wendelin Boehmer · Shimon Whiteson

A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning

Dong Ki Kim · Miao Liu · Matthew Riemer · Chuangchuang Sun · Marwa Abdulhai · Golnaz Habibi · Sebastian Lopez-Cot · Gerald Tesauro · Jonathan How

Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers

Luke Marris · Paul Muller · Marc Lanctot · Karl Tuyls · Thore Graepel

PC-MLP: Model-based Reinforcement Learning with Policy Cover Guided Exploration

Yuda Song · Wen Sun

Imitation by Predicting Observations

Andrew Jaegle · Yury Sulsky · Arun Ahuja · Jake Bruce · Rob Fergus · Greg Wayne

Q&A

Oral

Deep Learning Architectures

2:00 PM - 3:00 PM

8 Events in this session

Relative Positional Encoding for Transformers with Linear Complexity

Antoine Liutkus · Ondřej Cífka · Shih-Lun Wu · Umut Simsekli · Yi-Hsuan Yang · Gaël Richard

A Free Lunch From ANN: Towards Efficient, Accurate Spiking Neural Networks Calibration

Yuhang Li · Shikuang Deng · Xin Dong · Ruihao Gong · Shi Gu

A Unified Lottery Ticket Hypothesis for Graph Neural Networks

Tianlong Chen · Yongduo Sui · Xuxi Chen · Aston Zhang · Zhangyang “Atlas” Wang

Generative Adversarial Transformers

Drew A. Hudson · Larry Zitnick

Evolving Attention with Residual Convolutions

Yujing Wang · Yaming Yang · Jiangang Bai · Mingliang Zhang · Jing Bai · Jing Yu · Ce Zhang · Gao Huang · Yunhai Tong

Zoo-Tuning: Adaptive Transfer from A Zoo of Models

Yang Shu · Zhi Kou · Zhangjie Cao · Jianmin Wang · Mingsheng Long

UnICORNN: A recurrent model for learning very long time dependencies

T. Konstantin Rusch · Siddhartha Mishra

Q&A

Oral

Graph Learning

2:00 PM - 3:00 PM

8 Events in this session

Size-Invariant Graph Representations for Graph Classification Extrapolations

Beatrice Bevilacqua · Yangze Zhou · Bruno Ribeiro

Consistent Nonparametric Methods for Network Assisted Covariate Estimation

Xueyu Mao · Deepayan Chakrabarti · Purnamrita Sarkar

Explainable Automated Graph Representation Learning with Hyperparameter Importance

Xin Wang · Shuyi Fan · Kun Kuang · Wenwu Zhu

Breaking the Limits of Message Passing Graph Neural Networks

Muhammet Balcilar · Pierre Heroux · Benoit Gauzere · Pascal Vasseur · Sebastien Adam · Paul Honeine

From Local Structures to Size Generalization in Graph Neural Networks

Gilad Yehudai · Ethan Fetaya · Eli Meirom · Gal Chechik · Haggai Maron

Interpretable Stability Bounds for Spectral Graph Filters

Henry Kenlay · Dorina Thanou · Xiaowen Dong

Learning Node Representations Using Stationary Flow Prediction on Large Payment and Cash Transaction Networks

Ciwan Ceylan · Salla Franzén · Florian T. Pokorny

Q&A

Oral

Deep Reinforcement Learning 1

2:00 PM - 3:00 PM

8 Events in this session

Phasic Policy Gradient

Karl Cobbe · Jacob Hilton · Oleg Klimov · John Schulman

Reinforcement Learning with Prototypical Representations

Denis Yarats · Rob Fergus · Alessandro Lazaric · Lerrel Pinto

Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient Exploration

Seungyul Han · Youngchul Sung

Muesli: Combining Improvements in Policy Optimization

Matteo Hessel · Ivo Danihelka · Fabio Viola · Arthur Guez · Simon Schmitt · Laurent Sifre · Theophane Weber · David Silver · Hado van Hasselt

Unsupervised Learning of Visual 3D Keypoints for Control

Boyuan Chen · Pieter Abbeel · Deepak Pathak

Learning Task Informed Abstractions

Xiang Fu · Ge Yang · Pulkit Agrawal · Tommi Jaakkola

State Entropy Maximization with Random Encoders for Efficient Exploration

Younggyo Seo · Lili Chen · Jinwoo Shin · Honglak Lee · Pieter Abbeel · Kimin Lee

Q&A

Oral

Optimization (Distributed)

2:00 PM - 3:00 PM

8 Events in this session

Optimal Complexity in Decentralized Training

Yucheng Lu · Christopher De Sa

Stochastic Sign Descent Methods: New Algorithms and Better Theory

Mher Safaryan · Peter Richtarik

Bias-Variance Reduced Local SGD for Less Heterogeneous Federated Learning

Tomoya Murata · Taiji Suzuki

A Hybrid Variance-Reduced Method for Decentralized Stochastic Non-Convex Optimization

Ran Xin · Usman Khan · Soummya Kar

Asynchronous Decentralized Optimization With Implicit Stochastic Variance Reduction

Kenta Niwa · Guoqiang Zhang · W. Bastiaan Kleijn · Noboru Harada · Hiroshi Sawada · Akinori Fujino

Newton Method over Networks is Fast up to the Statistical Precision

Amir Daneshmand · Gesualdo Scutari · Pavel Dvurechenskii · Alexander Gasnikov

Federated Learning under Arbitrary Communication Patterns

Dmitrii Avdiukhin · Shiva Kasiviswanathan

Q&A

Oral

Optimization 1

3:00 PM - 4:00 PM

8 Events in this session

PAGE: A Simple and Optimal Probabilistic Gradient Estimator for Nonconvex Optimization

Zhize Li · Hongyan Bao · Xiangliang Zhang · Peter Richtarik

Projection Robust Wasserstein Barycenters

Minhui Huang · Shiqian Ma · Lifeng Lai

Efficient Message Passing for 0–1 ILPs with Binary Decision Diagrams

Jan-Hendrik Lange · Paul Swoboda

Distributionally Robust Optimization with Markovian Data

Mengmeng Li · Tobias Sutter · Daniel Kuhn

Acceleration via Fractal Learning Rate Schedules

Naman Agarwal · Surbhi Goel · Cyril Zhang

A Novel Sequential Coreset Method for Gradient Descent Algorithms

Jiawei Huang · Ruomin Huang · Wenjie Liu · Nikolaos Freris · Hu Ding

Scalable Optimal Transport in High Dimensions for Graph Distances, Embedding Alignment, and More

Johannes Gasteiger · Marten Lienen · Stephan Günnemann

Q&A

Oral

Deep Learning Theory 1

3:00 PM - 4:00 PM

8 Events in this session

Let's Agree to Degree: Comparing Graph Convolutional Networks in the Message-Passing Framework

Floris Geerts · Filip Mazowiecki · Guillermo Perez

Fundamental Tradeoffs in Distributionally Adversarial Training

Mohammad Mehrabi · Adel Javanmard · Ryan A. Rossi · Anup Rao · Tung Mai

Towards Understanding Learning in Neural Networks with Linear Teachers

Roei Sarussi · Alon Brutzkus · Amir Globerson

Continual Learning in the Teacher-Student Setup: Impact of Task Similarity

Sebastian Lee · Sebastian Goldt · Andrew Saxe

A Functional Perspective on Learning Symmetric Functions with Neural Networks

Aaron Zweig · Joan Bruna

Weisfeiler and Lehman Go Topological: Message Passing Simplicial Networks

Cristian Bodnar · Fabrizio Frasca · Yuguang Wang · Nina Otter · Guido Montufar · Pietro Lió · Michael Bronstein

On the Random Conjugate Kernel and Neural Tangent Kernel

Zhengmian Hu · Heng Huang

Q&A

Oral

Deep Generative Model 1

3:00 PM - 4:00 PM

8 Events in this session

Oops I Took A Gradient: Scalable Sampling for Discrete Distributions

Will Grathwohl · Kevin Swersky · Milad Hashemi · David Duvenaud · Chris Maddison

Multiscale Invertible Generative Networks for High-Dimensional Bayesian Inference

Shumao Zhang · Pengchuan Zhang · Thomas Hou

GraphDF: A Discrete Flow Model for Molecular Graph Generation

Youzhi Luo · Keqiang Yan · Shuiwang Ji

Hierarchical VAEs Know What They Don’t Know

Jakob D. Havtorn · Jes Frellsen · Søren Hauberg · Lars Maaløe

Order Matters: Probabilistic Modeling of Node Sequence for Graph Generation

Xiaohui Chen · Xu Han · Jiajing Hu · Francisco Ruiz · Liping Liu

Generative Video Transformer: Can Objects be the Words?

Yi-Fu Wu · Jaesik Yoon · Sungjin Ahn

Poisson-Randomised DirBN: Large Mutation is Needed in Dirichlet Belief Networks

Xuhui Fan · Bin Li · Yaqiong Li · Scott Sisson

Q&A

Oral

Deep Learning (Bayesian)

3:00 PM - 4:00 PM

8 Events in this session

What Are Bayesian Neural Network Posteriors Really Like?

Pavel Izmailov · Sharad Vikram · Matthew Hoffman · Andrew Wilson

Scalable Marginal Likelihood Estimation for Model Selection in Deep Learning

Alexander Immer · Matthias Bauer · Vincent Fortuin · Gunnar Ratsch · Khan Emtiyaz

Amortized Conditional Normalized Maximum Likelihood: Reliable Out of Distribution Uncertainty Estimation

Aurick Zhou · Sergey Levine

Deep kernel processes

Laurence Aitchison · Adam Yang · Sebastian Ober

Global inducing point variational posteriors for Bayesian neural networks and deep Gaussian processes

Sebastian Ober · Laurence Aitchison

Bayesian Deep Learning via Subnetwork Inference

Erik Daxberger · Eric Nalisnick · James Allingham · Javier Antorán · Jose Miguel Hernandez-Lobato

Generative Particle Variational Inference via Estimation of Functional Gradients

Neale Ratzlaff · Jerry Bai · Fuxin Li · Wei Xu

Q&A

Oral

Deep Learning Algorithms 1

3:00 PM - 4:00 PM

8 Events in this session

Leveraging Sparse Linear Layers for Debuggable Deep Networks

Eric Wong · Shibani Santurkar · Aleksander Madry

Voice2Series: Reprogramming Acoustic Models for Time Series Classification

Huck Yang · Yun-Yun Tsai · Pin-Yu Chen

Self-Tuning for Data-Efficient Deep Learning

Ximei Wang · Jinghan Gao · Mingsheng Long · Jianmin Wang

How Framelets Enhance Graph Neural Networks

Xuebin Zheng · Bingxin Zhou · Junbin Gao · Yuguang Wang · Pietro Lió · Ming Li · Guido Montufar

Federated Continual Learning with Weighted Inter-client Transfer

Jaehong Yoon · Wonyong Jeong · GiWoong Lee · Eunho Yang · Sung Ju Hwang

Self Normalizing Flows

T. Anderson Keller · Jorn Peters · Priyank Jaini · Emiel Hoogeboom · Patrick Forré · Max Welling

Loss Surface Simplexes for Mode Connecting Volumes and Fast Ensembling

Gregory Benton · Wesley Maddox · Sanae Lotfi · Andrew Wilson

Q&A

Oral

Deep Learning Algorithms 2

3:00 PM - 4:00 PM

8 Events in this session

Principled Simplicial Neural Networks for Trajectory Prediction

T. Mitchell Roddenberry · Nicholas Glaze · Santiago Segarra

Efficient Differentiable Simulation of Articulated Bodies

Yi-Ling Qiao · Junbang Liang · Vladlen Koltun · Ming Lin

On Monotonic Linear Interpolation of Neural Network Parameters

James Lucas · Juhan Bae · Michael Zhang · Stanislav Fort · Richard Zemel · Roger Grosse

Connecting Sphere Manifolds Hierarchically for Regularization

Damien Scieur · Youngsung Kim

Controlling Graph Dynamics with Reinforcement Learning and Graph Neural Networks

Eli Meirom · Haggai Maron · Shie Mannor · Gal Chechik

Thinking Like Transformers

Gail Weiss · Yoav Goldberg · Eran Yahav

Federated Learning of User Verification Models Without Sharing Embeddings

Hossein Hosseini · Hyunsin Park · Sungrack Yun · Christos Louizos · Joseph B Soriaga · Max Welling

Q&A

Oral

Reinforcement Learning 1

3:00 PM - 4:00 PM

8 Events in this session

Risk-Sensitive Reinforcement Learning with Function Approximation: A Debiasing Approach

Yingjie Fei · Zhuoran Yang · Zhaoran Wang

Model-Free Reinforcement Learning: from Clipped Pseudo-Regret to Sample Complexity

Zhang Zihan · Yuan Zhou · Xiangyang Ji

Neuro-algorithmic Policies Enable Fast Combinatorial Generalization

Marin Vlastelica · Michal Rolinek · Georg Martius

PID Accelerated Value Iteration Algorithm

Amir-massoud Farahmand · Mohammad Ghavamzadeh

Provably Efficient Learning of Transferable Rewards

Alberto Maria Metelli · Giorgia Ramponi · Alessandro Concetti · Marcello Restelli

Reinforcement Learning for Cost-Aware Markov Decision Processes

Wesley A Suttle · Kaiqing Zhang · Zhuoran Yang · Ji Liu · David N Kraemer

Value Alignment Verification

Daniel Brown · Jordan Schneider · Anca Dragan · Scott Niekum

Q&A

Oral

AutoML and Deep Architecture

3:00 PM - 4:00 PM

8 Events in this session

Neural Architecture Search without Training

Joe Mellor · Jack Turner · Amos Storkey · Elliot Crowley

Is Space-Time Attention All You Need for Video Understanding?

Gedas Bertasius · Heng Wang · Lorenzo Torresani

A Probabilistic Approach to Neural Network Pruning

Xin Qian · Diego Klabjan

KNAS: Green Neural Architecture Search

Jingjing Xu · Liang Zhao · Junyang Lin · Rundong Gao · Xu Sun · Hongxia Yang

Efficient Lottery Ticket Finding: Less Data is More

Zhenyu Zhang · Xuxi Chen · Tianlong Chen · Zhangyang “Atlas” Wang

ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases

Stéphane d'Ascoli · Hugo Touvron · Matthew Leavitt · Ari Morcos · Giulio Biroli · Levent Sagun

Provably Strict Generalisation Benefit for Equivariant Models

Bryn Elesedy · Sheheryar Zaidi

Q&A

Oral

Optimization (Convex) 1

3:00 PM - 4:00 PM

8 Events in this session

Variance Reduction via Primal-Dual Accelerated Dual Averaging for Nonsmooth Convex Finite-Sums

Chaobing Song · Stephen Wright · Jelena Diakonikolas

Dueling Convex Optimization

Aadirupa Saha · Tomer Koren · Yishay Mansour

Global Optimality Beyond Two Layers: Training Deep ReLU Networks via Convex Programs

Tolga Ergen · Mert Pilanci

Parameter-free Locally Accelerated Conditional Gradients

Alejandro Carderera · Jelena Diakonikolas · Cheuk Yin Lin · Sebastian Pokutta

Principal Component Hierarchy for Sparse Quadratic Programs

Robbie Vreugdenhil · Viet Anh Nguyen · Armin Eftekhari · Peyman Mohajerin Esfahani

One-sided Frank-Wolfe algorithms for saddle problems

Vladimir Kolmogorov · Thomas Pock

ConvexVST: A Convex Optimization Approach to Variance-stabilizing Transformation

Mengfan Wang · Boyu Lyu · Guoqiang Yu

Q&A

Oral

Deep Learning Algorithms 3

4:00 PM - 5:00 PM

8 Events in this session

ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision

Wonjae Kim · Bokyung Son · Ildoo Kim

Learning Curves for Analysis of Deep Networks

Derek Hoiem · Tanmay Gupta · Zhizhong Li · Michal Shlapentokh-Rothman

GLSearch: Maximum Common Subgraph Detection via Learning to Search

Yunsheng Bai · Derek Xu · Yizhou Sun · Wei Wang

Learning Intra-Batch Connections for Deep Metric Learning

Jenny Seidenschwarz · Ismail Elezi · Laura Leal-Taixé

Simultaneous Similarity-based Self-Distillation for Deep Metric Learning

Karsten Roth · Timo Milbich · Bjorn Ommer · Joseph Paul Cohen · Marzyeh Ghassemi

Unifying Vision-and-Language Tasks via Text Generation

Jaemin Cho · Jie Lei · Hao Tan · Mohit Bansal

DeepWalking Backwards: From Embeddings Back to Graphs

Sudhanshu Chanpuriya · Cameron Musco · Konstantinos Sotiropoulos · Charalampos Tsourakakis

Q&A

Oral

Reinforcement Learning and Planning 2

4:00 PM - 5:00 PM

8 Events in this session

Skill Discovery for Exploration and Planning using Deep Skill Graphs

Akhil Bagaria · Jason Senthil · George Konidaris

Learning Routines for Effective Off-Policy Reinforcement Learning

Edoardo Cetin · Oya Celiktutan

PODS: Policy Optimization via Differentiable Simulation

Miguel Angel Zamora Mora · Momchil Peychev · Sehoon Ha · Martin Vechev · Stelian Coros

Learning and Planning in Complex Action Spaces

Thomas Hubert · Julian Schrittwieser · Ioannis Antonoglou · Mohammadamin Barekatain · Simon Schmitt · David Silver

Model-Based Reinforcement Learning via Latent-Space Collocation

Oleh Rybkin · Chuning Zhu · Anusha Nagabandi · Kostas Daniilidis · Igor Mordatch · Sergey Levine

Vector Quantized Models for Planning

Sherjil Ozair · Yazhe Li · Ali Razavi · Ioannis Antonoglou · Aäron van den Oord · Oriol Vinyals

LTL2Action: Generalizing LTL Instructions for Multi-Task RL

Pashootan Vaezipoor · Andrew C Li · Rodrigo A Toro Icarte · Sheila McIlraith

Q&A

Oral

Deep Learning 1

4:00 PM - 5:00 PM

8 Events in this session

Not All Memories are Created Equal: Learning to Forget by Expiring

Sainbayar Sukhbaatar · Da Ju · Spencer Poff · Stephen Roller · Arthur Szlam · Jason Weston · Angela Fan

Learning Bounds for Open-Set Learning

Zhen Fang · Jie Lu · Anjin Liu · Feng Liu · Guangquan Zhang

Perceiver: General Perception with Iterative Attention

Andrew Jaegle · Felix Axel Gimeno Gil · Andy Brock · Oriol Vinyals · Andrew Zisserman · Joao Carreira

Synthesizer: Rethinking Self-Attention for Transformer Models

Yi Tay · Dara Bahri · Don Metzler · Da-Cheng Juan · Zhe Zhao · Che Zheng

Slot Machines: Discovering Winning Combinations of Random Weights in Neural Networks

Maxwell M Aladago · Lorenzo Torresani

What's in the Box? Exploring the Inner Life of Neural Networks with Robust Rules

Jonas Fischer · Anna Olah · Jilles Vreeken

Neural-Pull: Learning Signed Distance Function from Point clouds by Learning to Pull Space onto Surface

Baorui Ma · Zhizhong Han · Yushen Liu · Matthias Zwicker

Q&A

Oral

Optimization 2

4:00 PM - 5:00 PM

8 Events in this session

Stability and Convergence of Stochastic Gradient Clipping: Beyond Lipschitz Continuity and Smoothness

Vien Mai · Mikael Johansson

Positive-Negative Momentum: Manipulating Stochastic Gradient Noise to Improve Generalization

Zeke Xie · Li Yuan · Zhanxing Zhu · Masashi Sugiyama

Variational Data Assimilation with a Learned Inverse Observation Operator

Thomas Frerix · Dmitrii Kochkov · Jamie Smith · Daniel Cremers · Michael Brenner · Stephan Hoyer

Fast Projection Onto Convex Smooth Constraints

Ilnura Usmanova · Maryam Kamgarpour · Andreas Krause · Kfir Levy

Decomposable Submodular Function Minimization via Maximum Flow

Kyriakos Axiotis · Adam Karczmarz · Anish Mukherjee · Piotr Sankowski · Adrian Vladu

Multiplicative Noise and Heavy Tails in Stochastic Optimization

Liam Hodgkinson · Michael Mahoney

Distributed Second Order Methods with Fast Rates and Compressed Communication

Rustem Islamov · Xun Qian · Peter Richtarik

Q&A

Oral

Reinforcement Learning and Planning 1

4:00 PM - 5:00 PM

7 Events in this session

World Model as a Graph: Learning Latent Landmarks for Planning

Lunjun Zhang · Ge Yang · Bradly Stadie

Revisiting Rainbow: Promoting more insightful and inclusive deep reinforcement learning research

Johan Obando Ceron · Pablo Samuel Castro

Deep Reinforcement Learning amidst Continual Structured Non-Stationarity

Annie Xie · James Harrison · Chelsea Finn

Offline Reinforcement Learning with Pseudometric Learning

Robert Dadashi · Shideh Rezaeifar · Nino Vieillard · Léonard Hussenot · Olivier Pietquin · Matthieu Geist

EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL

Seyed Kamyar Seyed Ghasemipour · Dale Schuurmans · Shixiang Gu

Decision-Making Under Selective Labels: Optimal Finite-Domain Policies and Beyond

Dennis Wei

Q&A

Oral

Deep Learning Algorithms 4

4:00 PM - 5:00 PM

8 Events in this session

Directional Graph Networks

Dominique Beaini · Saro Passaro · Vincent Létourneau · Will Hamilton · Gabriele Corso · Pietro Lió

Winograd Algorithm for AdderNet

Wenshuo Li · Hanting Chen · Mingqiang Huang · Xinghao Chen · Chunjing Xu · Yunhe Wang

LieTransformer: Equivariant Self-Attention for Lie Groups

Michael Hutchinson · Charline Le Lan · Sheheryar Zaidi · Emilien Dupont · Yee-Whye Teh · Hyunjik Kim

"Hey, that's not an ODE": Faster ODE Adjoints via Seminorms

Patrick Kidger · Ricky T. Q. Chen · Terry Lyons

Graph Mixture Density Networks

Federico Errica · Davide Bacciu · Alessio Micheli

Momentum Residual Neural Networks

Michael Sander · Pierre Ablin · Mathieu Blondel · Gabriel Peyré

Better Training using Weight-Constrained Stochastic Dynamics

Benedict Leimkuhler · Tiffany Vlaar · Timothée Pouchon · Amos Storkey

Q&A

Oral

Reinforcement Learning 2

4:00 PM - 5:00 PM

8 Events in this session

Coach-Player Multi-agent Reinforcement Learning for Dynamic Team Composition

Bo Liu · Qiang Liu · Peter Stone · Animesh Garg · Yuke Zhu · Anima Anandkumar

Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning

Anuj Mahajan · Mikayel Samvelyan · Lei Mao · Viktor Makoviychuk · Animesh Garg · Jean Kossaifi · Shimon Whiteson · Yuke Zhu · Anima Anandkumar

A New Formalism, Method and Open Issues for Zero-Shot Coordination

Johannes Treutlein · Michael Dennis · Caspar Oesterheld · Jakob Foerster

Targeted Data Acquisition for Evolving Negotiation Agents

Minae Kwon · Siddharth Karamcheti · Mariano-Florentino Cuellar · Dorsa Sadigh

Inverse Constrained Reinforcement Learning

Shehryar Malik · Usman Anwar · Alireza Aghasi · Ali Ahmed

Counterfactual Credit Assignment in Model-Free Reinforcement Learning

Thomas Mesnard · Theophane Weber · Fabio Viola · Shantanu Thakoor · Alaa Saade · Anna Harutyunyan · Will Dabney · Thomas Stepleton · Nicolas Heess · Arthur Guez · Eric Moulines · Marcus Hutter · Lars Buesing · Remi Munos

Interactive Learning from Activity Description

Khanh Nguyen · Dipendra Misra · Robert Schapire · Miroslav Dudik · Patrick Shafto

Q&A

Oral

Deep Generative Model 2

4:00 PM - 5:00 PM

8 Events in this session

Spectral Smoothing Unveils Phase Transitions in Hierarchical Variational Autoencoders

Adeel Pervez · Efstratios Gavves

Riemannian Convex Potential Maps

Samuel Cohen · Brandon Amos · Yaron Lipman

Autoencoding Under Normalization Constraints

Sangwoong Yoon · Yung-Kyun Noh · Frank Chongwoo Park

PixelTransformer: Sample Conditioned Signal Generation

Shubham Tulsiani · Abhinav Gupta

Generative Adversarial Networks for Markovian Temporal Dynamics: Stochastic Continuous Data Generation

Sung Woo Park · Dong Wook Shu · Junseok Kwon

Autoencoder Image Interpolation by Shaping the Latent Space

Alon Oring · Zohar Yakhini · Yacov Hel-Or

Improved Denoising Diffusion Probabilistic Models

Alexander Nichol · Prafulla Dhariwal

Q&A

Oral

AutoML and Neural Network Architectures 1

4:00 PM - 5:00 PM

7 Events in this session

OmniNet: Omnidirectional Representations from Transformers

Yi Tay · Mostafa Dehghani · Vamsi Aribandi · Jai Gupta · Philip Pham · Zhen Qin · Dara Bahri · Da-Cheng Juan · Don Metzler

Boosting the Throughput and Accelerator Utilization of Specialized CNN Inference Beyond Increasing Batch Size

Jack Kosaian · Amar Phanishayee · Matthai Philipose · Debadeepta Dey · Rashmi Vinayak

E(n) Equivariant Graph Neural Networks

Victor Garcia Satorras · Emiel Hoogeboom · Max Welling

Grid-Functioned Neural Networks

Javier Dehesa · Andrew Vidler · Julian Padget · Christof Lutteroth

MSA Transformer

Roshan Rao · Jason Liu · Robert Verkuil · Joshua Meier · John Canny · Pieter Abbeel · Tom Sercu · Alexander Rives

Parallelizing Legendre Memory Unit Training

Narsimha Reddy Chilkuri · Chris Eliasmith

Q&A

Social

Lapsed Physicists Wine-and-Cheese

Jennifer Hobbs · Sujoy Ganguly

4:00 PM - 6:00 PM

"Lapsed" (a.k.a. former) physicists are plentiful in the machine learning community. Inspired by Wine and Cheese seminars at many institutions, this BYOWC (Bring Your Own Wine and Cheese) event is an informal opportunity to connect with members of the community. Hear how others made the transition between fields. Discuss how your physics training prepared you to switch fields or what synergies between physics and machine learning excite you the most. Share your favorite physics jokes your computer science colleagues don't get, and just meet other cool people. Open to everyone, not only physicists; you'll just have to tolerate our humor. Wine and cheese encouraged, but not required.


Affinity Workshop

Queer in AI Workshop

Arjun Subramonian · Sharvani Jha · Vishakha Agrawal · Juan Pajaro Velasquez · MaryLena Bleile · Michelle Julia Ng

5:00 PM - 2:00 AM

Queer in AI’s demographic survey reveals that most queer scientists in our community do not feel completely welcome in conferences and their work environments, with the main reasons being a lack of queer community and role models. Over the past years, Queer in AI has worked towards these goals, yet we have observed that the voices of marginalized queer communities - especially transgender, non-binary folks and queer BIPOC folks - have been neglected. The purpose of this workshop is to highlight issues that these communities face by featuring talks and panel discussions on the inclusion of non-Western non-binary identities; and Black, Indigenous, and Pacific Islander non-cis folks.


Invited Talk

Rethinking Drug Discovery in the Era of Digital Biology

Daphne Koller

5:00 PM - 6:00 PM

Modern medicine has given us effective tools to treat some of the most significant and burdensome diseases. At the same time, it is becoming consistently more challenging and more expensive to develop new therapeutics. A key factor in this trend is that the drug development process involves multiple steps, each of which involves a complex and protracted experiment that often fails. We believe that, for many of these phases, it is possible to develop machine learning models to help predict the outcome of these experiments, and that those models, while inevitably imperfect, can outperform predictions based on traditional heuristics. To achieve this goal, we are bringing together high-quality data from human cohorts, while also developing cutting-edge methods in high-throughput biology and chemistry that can produce massive amounts of in vitro data relevant to human disease and therapeutic interventions. Those are then used to train machine learning models that make predictions about novel targets, coherent patient segments, and the clinical effect of molecules. Our ultimate goal is to develop a new approach to drug development that uses high-quality data and ML models to design novel, safe, and effective therapies that help more people, faster, and at a lower cost.


Speaker Bio

Daphne Koller is CEO and Founder of insitro, a machine-learning enabled drug discovery company transforming the way drugs are discovered and delivered to patients. She is the co-founder of online education platform Engageli and of Coursera, the largest platform for massive open online courses (MOOCs), where she was co-CEO and President. Daphne was the Rajeev Motwani Professor of Computer Science at Stanford University, where she served on the faculty for 18 years. She has also been Chief Computing Officer of Calico, an Alphabet company in the healthcare space. She is the author of over 200 refereed publications appearing in venues such as Science, Cell, and Nature Genetics. Daphne was recognized as one of TIME Magazine's 100 most influential people in 2012 and Newsweek's 10 most important people in 2010. She has been honored with multiple awards and fellowships during her career including the Sloan Foundation Faculty Fellowship in 1996, the ONR Young Investigator Award in 1998, the Presidential Early Career Award for Scientists and Engineers (PECASE) in 1999, the IJCAI Computers and Thought Award in 2001, the MacArthur Foundation Fellowship in 2004, and the ACM Prize in Computing in 2008. Daphne was inducted into the National Academy of Engineering in 2011 and elected a fellow of the American Association for Artificial Intelligence in 2004, the American Academy of Arts and Sciences in 2014 and of the International Society of Computational Biology in 2017. Her teaching was recognized via the Stanford Medal for Excellence in Fostering Undergraduate Research, and as a Bass University Fellow in Undergraduate Education.


Social

The ICML Debate: Should AI Research and Development Be Controlled by a Regulatory Body or Government Oversight?

Yunpeng Li · Olga Isupova · Nika Haghtalab · Adam White · Diego Granziol

6:00 PM - 7:30 PM

Come and watch experts debate whether AI research and development should be controlled by a regulatory body or government oversight, with Charles Isbell (Georgia Tech), Michael Kearns (UPenn), Rich Sutton (Alberta), Steve Roberts (Oxford), Ti John (Finnish Center for Artificial Intelligence / Aalto), Suchi Saria (Johns Hopkins), Shakir Mohamed (DeepMind), Martha White (Alberta).

AI has found its way into our everyday life, from healthcare to customs control and from credit checks to autonomous driving. Its power is growing continuously, and it is gradually becoming easier for organisations and individuals to access. This leads naturally to the question of the debate.

Enjoy an entertaining social event with 8 leading AI/ML academics and researchers debating the topic in the British Parliamentary style. You are welcome to tell us your opinion on the topic before the debate poll. We will also host live votes right before and after the debate to see whether you are convinced by our debaters. Do join us for an unmatched, fun, and thought-provoking Social.


Poster

Poster Session 1

6:00 PM - 8:00 PM

189 Events in this session

A Functional Perspective on Learning Symmetric Functions with Neural Networks

Aaron Zweig · Joan Bruna

Amortized Conditional Normalized Maximum Likelihood: Reliable Out of Distribution Uncertainty Estimation

Aurick Zhou · Sergey Levine

Asynchronous Decentralized Optimization With Implicit Stochastic Variance Reduction

Kenta Niwa · Guoqiang Zhang · W. Bastiaan Kleijn · Noboru Harada · Hiroshi Sawada · Akinori Fujino

Bayesian Deep Learning via Subnetwork Inference

Erik Daxberger · Eric Nalisnick · James Allingham · Javier Antorán · Jose Miguel Hernandez-Lobato

Breaking the Limits of Message Passing Graph Neural Networks

Muhammet Balcilar · Pierre Heroux · Benoit Gauzere · Pascal Vasseur · Sebastien Adam · Paul Honeine

Coach-Player Multi-agent Reinforcement Learning for Dynamic Team Composition

Bo Liu · Qiang Liu · Peter Stone · Animesh Garg · Yuke Zhu · Anima Anandkumar

Connecting Sphere Manifolds Hierarchically for Regularization

Damien Scieur · Youngsung Kim

Consistent Nonparametric Methods for Network Assisted Covariate Estimation

Xueyu Mao · Deepayan Chakrabarti · Purnamrita Sarkar

Decentralized Riemannian Gradient Descent on the Stiefel Manifold

Shixiang Chen · Alfredo Garcia · Mingyi Hong · Shahin Shahrampour

Decision-Making Under Selective Labels: Optimal Finite-Domain Policies and Beyond

Dennis Wei

Decoupling Value and Policy for Generalization in Reinforcement Learning

Roberta Raileanu · Rob Fergus

Demonstration-Conditioned Reinforcement Learning for Few-Shot Imitation

Christopher Dance · Perez Julien · Théo Cachet

Directional Graph Networks

Dominique Beaini · Saro Passaro · Vincent Létourneau · Will Hamilton · Gabriele Corso · Pietro Lió

Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient Exploration

Seungyul Han · Youngchul Sung

Dueling Convex Optimization

Aadirupa Saha · Tomer Koren · Yishay Mansour

Efficient Differentiable Simulation of Articulated Bodies

Yi-Ling Qiao · Junbang Liang · Vladlen Koltun · Ming Lin

Fast Projection Onto Convex Smooth Constraints

Ilnura Usmanova · Maryam Kamgarpour · Andreas Krause · Kfir Levy

Fundamental Tradeoffs in Distributionally Adversarial Training

Mohammad Mehrabi · Adel Javanmard · Ryan A. Rossi · Anup Rao · Tung Mai

Generative Adversarial Transformers

Drew A. Hudson · Larry Zitnick

Generative Particle Variational Inference via Estimation of Functional Gradients

Neale Ratzlaff · Jerry Bai · Fuxin Li · Wei Xu

Generative Video Transformer: Can Objects be the Words?

Yi-Fu Wu · Jaesik Yoon · Sungjin Ahn

Global Optimality Beyond Two Layers: Training Deep ReLU Networks via Convex Programs

Tolga Ergen · Mert Pilanci

GLSearch: Maximum Common Subgraph Detection via Learning to Search

Yunsheng Bai · Derek Xu · Yizhou Sun · Wei Wang

Graph Mixture Density Networks

Federico Errica · Davide Bacciu · Alessio Micheli

Grid-Functioned Neural Networks

Javier Dehesa · Andrew Vidler · Julian Padget · Christof Lutteroth

Hierarchical VAEs Know What They Don’t Know

Jakob D. Havtorn · Jes Frellsen · Søren Hauberg · Lars Maaløe

Interactive Learning from Activity Description

Khanh Nguyen · Dipendra Misra · Robert Schapire · Miroslav Dudik · Patrick Shafto

Interpretable Stability Bounds for Spectral Graph Filters

Henry Kenlay · Dorina Thanou · Xiaowen Dong

Inverse Constrained Reinforcement Learning

Shehryar Malik · Usman Anwar · Alireza Aghasi · Ali Ahmed

Learning Bounds for Open-Set Learning

Zhen Fang · Jie Lu · Anjin Liu · Feng Liu · Guangquan Zhang

Learning Curves for Analysis of Deep Networks

Derek Hoiem · Tanmay Gupta · Zhizhong Li · Michal Shlapentokh-Rothman

Learning Intra-Batch Connections for Deep Metric Learning

Jenny Seidenschwarz · Ismail Elezi · Laura Leal-Taixé

Learning Node Representations Using Stationary Flow Prediction on Large Payment and Cash Transaction Networks

Ciwan Ceylan · Salla Franzén · Florian T. Pokorny

Learning Task Informed Abstractions

Xiang Fu · Ge Yang · Pulkit Agrawal · Tommi Jaakkola

Let's Agree to Degree: Comparing Graph Convolutional Networks in the Message-Passing Framework

Floris Geerts · Filip Mazowiecki · Guillermo Perez

LTL2Action: Generalizing LTL Instructions for Multi-Task RL

Pashootan Vaezipoor · Andrew C Li · Rodrigo A Toro Icarte · Sheila McIlraith

Model-Free Reinforcement Learning: from Clipped Pseudo-Regret to Sample Complexity

Zhang Zihan · Yuan Zhou · Xiangyang Ji

Momentum Residual Neural Networks

Michael Sander · Pierre Ablin · Mathieu Blondel · Gabriel Peyré

Muesli: Combining Improvements in Policy Optimization

Matteo Hessel · Ivo Danihelka · Fabio Viola · Arthur Guez · Simon Schmitt · Laurent Sifre · Theophane Weber · David Silver · Hado van Hasselt

Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers

Luke Marris · Paul Muller · Marc Lanctot · Karl Tuyls · Thore Graepel

Multiplicative Noise and Heavy Tails in Stochastic Optimization

Liam Hodgkinson · Michael Mahoney

Not All Memories are Created Equal: Learning to Forget by Expiring

Sainbayar Sukhbaatar · Da Ju · Spencer Poff · Stephen Roller · Arthur Szlam · Jason Weston · Angela Fan

On the Random Conjugate Kernel and Neural Tangent Kernel

Zhengmian Hu · Heng Huang

Oops I Took A Gradient: Scalable Sampling for Discrete Distributions

Will Grathwohl · Kevin Swersky · Milad Hashemi · David Duvenaud · Chris Maddison

Order Matters: Probabilistic Modeling of Node Sequence for Graph Generation

Xiaohui Chen · Xu Han · Jiajing Hu · Francisco Ruiz · Liping Liu

PAGE: A Simple and Optimal Probabilistic Gradient Estimator for Nonconvex Optimization

Zhize Li · Hongyan Bao · Xiangliang Zhang · Peter Richtarik

Parameter-free Locally Accelerated Conditional Gradients

Alejandro Carderera · Jelena Diakonikolas · Cheuk Yin Lin · Sebastian Pokutta

PID Accelerated Value Iteration Algorithm

Amir-massoud Farahmand · Mohammad Ghavamzadeh

PixelTransformer: Sample Conditioned Signal Generation

Shubham Tulsiani · Abhinav Gupta

Poisson-Randomised DirBN: Large Mutation is Needed in Dirichlet Belief Networks

Xuhui Fan · Bin Li · Yaqiong Li · Scott Sisson

Positive-Negative Momentum: Manipulating Stochastic Gradient Noise to Improve Generalization

Zeke Xie · Li Yuan · Zhanxing Zhu · Masashi Sugiyama

Projection Robust Wasserstein Barycenters

Minhui Huang · Shiqian Ma · Lifeng Lai

Provably Efficient Learning of Transferable Rewards

Alberto Maria Metelli · Giorgia Ramponi · Alessandro Concetti · Marcello Restelli

Provably Strict Generalisation Benefit for Equivariant Models

Bryn Elesedy · Sheheryar Zaidi

Reinforcement Learning with Prototypical Representations

Denis Yarats · Rob Fergus · Alessandro Lazaric · Lerrel Pinto

Scalable Computations of Wasserstein Barycenter via Input Convex Neural Networks

Jiaojiao Fan · Amirhossein Taghvaei · Yongxin Chen

Scalable Marginal Likelihood Estimation for Model Selection in Deep Learning

Alexander Immer · Matthias Bauer · Vincent Fortuin · Gunnar Ratsch · Khan Emtiyaz

Scalable Optimal Transport in High Dimensions for Graph Distances, Embedding Alignment, and More

Johannes Gasteiger · Marten Lienen · Stephan Günnemann

Self Normalizing Flows

T. Anderson Keller · Jorn Peters · Priyank Jaini · Emiel Hoogeboom · Patrick Forré · Max Welling

Self-Tuning for Data-Efficient Deep Learning

Ximei Wang · Jinghan Gao · Mingsheng Long · Jianmin Wang

Size-Invariant Graph Representations for Graph Classification Extrapolations

Beatrice Bevilacqua · Yangze Zhou · Bruno Ribeiro

Skill Discovery for Exploration and Planning using Deep Skill Graphs

Akhil Bagaria · Jason Senthil · George Konidaris

Slot Machines: Discovering Winning Combinations of Random Weights in Neural Networks

Maxwell M Aladago · Lorenzo Torresani

Sparsifying Networks via Subdifferential Inclusion

Sagar Verma · Jean-Christophe Pesquet

Stability and Convergence of Stochastic Gradient Clipping: Beyond Lipschitz Continuity and Smoothness

Vien Mai · Mikael Johansson

State Entropy Maximization with Random Encoders for Efficient Exploration

Younggyo Seo · Lili Chen · Jinwoo Shin · Honglak Lee · Pieter Abbeel · Kimin Lee

Targeted Data Acquisition for Evolving Negotiation Agents

Minae Kwon · Siddharth Karamcheti · Mariano-Florentino Cuellar · Dorsa Sadigh

Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning

Anuj Mahajan · Mikayel Samvelyan · Lei Mao · Viktor Makoviychuk · Animesh Garg · Jean Kossaifi · Shimon Whiteson · Yuke Zhu · Anima Anandkumar

Thinking Like Transformers

Gail Weiss · Yoav Goldberg · Eran Yahav

Unbalanced minibatch Optimal Transport; applications to Domain Adaptation

Kilian Fatras · Thibault Séjourné · Rémi Flamary · Nicolas Courty

UnICORNN: A recurrent model for learning very long time dependencies

T. Konstantin Rusch · Siddhartha Mishra

Unifying Vision-and-Language Tasks via Text Generation

Jaemin Cho · Jie Lei · Hao Tan · Mohit Bansal

Unsupervised Learning of Visual 3D Keypoints for Control

Boyuan Chen · Pieter Abbeel · Deepak Pathak

Voice2Series: Reprogramming Acoustic Models for Time Series Classification

Huck Yang · Yun-Yun Tsai · Pin-Yu Chen

What Are Bayesian Neural Network Posteriors Really Like?

Pavel Izmailov · Sharad Vikram · Matthew Hoffman · Andrew Wilson

Winograd Algorithm for AdderNet

Wenshuo Li · Hanting Chen · Mingqiang Huang · Xinghao Chen · Chunjing Xu · Yunhe Wang

World Model as a Graph: Learning Latent Landmarks for Planning

Lunjun Zhang · Ge Yang · Bradly Stadie

Zeroth-Order Non-Convex Learning via Hierarchical Dual Averaging

Amélie Héliou · Matthieu Martin · Panayotis Mertikopoulos · Thibaud J Rahier

Zoo-Tuning: Adaptive Transfer from A Zoo of Models

Yang Shu · Zhi Kou · Zhangjie Cao · Jianmin Wang · Mingsheng Long

Acceleration via Fractal Learning Rate Schedules

Naman Agarwal · Surbhi Goel · Cyril Zhang

A Free Lunch From ANN: Towards Efficient, Accurate Spiking Neural Networks Calibration

Yuhang Li · Shikuang Deng · Xin Dong · Ruihao Gong · Shi Gu

A Hybrid Variance-Reduced Method for Decentralized Stochastic Non-Convex Optimization

Ran Xin · Usman Khan · Soummya Kar

A New Formalism, Method and Open Issues for Zero-Shot Coordination

Johannes Treutlein · Michael Dennis · Caspar Oesterheld · Jakob Foerster

A New Representation of Successor Features for Transfer across Dissimilar Environments

Majid Abdolshah · Hung Le · Thommen Karimpanal George · Sunil Gupta · Santu Rana · Svetha Venkatesh

A Novel Sequential Coreset Method for Gradient Descent Algorithms

Jiawei Huang · Ruomin Huang · Wenjie Liu · Nikolaos Freris · Hu Ding

A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning

Dong Ki Kim · Miao Liu · Matthew Riemer · Chuangchuang Sun · Marwa Abdulhai · Golnaz Habibi · Sebastian Lopez-Cot · Gerald Tesauro · Jonathan How

A Probabilistic Approach to Neural Network Pruning

Xin Qian · Diego Klabjan

Attention is not all you need: pure attention loses rank doubly exponentially with depth

Yihe Dong · Jean-Baptiste Cordonnier · Andreas Loukas

A Unified Lottery Ticket Hypothesis for Graph Neural Networks

Tianlong Chen · Yongduo Sui · Xuxi Chen · Aston Zhang · Zhangyang “Atlas” Wang

Autoencoder Image Interpolation by Shaping the Latent Space

Alon Oring · Zohar Yakhini · Yacov Hel-Or

Autoencoding Under Normalization Constraints

Sangwoong Yoon · Yung-Kyun Noh · Frank Chongwoo Park

AutoSampling: Search for Effective Data Sampling Schedules

Ming Sun · Haoxuan Dou · Baopu Li · Junjie Yan · Wanli Ouyang · Lei Cui

Better Training using Weight-Constrained Stochastic Dynamics

Benedict Leimkuhler · Tiffany Vlaar · Timothée Pouchon · Amos Storkey

Bias-Robust Bayesian Optimization via Dueling Bandits

Johannes Kirschner · Andreas Krause

Bias-Variance Reduced Local SGD for Less Heterogeneous Federated Learning

Tomoya Murata · Taiji Suzuki

Boosting the Throughput and Accelerator Utilization of Specialized CNN Inference Beyond Increasing Batch Size

Jack Kosaian · Amar Phanishayee · Matthai Philipose · Debadeepta Dey · Rashmi Vinayak

BORE: Bayesian Optimization by Density-Ratio Estimation

Louis Chi-Chun Tiao · Aaron Klein · Matthias W Seeger · Edwin V Bonilla · Cedric Archambeau · Fabio Ramos

Continual Learning in the Teacher-Student Setup: Impact of Task Similarity

Sebastian Lee · Sebastian Goldt · Andrew Saxe

Controlling Graph Dynamics with Reinforcement Learning and Graph Neural Networks

Eli Meirom · Haggai Maron · Shie Mannor · Gal Chechik

ConvexVST: A Convex Optimization Approach to Variance-stabilizing Transformation

Mengfan Wang · Boyu Lyu · Guoqiang Yu

ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases

Stéphane d'Ascoli · Hugo Touvron · Matthew Leavitt · Ari Morcos · Giulio Biroli · Levent Sagun

Counterfactual Credit Assignment in Model-Free Reinforcement Learning

Dataset Dynamics via Gradient Flows in Probability Space

David Alvarez-Melis · Nicolo Fusi

Decomposable Submodular Function Minimization via Maximum Flow

Kyriakos Axiotis · Adam Karczmarz · Anish Mukherjee · Piotr Sankowski · Adrian Vladu

Deep kernel processes

Laurence Aitchison · Adam Yang · Sebastian Ober

Deeply-Debiased Off-Policy Interval Estimation

Chengchun Shi · Runzhe Wan · Victor Chernozhukov · Rui Song

Deep Reinforcement Learning amidst Continual Structured Non-Stationarity

Annie Xie · James Harrison · Chelsea Finn

DeepWalking Backwards: From Embeddings Back to Graphs

Sudhanshu Chanpuriya · Cameron Musco · Konstantinos Sotiropoulos · Charalampos Tsourakakis

Distributed Second Order Methods with Fast Rates and Compressed Communication

Rustem Islamov · Xun Qian · Peter Richtarik

Distributionally Robust Optimization with Markovian Data

Mengmeng Li · Tobias Sutter · Daniel Kuhn

Efficient Generative Modelling of Protein Structure Fragments using a Deep Markov Model

Christian Thygesen · Christian Skjødt Steenmans · Ahmad Salim Al-Sibahi · Lys Sanz Moreta · Anders Bundgård Sørensen · Thomas Hamelryck

Efficient Lottery Ticket Finding: Less Data is More

Zhenyu Zhang · Xuxi Chen · Tianlong Chen · Zhangyang “Atlas” Wang

Efficient Message Passing for 0–1 ILPs with Binary Decision Diagrams

Jan-Hendrik Lange · Paul Swoboda

EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL

Seyed Kamyar Seyed Ghasemipour · Dale Schuurmans · Shixiang Gu

E(n) Equivariant Graph Neural Networks

Victor Garcia Satorras · Emiel Hoogeboom · Max Welling

Evolving Attention with Residual Convolutions

Yujing Wang · Yaming Yang · Jiangang Bai · Mingliang Zhang · Jing Bai · Jing Yu · Ce Zhang · Gao Huang · Yunhai Tong

Explainable Automated Graph Representation Learning with Hyperparameter Importance

Xin Wang · Shuyi Fan · Kun Kuang · Wenwu Zhu

Exploiting structured data for learning contagious diseases under incomplete testing

Maggie Makar · Lauren R West · David C Hooper · Eric Horvitz · Erica Shenoy · John Guttag

Federated Continual Learning with Weighted Inter-client Transfer

Jaehong Yoon · Wonyong Jeong · GiWoong Lee · Eunho Yang · Sung Ju Hwang

Federated Learning of User Verification Models Without Sharing Embeddings

Hossein Hosseini · Hyunsin Park · Sungrack Yun · Christos Louizos · Joseph B Soriaga · Max Welling

Federated Learning under Arbitrary Communication Patterns

Dmitrii Avdiukhin · Shiva Kasiviswanathan

From Local Structures to Size Generalization in Graph Neural Networks

Gilad Yehudai · Ethan Fetaya · Eli Meirom · Gal Chechik · Haggai Maron

Generative Adversarial Networks for Markovian Temporal Dynamics: Stochastic Continuous Data Generation

Sung Woo Park · Dong Wook Shu · Junseok Kwon

Global inducing point variational posteriors for Bayesian neural networks and deep Gaussian processes

Sebastian Ober · Laurence Aitchison

GraphDF: A Discrete Flow Model for Molecular Graph Generation

Youzhi Luo · Keqiang Yan · Shuiwang Ji

HardCoRe-NAS: Hard Constrained diffeRentiable Neural Architecture Search

Niv Nayman · Yonathan Aflalo · Asaf Noy · Lihi Zelnik

"Hey, that's not an ODE": Faster ODE Adjoints via Seminorms

Patrick Kidger · Ricky T. Q. Chen · Terry Lyons

How Framelets Enhance Graph Neural Networks

Xuebin Zheng · Bingxin Zhou · Junbin Gao · Yuguang Wang · Pietro Lió · Ming Li · Guido Montufar

Imitation by Predicting Observations

Andrew Jaegle · Yury Sulsky · Arun Ahuja · Jake Bruce · Rob Fergus · Greg Wayne

Improved Denoising Diffusion Probabilistic Models

Alexander Nichol · Prafulla Dhariwal

Is Space-Time Attention All You Need for Video Understanding?

Gedas Bertasius · Heng Wang · Lorenzo Torresani

KNAS: Green Neural Architecture Search

Jingjing Xu · Liang Zhao · Junyang Lin · Rundong Gao · Xu Sun · Hongxia Yang

Large-Margin Contrastive Learning with Distance Polarization Regularizer

Shuo Chen · Gang Niu · Chen Gong · Jun Li · Jian Yang · Masashi Sugiyama

Learning and Planning in Complex Action Spaces

Thomas Hubert · Julian Schrittwieser · Ioannis Antonoglou · Mohammadamin Barekatain · Simon Schmitt · David Silver

Learning Routines for Effective Off-Policy Reinforcement Learning

Edoardo Cetin · Oya Celiktutan

Leveraging Sparse Linear Layers for Debuggable Deep Networks

Eric Wong · Shibani Santurkar · Aleksander Madry

LieTransformer: Equivariant Self-Attention for Lie Groups

Michael Hutchinson · Charline Le Lan · Sheheryar Zaidi · Emilien Dupont · Yee-Whye Teh · Hyunjik Kim

Loss Surface Simplexes for Mode Connecting Volumes and Fast Ensembling

Gregory Benton · Wesley Maddox · Sanae Lotfi · Andrew Wilson

Low-Rank Sinkhorn Factorization

Meyer Scetbon · Marco Cuturi · Gabriel Peyré

Making transport more robust and interpretable by moving data through a small number of anchor points

Chi-Heng Lin · Mehdi Azabou · Eva Dyer

MC-LSTM: Mass-Conserving LSTM

Pieter-Jan Hoedt · Frederik Kratzert · Daniel Klotz · Christina Halmich · Markus Holzleitner · Grey Nearing · Sepp Hochreiter · Günter Klambauer

Model-Based Reinforcement Learning via Latent-Space Collocation

Oleh Rybkin · Chuning Zhu · Anusha Nagabandi · Kostas Daniilidis · Igor Mordatch · Sergey Levine

MSA Transformer

Roshan Rao · Jason Liu · Robert Verkuil · Joshua Meier · John Canny · Pieter Abbeel · Tom Sercu · Alexander Rives

Multiscale Invertible Generative Networks for High-Dimensional Bayesian Inference

Shumao Zhang · Pengchuan Zhang · Thomas Hou

Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation

Sam Devlin · Raluca Georgescu · Ida Momennejad · Jaroslaw Rzepecki · Evelyn Zuniga · Gavin Costello · Guy Leroy · Ali Shaw · Katja Hofmann

NeRF-VAE: A Geometry Aware 3D Scene Generative Model

Adam Kosiorek · Heiko Strathmann · Daniel Zoran · Pol Moreno · Rosalia Schneider · Sona Mokra · Danilo J. Rezende

Neural Architecture Search without Training

Joe Mellor · Jack Turner · Amos Storkey · Elliot Crowley

Neural-Pull: Learning Signed Distance Function from Point clouds by Learning to Pull Space onto Surface

Baorui Ma · Zhizhong Han · Yushen Liu · Matthias Zwicker

Neural Symbolic Regression that scales

Luca Biggio · Tommaso Bendinelli · Alexander Neitz · Aurelien Lucchi · Giambattista Parascandolo

Neuro-algorithmic Policies Enable Fast Combinatorial Generalization

Marin Vlastelica · Michal Rolinek · Georg Martius

Newton Method over Networks is Fast up to the Statistical Precision

Amir Daneshmand · Gesualdo Scutari · Pavel Dvurechenskii · Alexander Gasnikov

Offline Contextual Bandits with Overparameterized Models

David Brandfonbrener · William Whitney · Rajesh Ranganath · Joan Bruna

Offline Reinforcement Learning with Pseudometric Learning

Robert Dadashi · Shideh Rezaeifar · Nino Vieillard · Léonard Hussenot · Olivier Pietquin · Matthieu Geist

OmniNet: Omnidirectional Representations from Transformers

Yi Tay · Mostafa Dehghani · Vamsi Aribandi · Jai Gupta · Philip Pham · Zhen Qin · Dara Bahri · Da-Cheng Juan · Don Metzler

One-sided Frank-Wolfe algorithms for saddle problems

Vladimir Kolmogorov · Thomas Pock

On Monotonic Linear Interpolation of Neural Network Parameters

James Lucas · Juhan Bae · Michael Zhang · Stanislav Fort · Richard Zemel · Roger Grosse

On the Optimality of Batch Policy Optimization Algorithms

Chenjun Xiao · Yifan Wu · Jincheng Mei · Bo Dai · Tor Lattimore · Lihong Li · Csaba Szepesvari · Dale Schuurmans

Optimal Complexity in Decentralized Training

Yucheng Lu · Christopher De Sa

Outlier-Robust Optimal Transport

Debarghya Mukherjee · Aritra Guha · Justin Solomon · Yuekai Sun · Mikhail Yurochkin

Parallelizing Legendre Memory Unit Training

Narsimha Reddy Chilkuri · Chris Eliasmith

PC-MLP: Model-based Reinforcement Learning with Policy Cover Guided Exploration

Yuda Song · Wen Sun

Perceiver: General Perception with Iterative Attention

Andrew Jaegle · Felix Axel Gimeno Gil · Andy Brock · Oriol Vinyals · Andrew Zisserman · Joao Carreira

Phasic Policy Gradient

Karl Cobbe · Jacob Hilton · Oleg Klimov · John Schulman

PODS: Policy Optimization via Differentiable Simulation

Miguel Angel Zamora Mora · Momchil Peychev · Sehoon Ha · Martin Vechev · Stelian Coros

Preferential Temporal Difference Learning

Nishanth Anand · Doina Precup

Principal Component Hierarchy for Sparse Quadratic Programs

Robbie Vreugdenhil · Viet Anh Nguyen · Armin Eftekhari · Peyman Mohajerin Esfahani

Principled Simplicial Neural Networks for Trajectory Prediction

T. Mitchell Roddenberry · Nicholas Glaze · Santiago Segarra

Reinforcement Learning for Cost-Aware Markov Decision Processes

Wesley A Suttle · Kaiqing Zhang · Zhuoran Yang · Ji Liu · David N Kraemer

Relative Positional Encoding for Transformers with Linear Complexity

Antoine Liutkus · Ondřej Cífka · Shih-Lun Wu · Umut Simsekli · Yi-Hsuan Yang · Gaël Richard

Revisiting Rainbow: Promoting more insightful and inclusive deep reinforcement learning research

Johan Obando Ceron · Pablo Samuel Castro

Riemannian Convex Potential Maps

Samuel Cohen · Brandon Amos · Yaron Lipman

Risk-Sensitive Reinforcement Learning with Function Approximation: A Debiasing Approach

Yingjie Fei · Zhuoran Yang · Zhaoran Wang

Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot

Joel Z Leibo · Edgar Duenez-Guzman · Alexander Vezhnevets · John Agapiou · Peter Sunehag · Raphael Koster · Jayd Matyas · Charles Beattie · Igor Mordatch · Thore Graepel

Simultaneous Similarity-based Self-Distillation for Deep Metric Learning

Karsten Roth · Timo Milbich · Bjorn Ommer · Joseph Paul Cohen · Marzyeh Ghassemi

Sliced Iterative Normalizing Flows

Biwei Dai · Uros Seljak

SPADE: A Spectral Method for Black-Box Adversarial Robustness Evaluation

Wuxinlin Cheng · Chenhui Deng · Zhiqiang Zhao · Yaohui Cai · Zhiru Zhang · Zhuo Feng

Spectral Smoothing Unveils Phase Transitions in Hierarchical Variational Autoencoders

Adeel Pervez · Efstratios Gavves

Stochastic Sign Descent Methods: New Algorithms and Better Theory

Mher Safaryan · Peter Richtarik

Strategic Classification Made Practical

Sagi Levanon · Nir Rosenfeld

Synthesizer: Rethinking Self-Attention for Transformer Models

Yi Tay · Dara Bahri · Don Metzler · Da-Cheng Juan · Zhe Zhao · Che Zheng

Towards Understanding Learning in Neural Networks with Linear Teachers

Roei Sarussi · Alon Brutzkus · Amir Globerson

UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning

Tarun Gupta · Anuj Mahajan · Bei Peng · Wendelin Boehmer · Shimon Whiteson

Value Alignment Verification

Daniel Brown · Jordan Schneider · Anca Dragan · Scott Niekum

Variance Reduction via Primal-Dual Accelerated Dual Averaging for Nonsmooth Convex Finite-Sums

Chaobing Song · Stephen Wright · Jelena Diakonikolas

Variational Data Assimilation with a Learned Inverse Observation Operator

Thomas Frerix · Dmitrii Kochkov · Jamie Smith · Daniel Cremers · Michael Brenner · Stephan Hoyer

Vector Quantized Models for Planning

Sherjil Ozair · Yazhe Li · Ali Razavi · Ioannis Antonoglou · Aäron van den Oord · Oriol Vinyals

ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision

Wonjae Kim · Bokyung Son · Ildoo Kim

Weisfeiler and Lehman Go Topological: Message Passing Simplicial Networks

Cristian Bodnar · Fabrizio Frasca · Yuguang Wang · Nina Otter · Guido Montufar · Pietro Lió · Michael Bronstein

What's in the Box? Exploring the Inner Life of Neural Networks with Robust Rules

Jonas Fischer · Anna Olah · Jilles Vreeken


Social

Improving Global Research Collaboration & Communication

Taylor Marrison

6:00 PM - 7:00 PM

Come learn and share best practices for collaborating with researchers around the world, and discuss how to bridge the remote work, cultural, and social divides.


Social

Black in AI Social

Charles Earl · Victor Silva

8:00 PM - 10:00 PM

For over four years, Black in AI has been a place for sharing ideas, fostering collaborations, and discussing initiatives to increase the presence of Black people in the field of Artificial Intelligence. If you are in AI and either self-identify as Black, African, Diaspora or an ally, please join us at ICML21 to discuss interests, challenges, opportunities, collaborations, and other related issues. We plan to gather for a one-hour town hall and Q&A session. We'll then continue with informal socializing for the remaining hour.


Town Hall

John Langford · Marina Meila · Tong Zhang · Stefanie Jegelka · Csaba Szepesvari · Le Song

8:00 PM - 9:00 PM

The ICML town hall is primarily a chance for the community to interact with the ICML organizers and give feedback. We cover various details of this ICML and future plans, with the bulk of the time devoted to discussion.
