Downloads 2021

Format:
Posters:
Tutorials:
Invited talks:
Workshops:
Demonstrations:

Number of events: 1231

12-Lead ECG Reconstruction via Koopman Operators
1-bit Adam: Communication Efficient Large-Scale Training with Adam's Convergence Speed
8th ICML Workshop on Automated Machine Learning (AutoML 2021)
A Bit More Bayesian: Domain-Invariant Learning with Uncertainty
A Blessing in Disguise: The Prospects and Perils of Adversarial Machine Learning
Accelerate CNNs from Three Dimensions: A Comprehensive Pruning Framework
Accelerated Algorithms for Smooth Convex-Concave Minimax Problems with O(1/k^2) Rate on Squared Gradient Norm
Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving
Accelerating Gossip SGD with Periodic Global Averaging
Accelerating Safe Reinforcement Learning with Constraint-mismatched Baseline Policies
Acceleration via Fractal Learning Rate Schedules
Accumulated Decoupled Learning with Gradient Staleness Mitigation for Convolutional Neural Networks
Accuracy, Interpretability, and Differential Privacy via Explainable Boosting
Accuracy on the Line: on the Strong Correlation Between Out-of-Distribution and In-Distribution Generalization
Accurate Post Training Quantization With Small Calibration Sets
ACE: Explaining cluster from an adversarial perspective
Achieving Near Instance-Optimality and Minimax-Optimality in Stochastic and Adversarial Linear Bandits Simultaneously
A Collective Learning Framework to Boost GNN Expressiveness for Node Classification
Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills
Active Covering
Active Deep Probabilistic Subsampling
Active Feature Acquisition with Generative Surrogate Models
Active Learning for Distributionally Robust Level-Set Estimation
Active Learning of Continuous-time Bayesian Networks through Interventions
Active Slices for Sliced Stein Discrepancy
Active Testing: Sample-Efficient Model Evaluation
ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training
Adapting to Delays and Data in Adversarial Multi-Armed Bandits
Adapting to misspecification in contextual bandits with offline regression oracles
Adaptive Newton Sketch: Linear-time Optimization with Quadratic Convergence and Effective Hessian Dimensionality
Adaptive Sampling for Best Policy Identification in Markov Decision Processes
AdaXpert: Adapting Neural Architecture for Growing Data
Additive Error Guarantees for Weighted Low Rank Approximation
Addressing Catastrophic Forgetting in Few-Shot Problems
A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation
A Differentiable Point Process with Its Application to Spiking Neural Networks
A Discriminative Technique for Multiple-Source Adaptation
A Distribution-dependent Analysis of Meta Learning
ADOM: Accelerated Decentralized Optimization Method for Time-Varying Networks
Adversarial Combinatorial Bandits with General Non-linear Reward Functions
Adversarial Dueling Bandits
Adversarial Multi Class Learning under Weak Supervision with Performance Guarantees
Adversarial Option-Aware Hierarchical Imitation Learning
Adversarial Policy Learning in Two-player Competitive Games
Adversarial Purification with Score-based Generative Models
Adversarial Robustness Guarantees for Random Deep Neural Networks
Affine Invariant Analysis of Frank-Wolfe on Strongly Convex Sets
A Framework for Private Matrix Analysis in Sliding Window Model
A Free Lunch From ANN: Towards Efficient, Accurate Spiking Neural Networks Calibration
A Functional Perspective on Learning Symmetric Functions with Neural Networks
A General Framework For Detecting Anomalous Inputs to DNN Classifiers
AGENT: A Benchmark for Core Psychological Reasoning
Aggregating From Multiple Target-Shifted Sources
Agnostic Learning of Halfspaces with Gradient Descent via Soft Margins
A Gradient Based Strategy for Hamiltonian Monte Carlo Hyperparameter Optimization
A Hybrid Variance-Reduced Method for Decentralized Stochastic Non-Convex Optimization
A Language for Counterfactual Generative Models
A large-scale benchmark for few-shot program induction and synthesis
Align, then memorise: the dynamics of learning with feedback alignment
Almost Optimal Anytime Algorithm for Batched Multi-Armed Bandits
A Lower Bound for the Sample Complexity of Inverse Reinforcement Learning
AlphaNet: Improved Training of Supernets with Alpha-Divergence
Alternative Microfoundations for Strategic Classification
A Modular Analysis of Provable Acceleration via Polyak's Momentum: Training a Wide ReLU Network and a Deep Linear Network
Amortized Conditional Normalized Maximum Likelihood: Reliable Out of Distribution Uncertainty Estimation
An Algorithm for Stochastic and Adversarial Bandits with Switching Costs
Analysis of stochastic Lanczos quadrature for spectrum approximation
Analyzing the tree-layer structure of Deep Forests
An End-to-End Framework for Molecular Conformation Generation via Bilevel Programming
A New Formalism, Method and Open Issues for Zero-Shot Coordination
A New Representation of Successor Features for Transfer across Dissimilar Environments
An exact solver for the Weston-Watkins SVM subproblem
An Identifiable Double VAE For Disentangled Representations
An Information-Geometric Distance on the Space of Tasks
An Integer Linear Programming Framework for Mining Constraints from Data
Annealed Flow Transport Monte Carlo
A Novel Method to Solve Neural Knapsack Problems
A Novel Sequential Coreset Method for Gradient Descent Algorithms
A Nullspace Property for Subspace-Preserving Recovery
A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning
Approximate Group Fairness for Clustering
Approximating a Distribution Using Weight Queries
Approximation Theory Based Methods for RKHS Bandits
Approximation Theory of Convolutional Architectures for Time Series Modelling
A Practical Method for Constructing Equivariant Multilayer Perceptrons for Arbitrary Matrix Groups
A Precise Performance Analysis of Support Vector Regression
A Probabilistic Approach to Neural Network Pruning
A Proxy Variable View of Shared Confounding
APS: Active Pretraining with Successor Features
A Receptor Skeleton for Capsule Neural Networks
A Regret Minimization Approach to Iterative Learning Control
A Representation Learning Perspective on the Importance of Train-Validation Splitting in Meta-Learning
A Riemannian Block Coordinate Descent Method for Computing the Projection Robust Wasserstein Distance
ARMS: Antithetic-REINFORCE-Multi-Sample Gradient for Binary Variables
ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks
A Sampling-Based Method for Tensor Ring Decomposition
A Scalable Deterministic Global Optimization Algorithm for Clustering Problems
A Scalable Second Order Method for Ill-Conditioned Matrix Completion from Few Samples
A Second look at Exponential and Cosine Step Sizes: Simplicity, Adaptivity, and Performance
A Sharp Analysis of Model-based Reinforcement Learning with Self-Play
A statistical perspective on distillation
A Structured Observation Distribution for Generative Biological Sequence Prediction and Forecasting
Asymmetric Heavy Tails and Implicit Bias in Gaussian Noise Injections
Asymmetric Loss Functions for Learning with Noisy Labels
Asymptotic Normality and Confidence Intervals for Prediction Risk of the Min-Norm Least Squares Estimator
Asymptotics of Ridge Regression in Convolutional Models
Asynchronous Decentralized Optimization With Implicit Stochastic Variance Reduction
Asynchronous Distributed Learning : Adapting to Gradient Delays without Prior Knowledge
A Tale of Two Efficient and Informative Negative Sampling Distributions
A theory of high dimensional regression with arbitrary correlations between input features and target functions: sample complexity, multiple descent curves and a hierarchy of phase transitions
A Theory of Label Propagation for Subpopulation Shift
Attention is not all you need: pure attention loses rank doubly exponentially with depth
Augmented World Models Facilitate Zero-Shot Dynamics Generalization From a Single Offline Environment
A Unified Generative Adversarial Network Training via Self-Labeling and Self-Attention
A Unified Lottery Ticket Hypothesis for Graph Neural Networks
AutoAttend: Automated Attention Representation Search
Autoencoder Image Interpolation by Shaping the Latent Space
Autoencoding Under Normalization Constraints
Automatic variational inference with cascading flows
Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators
Autoregressive Denoising Diffusion Models for Multivariate Probabilistic Time Series Forecasting
AutoSampling: Search for Effective Data Sampling Schedules
A Value-Function-based Interior-point Method for Non-convex Bi-level Optimization
Average-Reward Off-Policy Policy Evaluation with Function Approximation
A Wasserstein Minimax Framework for Mixed Linear Regression
A Zeroth-Order Block Coordinate Descent Algorithm for Huge-Scale Black-Box Optimization
Backdoor Scanning for Deep Neural Networks through K-Arm Optimization
Backpropagated Neighborhood Aggregation for Accurate Training of Spiking Neural Networks
BANG: Bridging Autoregressive and Non-autoregressive Generation with Large Scale Pretraining
Barlow Twins: Self-Supervised Learning via Redundancy Reduction
BASE Layers: Simplifying Training of Large, Sparse Models
BASGD: Buffered Asynchronous SGD for Byzantine Learning
BasisDeVAE: Interpretable Simultaneous Dimensionality Reduction and Feature-Level Clustering with Derivative-Based Variational Autoencoders
Batch Value-function Approximation with Only Realizability
Bayesian Algorithm Execution: Estimating Computable Properties of Black-box Functions Using Mutual Information
Bayesian Attention Belief Networks
Bayesian Deep Learning via Subnetwork Inference
Bayesian Optimistic Optimisation with Exponentially Decaying Regret
Bayesian Optimization over Hybrid Spaces
Bayesian Quadrature on Riemannian Data Manifolds
Bayesian Structural Adaptation for Continual Learning
Benchmarks, Algorithms, and Metrics for Hierarchical Disentanglement
Besov Function Approximation and Binary Classification on Low-Dimensional Manifolds Using Convolutional Residual Networks
Best Arm Identification in Graphical Bilinear Bandits
Best Model Identification: A Rested Bandit Formulation
Better Training using Weight-Constrained Stochastic Dynamics
Beyond $log^2(T)$ regret for decentralized bandits in matching markets
Beyond first-order methods in machine learning systems
Beyond the Pareto Efficient Frontier: Constraint Active Search for Multiobjective Experimental Design
Beyond Variance Reduction: Understanding the True Impact of Baselines on Policy Optimization
Bias-Free Scalable Gaussian Processes via Randomized Truncations
Bias-Robust Bayesian Optimization via Dueling Bandits
Bias-Variance Reduced Local SGD for Less Heterogeneous Federated Learning
Bilevel Optimization: Convergence Analysis and Enhanced Design
Bilinear Classes: A Structural Framework for Provable Generalization in RL
Binary Classification from Multiple Unlabeled Datasets via Surrogate Set Classification
Black-box density function estimation using recursive partitioning
Blind Pareto Fairness and Subgroup Robustness
Boosting for Online Convex Optimization
Boosting the Throughput and Accelerator Utilization of Specialized CNN Inference Beyond Increasing Batch Size
Bootstrapping Fitted Q-Evaluation for Off-Policy Inference
BORE: Bayesian Optimization by Density-Ratio Estimation
Breaking the Deadly Triad with a Target Network
Breaking the Limits of Message Passing Graph Neural Networks
Break-It-Fix-It: Unsupervised Learning for Program Repair
Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation
Budgeted Heterogeneous Treatment Effect Estimation
Byzantine-Resilient High-Dimensional SGD with Local Iterations on Heterogeneous Data
Calibrate Before Use: Improving Few-shot Performance of Language Models
Can Subnetwork Structure Be the Key to Out-of-Distribution Generalization?
CARTL: Cooperative Adversarially-Robust Transfer Learning
Catastrophic Fisher Explosion: Early Phase Fisher Matrix Impacts Generalization
CATE: Computation-aware Neural Architecture Encoding with Transformers
Catformer: Designing Stable Transformers via Sensitivity Analysis
Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning
Causality-aware counterfactual confounding adjustment as an alternative to linear residualization in anticausal prediction tasks based on linear learners
ChaCha for Online AutoML
Challenges in Deploying and monitoring Machine Learning Systems
Characterizing Fairness Over the Set of Good Models Under Selective Labels
Characterizing Structural Regularities of Labeled Data in Overparameterized Models
Characterizing the Gap Between Actor-Critic and Policy Gradient
Chebyshev Polynomial Codes: Task Entanglement-based Coding for Distributed Matrix Multiplication
CIFS: Improving Adversarial Robustness of CNNs via Channel-wise Importance-based Feature Selection
Class2Simi: A Noise Reduction Perspective on Learning with Noisy Labels
Classification with Rejection Based on Cost-sensitive Classification
Classifying high-dimensional Gaussian mixtures: Where kernel methods fail and neural networks succeed
CLOCS: Contrastive Learning of Cardiac Signals Across Space, Time, and Patients
Clusterability as an Alternative to Anchor Points When Learning with Noisy Labels
Clustered Sampling: Low-Variance and Improved Representativity for Clients Selection in Federated Learning
Coach-Player Multi-agent Reinforcement Learning for Dynamic Team Composition
Coded-InvNet for Resilient Prediction Serving Systems
Collaborative Bayesian Optimization with Fair Regret
Combinatorial Blocking Bandits with Stochastic Delays
Combining Pessimism with Optimism for Robust and Efficient Model-Based Deep Reinforcement Learning
CombOptNet: Fit the Right NP-Hard Problem by Learning Integer Programming Constraints
Communication-Efficient Distributed Optimization with Quantized Preconditioners
Communication-Efficient Distributed SVD via Local Power Iterations
Commutative Lie Group VAE for Disentanglement Learning
Composed Fine-Tuning: Freezing Pre-Trained Denoising Autoencoders for Improved Generalization
Composing Normalizing Flows for Inverse Problems
Compositional Video Synthesis with Action Graphs
Compressed Maximum Likelihood
Concentric mixtures of Mallows models for top-$k$ rankings: sampling and identifiability
Conditional Distributional Treatment Effect with Kernel Conditional Mean Embeddings and U-Statistic Regression
Conditional Temporal Neural Processes with Covariance Loss
Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Confidence-Budget Matching for Sequential Budgeted Learning
Confidence Scores Make Instance-dependent Label-noise Learning Possible
Conformal prediction interval for dynamic time-series
Conjugate Energy-Based Models
Connecting Interpretability and Robustness in Decision Trees through Separation
Connecting Optimal Ex-Ante Collusion in Teams to Extensive-Form Correlation: Faster Algorithms and Positive Complexity Results
Connecting Sphere Manifolds Hierarchically for Regularization
Consensus Control for Decentralized Deep Learning
Conservative Objective Models for Effective Offline Model-Based Optimization
Consistent Nonparametric Methods for Network Assisted Covariate Estimation
Consistent regression when oblivious outliers overwhelm
Context-Aware Online Collective Inference for Templated Graphical Models
Continual Learning in the Teacher-Student Setup: Impact of Task Similarity
Continual Learning with Deep Architectures
Continuous Coordination As a Realistic Scenario for Lifelong Learning
Continuous-time Model-based Reinforcement Learning
Contrastive Learning Inverts the Data Generating Process
Controlling Graph Dynamics with Reinforcement Learning and Graph Neural Networks
Convex Regularization in Monte-Carlo Tree Search
ConvexVST: A Convex Optimization Approach to Variance-stabilizing Transformation
ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases
Cooperative Exploration for Multi-Agent Deep Reinforcement Learning
Correcting Exposure Bias for Link Recommendation
Correlation Clustering in Constant Many Parallel Rounds
Counterfactual Credit Assignment in Model-Free Reinforcement Learning
CountSketches, Feature Hashing and the Median of Three
CRFL: Certifiably Robust Federated Learning against Backdoor Attacks
Cross-domain Imitation from Observations
Cross-Gradient Aggregation for Decentralized Learning from Non-IID Data
Cross-model Back-translated Distillation for Unsupervised Machine Translation
Crowdsourcing via Annotator Co-occurrence Imputation and Provable Symmetric Nonnegative Matrix Factorization
CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee
Cryospheric Science and Emergence of Machine Learning
Crystallization Learning with the Delaunay Triangulation
Cumulants of Hawkes Processes are Robust to Observation Noise
CURI: A Benchmark for Productive Concept Learning Under Uncertainty
Cyclically Equivariant Neural Decoders for Cyclic Codes
DAGs with No Curl: An Efficient DAG Structure Learning Approach
DANCE: Enhancing saliency maps using decoys
Dash: Semi-Supervised Learning with Dynamic Thresholding
Data augmentation for deep learning based accelerated MRI reconstruction with limited data
Data Augmentation for Meta-Learning
Data-driven Prediction of General Hamiltonian Dynamics via Learning Exactly-Symplectic Maps
Data-efficient Hindsight Off-policy Option Learning
Data-Free Knowledge Distillation for Heterogeneous Federated Learning
Dataset Condensation with Differentiable Siamese Augmentation
Dataset Dynamics via Gradient Flows in Probability Space
Debiasing a First-order Heuristic for Approximate Bi-level Optimization
Debiasing Model Updates for Improving Personalized Federated Training
Decentralized Riemannian Gradient Descent on the Stiefel Manifold
Decentralized Single-Timescale Actor-Critic on Zero-Sum Two-Player Stochastic Games
Deciding What to Learn: A Rate-Distortion Approach
Decision-Making Under Selective Labels: Optimal Finite-Domain Policies and Beyond
Decomposable Submodular Function Minimization via Maximum Flow
Decomposed Mutual Information Estimation for Contrastive Representation Learning
Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without Sacrifices
Decoupling Representation Learning from Reinforcement Learning
Decoupling Value and Policy for Generalization in Reinforcement Learning
Deep Adaptive Design: Amortizing Sequential Bayesian Experimental Design
Deep Coherent Exploration for Continuous Control
Deep Continuous Networks
Deep Generative Learning via Schrödinger Bridge
Deep kernel processes
Deep Latent Graph Matching
Deep Learning for Functional Data Analysis with Adaptive Basis Layers
Deeply-Debiased Off-Policy Interval Estimation
DeepReDuce: ReLU Reduction for Fast Private Inference
Deep Reinforcement Learning amidst Continual Structured Non-Stationarity
DeepWalking Backwards: From Embeddings Back to Graphs
Defense against backdoor attacks via robust covariance estimation
Delving into Deep Imbalanced Regression
Demonstration-Conditioned Reinforcement Learning for Few-Shot Imitation
Demystifying Inductive Biases for (Beta-)VAE Based Architectures
Dense for the Price of Sparse: Improved Performance of Sparsely Initialized Networks via a Subspace Offset
Density Constrained Reinforcement Learning
Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers
Detecting Rewards Deterioration in Episodic Reinforcement Learning
Detection of Signal in the Spiked Rectangular Models
DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning
DG-LMC: A Turn-key and Scalable Synchronous Distributed MCMC Algorithm via Langevin Monte Carlo within Gibbs
Dichotomous Optimistic Search to Quantify Human Perception
Differentiable Dynamic Quantization with Mixed Precision and Adaptive Resolution
Differentiable Particle Filtering via Entropy-Regularized Optimal Transport
Differentiable Sorting Networks for Scalable Sorting and Ranking Supervision
Differentiable Spatial Planning using Transformers
Differentially Private Aggregation in the Shuffle Model: Almost Central Accuracy in Almost a Single Message
Differentially Private Bayesian Inference for Generalized Linear Models
Differentially-Private Clustering of Easy Instances
Differentially Private Correlation Clustering
Differentially Private Densest Subgraph Detection
Differentially Private Quantiles
Differentially Private Query Release Through Adaptive Projection
Differentially Private Sliced Wasserstein Distance
Diffusion Earth Mover's Distance and Distribution Embeddings
Diffusion Source Identification on Networks with Statistical Confidence
Dimensionality Reduction for the Sum-of-Distances Metric
Directed Graph Embeddings in Pseudo-Riemannian Manifolds
Directional Bias Amplification
Directional Graph Networks
Disambiguation of Weak Supervision leading to Exponential Convergence rates
Discovering symbolic policies with deep reinforcement learning
Discrete-Valued Latent Preference Matrix Estimation with Graph Side Information
Discretization Drift in Two-Player Games
Discriminative Complementary-Label Learning with Weighted Loss
Disentangling Sampling and Labeling Bias for Learning in Large-output Spaces
Disentangling syntax and semantics in the brain with deep networks
Dissecting Supervised Constrastive Learning
Distributed Nystr\"{o}m Kernel Learning with Communications
Distributed Second Order Methods with Fast Rates and Compressed Communication
Distributionally Robust Optimization with Markovian Data
Distribution-Free Calibration Guarantees for Histogram Binning without Sample Splitting
Ditto: Fair and Robust Federated Learning Through Personalization
Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient Exploration
Domain Generalization using Causal Matching
Don’t Just Blame Over-parametrization for Over-confidence: Theoretical Analysis of Calibration in Binary Classification
DORO: Distributional and Outlier Robust Optimization
Double-Win Quant: Aggressively Winning Robustness of Quantized Deep Neural Networks via Random Precision Training and Inference
Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning
Do We Actually Need Dense Over-Parameterization? In-Time Over-Parameterization in Sparse Training
DriftSurf: Stable-State / Reactive-State Learning under Concept Drift
Dropout: Explicit Forms and Capacity Control
Dual Principal Component Pursuit for Robust Subspace Learning: Theory and Algorithms for a Holistic Approach
Dueling Convex Optimization
Dynamic Balancing for Model Selection in Bandits and RL
Dynamic Game Theoretic Neural Optimizer
Dynamic Planning and Learning under Recovering Rewards
Efficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games
Efficient Differentiable Simulation of Articulated Bodies
Efficient Generative Modelling of Protein Structure Fragments using a Deep Markov Model
Efficient Iterative Amortized Inference for Learning Symmetric and Disentangled Multi-Object Representations
Efficient Lottery Ticket Finding: Less Data is More
Efficient Message Passing for 0–1 ILPs with Binary Decision Diagrams
EfficientNetV2: Smaller Models and Faster Training
Efficient Online Learning for Dynamic k-Clustering
Efficient Performance Bounds for Primal-Dual Reinforcement Learning from Demonstrations
Efficient Statistical Tests: A Neural Tangent Kernel Approach
Efficient Training of Robust Decision Trees Against Adversarial Examples
EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture
Elastic Graph Neural Networks
EL-Attention: Memory Efficient Lossless Attention for Generation
Elementary superexpressive activations
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
Emergent Social Learning via Multi-agent Reinforcement Learning
Emphatic Algorithms for Deep Reinforcement Learning
Encoding and Decoding Speech From the Human Brain
End-to-End Learning of Coherent Probabilistic Forecasts for Hierarchical Time Series
E(n) Equivariant Graph Neural Networks
Enhancing Robustness of Neural Networks through Fourier Stabilization
Ensemble Bootstrapping for Q-Learning
Environment Inference for Invariant Learning
Equivariant Learning of Stochastic Fields: Gaussian Processes and Steerable Conditional Neural Processes
Equivariant message passing for the prediction of tensorial properties and molecular spectra
Equivariant Networks for Pixelized Spheres
Esther Duflo, Plumbers and Mechanics: How ML can complement RCT in policy experiments
Estimating $\alpha$-Rank from A Few Entries with Low Rank Matrix Completion
Estimating Identifiable Causal Effects on Markov Equivalence Class through Double Machine Learning
Estimation and Quantization of Expected Persistence Diagrams
Evaluating Robustness of Predictive Uncertainty Estimation: Are Dirichlet-based Models Reliable?
Evaluating the Implicit Midpoint Integrator for Riemannian Hamiltonian Monte Carlo
Event Outlier Detection in Continuous Time
Evolving Attention with Residual Convolutions
Exact Gap between Generalization Error and Uniform Convergence in Random Feature Models
Exact Optimization of Conformal Predictors via Incremental and Decremental Learning
Examining and Combating Spurious Features under Distribution Shift
Explainable Automated Graph Representation Learning with Hyperparameter Importance
Explaining Time Series Predictions with Dynamic Masks
Explanations for Monotonic Classifiers.
Exploiting Shared Representations for Personalized Federated Learning
Exploiting structured data for learning contagious diseases under incomplete testing
Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning
Explore Visual Concept Formation for Image Classification
Exponential Lower Bounds for Batch Reinforcement Learning: Batch RL can be Exponentially Harder than Online RL
Exponentially Many Local Minima in Quantum Neural Networks
Exponential Reduction in Sample Complexity with Learning of Ising Model Dynamics
Expressive 1-Lipschitz Neural Networks for Robust Multiple Graph Learning against Adversarial Attacks
Factor-analytic inverse regression for high-dimension, small-sample dimensionality reduction
Fair Classification with Noisy Protected Attributes: A Framework with Provable Guarantees
Fairness and Bias in Online Selection
Fairness for Image Generation with Uncertain Sensitive Attributes
Fairness of Exposure in Stochastic Bandits
Fair Selective Classification Via Sufficiency
Fast active learning for pure exploration in reinforcement learning
Fast Algorithms for Stackelberg Prediction Game with Least Squares Loss
Faster Kernel Matrix Algebra via Density Estimation
Fast margin maximization via dual acceleration
Fast Projection Onto Convex Smooth Constraints
Fast Sketching of Polynomial Kernels of Polynomial Degree
Fast Stochastic Bregman Gradient Methods: Sharp Analysis and Variance Reduction
f-Domain Adversarial Learning: Theory and Algorithms
Feature Clustering for Support Identification in Extreme Regions
Federated Composite Optimization
Federated Continual Learning with Weighted Inter-client Transfer
Federated Deep AUC Maximization for Hetergeneous Data with a Constant Communication Complexity
Federated Learning of User Verification Models Without Sharing Embeddings
Federated Learning under Arbitrary Communication Patterns
Few-Shot Conformal Prediction with Auxiliary Tasks
Few-shot Language Coordination by Modeling Theory of Mind
Few-Shot Neural Architecture Search
FILTRA: Rethinking Steerable CNN by Filter Transform
Finding k in Latent $k-$ polytope
Finding Relevant Information via a Discrete Fourier Expansion
Finding the Stochastic Shortest Path with Low Regret: the Adversarial Cost and Unknown Transition Case
Finite mixture models do not reliably learn the number of components
Finite-Sample Analysis of Off-Policy Natural Actor-Critic Algorithm
First-Order Methods for Wasserstein Distributionally Robust MDP
Fixed-Parameter and Approximation Algorithms for PCA with Outliers
FL-NTK: A Neural Tangent Kernel-based Framework for Federated Learning Analysis
Flow-based Attribution in Graphical Models: A Recursive Shapley Approach
Fold2Seq: A Joint Sequence(1D)-Fold(3D) Embedding-based Generative Model for Protein Design
Follow-the-Regularized-Leader Routes to Chaos in Routing Games
FOP: Factorizing Optimal Joint Policy of Maximum-Entropy Multi-Agent Reinforcement Learning
From Local Structures to Size Generalization in Graph Neural Networks
From Local to Global Norm Emergence: Dissolving Self-reinforcing Substructures with Incremental Social Instruments
From ML research to ML products: A path towards building models with real-world impact
From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization
Functional Space Analysis of Local GAN Convergence
Function Contrastive Learning of Transferable Meta-Representations
Fundamental Tradeoffs in Distributionally Adversarial Training
Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech Translation
GANMEX: One-vs-One Attributions using GAN-based Model Explainability
Gaussian Process-Based Real-Time Learning for Safety Critical Applications
GBHT: Gradient Boosting Histogram Transform for Density Estimation
Generalised Lipschitz Regularisation Equals Distributional Robustness
Generalizable Episodic Memory for Deep Reinforcement Learning
Generalization Bounds in the Presence of Outliers: a Median-of-Means Study
Generalization Error Bound for Hyperbolic Ordinal Embedding
Generalization Guarantees for Neural Architecture Search with Train-Validation Split
Generalized Doubly Reparameterized Gradient Estimators
Generating images with sparse representations
Generative Adversarial Networks for Markovian Temporal Dynamics: Stochastic Continuous Data Generation
Generative Adversarial Transformers
Generative Causal Explanations for Graph Neural Networks
Generative Particle Variational Inference via Estimation of Functional Gradients
Generative Video Transformer: Can Objects be the Words?
GeomCA: Geometric Evaluation of Data Representations
Geometric convergence of elliptical slice sampling
Geometry of the Loss Landscape in Overparameterized Neural Networks: Symmetries and Invariances
Global Convergence of Policy Gradient for Linear-Quadratic Mean-Field Control/Game in Continuous Time
Global inducing point variational posteriors for Bayesian neural networks and deep Gaussian processes
Globally-Robust Neural Networks
Global Optimality Beyond Two Layers: Training Deep ReLU Networks via Convex Programs
Global Prosody Style Transfer Without Text Transcriptions
GLSearch: Maximum Common Subgraph Detection via Learning to Search
GMAC: A Distributional Perspective on Actor-Critic Framework
GNNAutoScale: Scalable and Expressive Graph Neural Networks via Historical Embeddings
Goal-Conditioned Reinforcement Learning with Imagined Subgoals
GP-Tree: A Gaussian Process Classifier for Few-Shot Incremental Learning
Gradient Disaggregation: Breaking Privacy in Federated Learning by Reconstructing the User Participant Matrix
GRAD-MATCH: Gradient Matching based Data Subset Selection for Efficient Deep Model Training
Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech
GRAND: Graph Neural Diffusion
Graph Contrastive Learning Automated
Graph Convolution for Semi-Supervised Classification: Improved Linear Separability and Out-of-Distribution Generalization
Graph Cuts Always Find a Global Optimum for Potts Models (With a Catch)
GraphDF: A Discrete Flow Model for Molecular Graph Generation
Graph Mixture Density Networks
Graph Neural Networks Inspired by Classical Iterative Algorithms
GraphNorm: A Principled Approach to Accelerating Graph Neural Network Training
Grey-box Extraction of Natural Language Models
Grid-Functioned Neural Networks
Grounding Language to Entities and Dynamics for Generalization in Reinforcement Learning
Group Fisher Pruning for Practical Network Compression
Group-Sparse Matrix Factorization for Transfer Learning of Word Embeddings
Guarantees for Tuning the Step Size using a Learning-to-Learn Approach
Guided Exploration with Proximal Policy Optimization using a Single Demonstration
HardCoRe-NAS: Hard Constrained diffeRentiable Neural Architecture Search
HAWQ-V3: Dyadic Neural Network Quantization
HEMET: A Homomorphic-Encryption-Friendly Privacy-Preserving Mobile Neural Network Architecture
Heterogeneity for the Win: One-Shot Federated Clustering
Heterogeneous Risk Minimization
"Hey, that's not an ODE": Faster ODE Adjoints via Seminorms
Hierarchical Agglomerative Graph Clustering in Nearly-Linear Time
Hierarchical Clustering of Data Streams: Scalable Algorithms and Approximation Guarantees
Hierarchical VAEs Know What They Don’t Know
High Confidence Generalization for Reinforcement Learning
High-dimensional Experimental Design and Kernel Bandits
High-Dimensional Gaussian Process Inference with Derivatives
High-Performance Large-Scale Image Recognition Without Normalization
Homomorphic Sensing: Sparsity and Noise
HoroPCA: Hyperbolic Dimensionality Reduction via Horospherical Projections
Householder Sketch for Accurate and Accelerated Least-Mean-Squares Solvers
How and Why to Use Experimental Data to Evaluate Methods for Observational Causal Inference
How could Neural Networks understand Programs?
How Do Adam and Training Strategies Help BNNs Optimization
How Does Loss Function Affect Generalization Performance of Deep Learning? Application to Human Age Estimation
How Framelets Enhance Graph Neural Networks
How Important is the Train-Validation Split in Meta-Learning?
How rotational invariance of common kernels prevents generalization in high dimensions
How to Learn when Data Reacts to Your Model: Performative Gradient Descent
Human-AI Collaboration in Sequential Decision-Making
HyperHyperNetwork for the Design of Antenna Arrays
Hyperparameter Selection for Imitation Learning
I-BERT: Integer-only BERT Quantization
ICML 2021 Workshop on Computational Biology
ICML 2021 Workshop on Unsupervised Reinforcement Learning
ICML Workshop on Algorithmic Recourse
ICML Workshop on Human in the Loop Learning (HILL)
ICML Workshop on Representation Learning for Finance and E-Commerce Applications
ICML Workshop on Theoretic Foundation, Criticism, and Application Trend of Explainable AI
iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients
Image-Level or Object-Level? A Tale of Two Resampling Strategies for Long-Tailed Detection
Imitation by Predicting Observations
Implicit Bias of Linear RNNs
Implicit-PDF: Non-Parametric Representation of Probability Distributions on the Rotation Manifold
Implicit rate-constrained optimization of non-decomposable objectives
Implicit Regularization in Tensor Factorization
Improved Algorithms for Agnostic Pool-based Active Classification
Improved Confidence Bounds for the Linear Logistic Model and Applications to Bandits
Improved Contrastive Divergence Training of Energy-Based Models
Improved Corruption Robust Algorithms for Episodic Reinforcement Learning
Improved Denoising Diffusion Probabilistic Models
Improved, Deterministic Smoothing for L_1 Certified Robustness
Improved OOD Generalization via Adversarial Training and Pretraing
Improved Regret Bound and Experience Replay in Regularized Policy Iteration
Improved Regret Bounds of Bilinear Bandits using Action Space Analysis
Improving Breadth-Wise Backpropagation in Graph Neural Networks Helps Learning Long-Range Dependencies.
Improving Generalization in Meta-learning via Task Augmentation
Improving Gradient Regularization using Complex-Valued Neural Networks
Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding
Improving Molecular Graph Neural Network Explainability with Orthonormalization and Induced Sparsity
Improving Predictors via Combination Across Diverse Task Categories
Improving Ultrametrics Embeddings Through Coresets
Incentivized Bandit Learning with Self-Reinforcing User Preferences
Incentivizing Compliance with Algorithmic Instruments
In-Database Regression in Input Sparsity Time
Inference for Network Regression Models with Community Structure
Inferring Latent Dynamics Underlying Neural Population Activity via Neural Differential Equations
Inferring serial correlation with dynamic backgrounds
Infinite-Dimensional Optimization for Zero-Sum Games via Variational Transport
Information Obfuscation of Graph Neural Networks
Information-Theoretic Methods for Rigorous, Responsible, and Reliable Machine Learning (ITR3)
INNF+: Invertible Neural Networks, Normalizing Flows, and Explicit Likelihood Models
Instabilities of Offline RL with Pre-Trained Neural Representation
Instance-Optimal Compressed Sensing via Posterior Sampling
Instance Specific Approximations for Submodular Maximization
Integer Programming for Causal Structure Learning in the Presence of Latent Variables
Integrated Defense for Resilient Graph Matching
Interaction-Grounded Learning
Interactive Learning from Activity Description
Intermediate Layer Optimization for Inverse Problems using Deep Generative Models
International Workshop on Federated Learning for User Privacy and Data Confidentiality in Conjunction with ICML 2021 (FL-ICML'21)
Interpretable Machine Learning in Healthcare
Interpretable Stability Bounds for Spectral Graph Filters
Interpretable Stein Goodness-of-fit Tests on Riemannian Manifold
Interpreting and Disentangling Feature Components of Various Complexity from DNNs
Inverse Constrained Reinforcement Learning
Inverse Decision Modeling: Learning Interpretable Representations of Behavior
Isometric Gaussian Process Latent Variable Model for Dissimilarity Data
Is Pessimism Provably Efficient for Offline RL?
Is Space-Time Attention All You Need for Video Understanding?
Joining datasets via data augmentation in the label space for neural networks
Joint Online Learning and Decision-making via Dual Mirror Descent
Just How Toxic is Data Poisoning? A Unified Benchmark for Backdoor and Data Poisoning Attacks
Just Train Twice: Improving Group Robustness without Training Group Information
KD3A: Unsupervised Multi-Source Decentralized Domain Adaptation via Knowledge Distillation
Kernel-Based Reinforcement Learning: A Finite-Time Analysis
Kernel Continual Learning
Kernel Stein Discrepancy Descent
Keyframe-Focused Visual Imitation Learning
KNAS: Green Neural Architecture Search
Knowledge Enhanced Machine Learning Pipeline against Diverse Adversarial Attacks
KO codes: inventing nonlinear encoding and decoding for reliable wireless communication via deep-learning
K-shot NAS: Learnable Weight-Sharing for NAS with K-shot Supernets
Label Distribution Learning Machine
Label Inference Attacks from Log-loss Scores
Label-Only Membership Inference Attacks
LAMDA: Label Matching Deep Domain Adaptation
Large-Margin Contrastive Learning with Distance Polarization Regularizer
Large-Scale Meta-Learning with Continual Trajectory Shifting
Large-Scale Multi-Agent Deep FBSDEs
Large Scale Private Learning via Low-rank Reparametrization
LARNet: Lie Algebra Residual Network for Face Recognition
Latent Programmer: Discrete Latent Codes for Program Synthesis
Latent Space Energy-Based Model of Symbol-Vector Coupling for Text Generation and Classification
Learn2Hop: Learned Optimization on Rough Landscapes
Learner-Private Convex Optimization
Learning and Planning in Average-Reward Markov Decision Processes
Learning and Planning in Complex Action Spaces
Learning a Universal Template for Few-shot Dataset Generalization
Learning Binary Decision Trees by Argmin Differentiation
Learning Bounds for Open-Set Learning
Learning by Turning: Neural Architecture Aware Optimisation
Learning Curves for Analysis of Deep Networks
Learning Deep Neural Networks under Agnostic Corrupted Supervision
Learning de-identified representations of prosody from raw audio
Learning disentangled representations via product manifold projection
Learning Diverse-Structured Networks for Adversarial Robustness
Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning
Learning from Biased Data: A Semi-Parametric Approach
Learning from History for Byzantine Robust Optimization
Learning from Nested Data with Ornstein Auto-Encoders
Learning from Noisy Labels with No Change to the Training Process
Learning from Similarity-Confidence Data
Learning Generalized Intersection Over Union for Dense Pixelwise Prediction
Learning Gradient Fields for Molecular Conformation Generation
Learning in Nonzero-Sum Stochastic Games with Potentials
Learning Interaction Kernels for Agent Systems on Riemannian Manifolds
Learning Intra-Batch Connections for Deep Metric Learning
Learning Neural Network Subspaces
Learning Node Representations Using Stationary Flow Prediction on Large Payment and Cash Transaction Networks
Learning Noise Transition Matrix from Only Noisy Labels via Total Variation Regularization
Learning Online Algorithms with Distributional Advice
Learning Optimal Auctions with Correlated Valuations from Samples
Learning Queueing Policies for Organ Transplantation Allocation using Interpretable Counterfactual Survival Analysis
Learning Randomly Perturbed Structured Predictors for Direct Loss Minimization
Learning Representations by Humans, for Humans
Learning Routines for Effective Off-Policy Reinforcement Learning
Learning Self-Modulating Attention in Continuous Time Space with Applications to Sequential Recommendation
Learning Stochastic Behaviour from Aggregate Data
Learning Task Informed Abstractions
Learning to Generate Noise for Multi-Attack Robustness
Learning to Price Against a Moving Target
Learning to Rehearse in Long Sequence Memorization
Learning to Weight Imperfect Demonstrations
Learning Transferable Visual Models From Natural Language Supervision
Learning While Playing in Mean-Field Games: Convergence and Optimality
Learn-to-Share: A Hardware-friendly Transfer Learning Framework Exploiting Computation and Parameter Sharing
LEGO: Latent Execution-Guided Reasoning for Multi-Hop Question Answering on Knowledge Graphs
Lenient Regret and Good-Action Identification in Gaussian Process Bandits
Let's Agree to Degree: Comparing Graph Convolutional Networks in the Message-Passing Framework
Leveraged Weighted Loss for Partial Label Learning
Leveraging Good Representations in Linear Contextual Bandits
Leveraging Language to Learn Program Abstractions and Search Heuristics
Leveraging Non-uniformity in First-order Non-convex Optimization
Leveraging Public Data for Practical Private Query Release
Leveraging Sparse Linear Layers for Debuggable Deep Networks
LieTransformer: Equivariant Self-Attention for Lie Groups
Light RUMs
LIME: Learning Inductive Bias for Primitives of Mathematical Reasoning
Linear Transformers Are Secretly Fast Weight Programmers
Link Prediction with Persistent Homology: An Interactive View
Lipschitz normalization for self-attention layers with application to graph neural networks
Local Algorithms for Finding Densely Connected Clusters
Local Correlation Clustering with Asymmetric Classification Errors
Locally Adaptive Label Smoothing Improves Predictive Churn
Locally Persistent Exploration in Continuous Control Tasks with Sparse Rewards
Locally Private k-Means in One Round
Logarithmic Regret for Reinforcement Learning with Linear Function Approximation
LogME: Practical Assessment of Pre-trained Models for Transfer Learning
Lossless Compression of Efficient Private Local Randomizers
Loss Surface Simplexes for Mode Connecting Volumes and Fast Ensembling
Lottery Ticket Preserves Weight Correlation: Is It Desirable or Not?
Lower-Bounded Proper Losses for Weakly Supervised Classification
Lower Bounds on Cross-Entropy Loss in the Presence of Test-time Adversaries
Low-Precision Reinforcement Learning: Running Soft Actor-Critic in Half Precision
Low-Rank Sinkhorn Factorization
LTL2Action: Generalizing LTL Instructions for Multi-Task RL
Machine Learning for Data: Automated Creation, Privacy, Bias
Machine Learning for Molecular Science
Machine Unlearning for Random Forests
Making Paper Reviewing Robust to Bid Manipulation Attacks
Making transport more robust and interpretable by moving data through a small number of anchor points
Mandoline: Model Evaluation under Distribution Shift
Marginal Contribution Feature Importance - an Axiomatic Approach for Explaining Data
Marginalized Stochastic Natural Gradients for Black-Box Variational Inference
MARINA: Faster Non-Convex Distributed Learning with Compression
Markpainting: Adversarial Machine Learning meets Inpainting
Massively Parallel and Asynchronous Tsetlin Machine Architecture Supporting Almost Constant-Time Scaling
Matrix Completion with Model-free Weighting
Matrix Sketching for Secure Collaborative Machine Learning
Maximum Mean Discrepancy Test is Aware of Adversarial Attacks
MC-LSTM: Mass-Conserving LSTM
Measuring Robustness in Deep Learning Based Compressive Sensing
Mediated Uncoupled Learning: Learning Functions without Direct Input-output Correspondences
Megaverse: Simulating Embodied Agents at One Million Experiences per Second
Memory Efficient Online Meta Learning
Memory-Efficient Pipeline-Parallel DNN Training
Message Passing Adaptive Resonance Theory for Online Active Semi-supervised Learning
Meta-Cal: Well-controlled Post-hoc Calibration by Ranking
MetaCURE: Meta Reinforcement Learning with Empowerment-Driven Exploration
Meta-Learning Bidirectional Update Rules
Meta Learning for Support Recovery in High-dimensional Precision Matrix Estimation
Meta-learning Hyperparameter Performance Prediction with Neural Processes
Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
Meta-Thompson Sampling
Mind the Box: $l_1$-APGD for Sparse Adversarial Attacks on Image Classifiers
Mixed Cross Entropy Loss for Neural Machine Translation
Mixed Nash Equilibria in the Adversarial Examples Game
Model-based Reinforcement Learning for Continuous Control with Posterior Sampling
Model-Based Reinforcement Learning via Latent-Space Collocation
Model Distillation for Revenue Optimization: Interpretable Personalized Pricing
Model-Free and Model-Based Policy Evaluation when Causality is Uncertain
Model-Free Reinforcement Learning: from Clipped Pseudo-Regret to Sample Complexity
Model Fusion for Personalized Learning
Modeling Hierarchical Structures with Continuous Recursive Neural Networks
Modelling Behavioural Diversity for Learning in Open-Ended Games
Model Performance Scaling with Multiple Data Sources
Model-Targeted Poisoning Attacks with Provable Convergence
Modularity in Reinforcement Learning via Algorithmic Independence in Credit Assignment
Momentum Residual Neural Networks
Monotonic Robust Policy Optimization with Model Discrepancy
Monte Carlo Variational Auto-Encoders
Moreau-Yosida $f$-divergences
More Powerful and General Selective Inference for Stepwise Feature Selection using Homotopy Method
MorphVAE: Generating Neural Morphologies from 3D-Walks using a Variational Autoencoder with Spherical Latent Space
MOTS: Minimax Optimal Thompson Sampling
MSA Transformer
Muesli: Combining Improvements in Policy Optimization
Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers
Multi-Dimensional Classification via Sparse Label Encoding
Multidimensional Scaling: Approximation and Complexity
Multi-group Agnostic PAC Learnability
Multi-layered Network Exploration via Random Walks: From Offline Optimization to Online Learning
Multiplicative Noise and Heavy Tails in Stochastic Optimization
Multiplying Matrices Without Multiplying
Multi-Receiver Online Bayesian Persuasion
Multiscale Invertible Generative Networks for High-Dimensional Bayesian Inference
Multi-Task Reinforcement Learning with Context-based Representations
MURAL: Meta-Learning Uncertainty-Aware Rewards for Outcome-Driven Reinforcement Learning
Narrow Margins: Classification, Margins and Fat Tails
Natural-XAI: Explainable AI with Natural Language Explanations
Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation
Near-Optimal Algorithms for Explainable k-Medians and k-Means
Near-Optimal Confidence Sequences for Bounded Random Variables
Near-Optimal Entrywise Anomaly Detection for Low-Rank Matrices with Sub-Exponential Noise
Near-Optimal Linear Regression under Distribution Shift
Near-Optimal Model-Free Reinforcement Learning in Non-Stationary Episodic MDPs
Near-Optimal Representation Learning for Linear Bandits and Linear RL
Near Optimal Reward-Free Reinforcement Learning
Necessary and sufficient conditions for causal feature selection in time series with latent common causes
Neighborhood Contrastive Learning Applied to Online Patient Monitoring
NeRF-VAE: A Geometry Aware 3D Scene Generative Model
Network Inference and Influence Maximization from Samples
Neural Architecture Search without Training
Neural Feature Matching in Implicit 3D Representations
Neural Pharmacodynamic State Space Modeling
Neural-Pull: Learning Signed Distance Function from Point clouds by Learning to Pull Space onto Surface
Neural Rough Differential Equations for Long Time Series
Neural SDEs as Infinite-Dimensional GANs
Neural Symbolic Regression that scales
Neural Tangent Generalization Attacks
Neural Transformation Learning for Deep Anomaly Detection Beyond Images
Neuro-algorithmic Policies Enable Fast Combinatorial Generalization
Newton Method over Networks is Fast up to the Statistical Precision
Noise and Fluctuation of Finite Learning Rate Stochastic Gradient Descent
Non-Autoregressive Electron Redistribution Modeling for Reaction Prediction
Nondeterminism and Instability in Neural Network Optimization
Non-Exponentially Weighted Aggregation: Regret Bounds for Unbounded Loss Functions
Nonmyopic Multifidelity Acitve Search
Non-Negative Bregman Divergence Minimization for Deep Direct Density Ratio Estimation
Nonparametric Decomposition of Sparse Tensors
Nonparametric Hamiltonian Monte Carlo
No-regret Algorithms for Capturing Events in Poisson Point Processes
Not All Memories are Created Equal: Learning to Forget by Expiring
Objective Bound Conditional Gaussian Process for Bayesian Optimization
Object Segmentation Without Labels with Large-Scale Generative Models
Oblivious Sketching-based Central Path Method for Linear Programming
Oblivious Sketching for Logistic Regression
Off-Belief Learning
Offline Contextual Bandits with Overparameterized Models
Offline Meta-Reinforcement Learning with Advantage Weighting
Offline Reinforcement Learning with Fisher Divergence Critic Regularization
Offline Reinforcement Learning with Pseudometric Learning
Off-Policy Confidence Sequences
Of Moments and Matching: A Game-Theoretic Framework for Closing the Imitation Gap
OmniNet: Omnidirectional Representations from Transformers
On a Combination of Alternating Minimization and Nesterov's Momentum
On Characterizing GAN Convergence Through Proximal Duality Gap
On Disentangled Representations Learned from Correlated Data
One for One, or All for All: Equilibria and Optimality of Collaboration in Federated Learning
On Energy-Based Models with Overparametrized Shallow Neural Networks
One Pass Late Fusion Multi-view Clustering
Oneshot Differentially Private Top-k Selection
One-sided Frank-Wolfe algorithms for saddle problems
On Estimation in Latent Variable Models
On Explainability of Graph Neural Networks via Subgraph Explorations
On Learnability via Gradient Method for Two-Layer ReLU Neural Networks in Teacher-Student Setting
On Limited-Memory Subsampling Strategies for Bandits
Online and non-stochastic control
Online A-Optimal Design and Active Linear Regression
On Linear Identifiability of Learned Representations
Online Graph Dictionary Learning
Online Learning for Load Balancing of Unknown Monotone Resource Allocation Games
Online Learning in Unknown Markov Games
Online Learning with Optimism and Delay
Online Limited Memory Neural-Linear Bandits with Likelihood Matching
Online Optimization in Games via Control Theory: Connecting Regret, Passivity and Poincaré Recurrence
Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with √T Regret
Online Selection Problems against Constrained Adversary
Online Submodular Resource Allocation with Applications to Rebalancing Shared Mobility Systems
Online Unrelated Machine Load Balancing with Predictions Revisited
On Lower Bounds for Standard and Robust Gaussian Process Bandit Optimization
On Monotonic Linear Interpolation of Neural Network Parameters
On-Off Center-Surround Receptive Fields for Accurate and Robust Image Classification
On Perceptual Lossy Compression: The Cost of Perceptual Reconstruction and An Optimal Training Framework
On-Policy Deep Reinforcement Learning for the Average-Reward Criterion
On Proximal Policy Optimization's Heavy-tailed Gradients
On Recovering from Modeling Errors Using Testing Bayesian Networks
On Reinforcement Learning with Adversarial Corruption and Its Application to Block MDP
On Reward-Free RL with Kernel and Neural Function Approximations: Single-Agent MDP and Markov Game
On Robust Mean Estimation under Coordinate-level Corruption
On Signal-to-Noise Ratio Issues in Variational Inference for Deep Gaussian Processes
On the Convergence of Hamiltonian Monte Carlo with Stochastic Gradients
On the difficulty of unbiased alpha divergence minimization
On the Explicit Role of Initialization on the Convergence and Implicit Bias of Overparametrized Linear Networks
On-the-fly Rectification for Robust Large-Vocabulary Topic Inference
On the Generalization Power of Overfitted Two-Layer Neural Tangent Kernel Models
On the Implicit Bias of Initialization Shape: Beyond Infinitesimal Mirror Descent
On the Inherent Regularization Effects of Noise Injection During Training
On the Optimality of Batch Policy Optimization Algorithms
On the Power of Localized Perceptron for Label-Optimal Learning of Halfspaces with Adversarial Noise
On the Predictability of Pruning Across Scales
On the price of explainability for some clustering problems
On the Problem of Underranking in Group-Fair Ranking
On the Proof of Global Convergence of Gradient Descent for Deep ReLU Networks with Linear Widths
On the Random Conjugate Kernel and Neural Tangent Kernel
On Variational Inference in Biclustering Models
Oops I Took A Gradient: Scalable Sampling for Discrete Distributions
Opening the Blackbox: Accelerating Neural Differential Equations by Regularizing Internal Solver Heuristics
Operationalizing Complex Causes: A Pragmatic View of Mediation
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
Optimal Complexity in Decentralized Training
Optimal Counterfactual Explanations in Tree Ensembles
Optimal Estimation of High Dimensional Smooth Additive Function Based on Noisy Observations
Optimal Non-Convex Exact Recovery in Stochastic Block Model via Projected Power Method
Optimal Off-Policy Evaluation from Multiple Logging Policies
Optimal regret algorithm for Pseudo-1d Bandit Convex Optimization
Optimal Streaming Algorithms for Multi-Armed Bandits
Optimal Thompson Sampling strategies for support-aware CVaR bandits
Optimal Transport Kernels for Sequential and Parallel Neural Architecture Search
Optimization of Graph Neural Networks: Implicit Acceleration by Skip Connections and More Depth
Optimization Planning for 3D ConvNets
Optimizing Black-box Metrics with Iterative Example Weighting
Optimizing persistent homology based functions
Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation
Order Matters: Probabilistic Modeling of Node Sequence for Graph Generation
Outlier-Robust Optimal Transport
Out-of-Distribution Generalization via Risk Extrapolation (REx)
Outside the Echo Chamber: Optimizing the Performative Risk
Overcoming Catastrophic Forgetting by Bayesian Generative Regularization
Over-parameterization: Pitfalls and Opportunities
PAC-Learning for Strategic Classification
PACOH: Bayes-Optimal Meta-Learning with PAC-Guarantees
PAGE: A Simple and Optimal Probabilistic Gradient Estimator for Nonconvex Optimization
PAPRIKA: Private Online False Discovery Rate Control
Parallel and Flexible Sampling from Autoregressive Models via Langevin Dynamics
Parallel Droplet Control in MEDA Biochips using Multi-Agent Reinforcement Learning
Parallelizing Legendre Memory Unit Training
Parallel tempering on optimized paths
Parameter-free Locally Accelerated Conditional Gradients
Parameterless Transductive Feature Re-representation for Few-Shot Learning
Parametric Graph for Unimodal Ranking Bandit
Pareto GAN: Extending the Representational Power of GANs to Heavy-Tailed Distributions
Partially Observed Exchangeable Modeling
Path Planning using Neural A* Search
PC-MLP: Model-based Reinforcement Learning with Policy Cover Guided Exploration
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training
Perceiver: General Perception with Iterative Attention
Permutation Weighting
Personalized Federated Learning using Hypernetworks
Phase Transitions, Distance Functions, and Implicit Neural Representations
Phasic Policy Gradient
PHEW : Constructing Sparse Networks that Learn Fast and Generalize Well without Training Data
PID Accelerated Value Iteration Algorithm
PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models
PixelTransformer: Sample Conditioned Signal Generation
PODS: Policy Optimization via Differentiable Simulation
Pointwise Binary Classification with Pairwise Confidence Comparisons
Poisson-Randomised DirBN: Large Mutation is Needed in Dirichlet Belief Networks
Policy Analysis using Synthetic Controls in Continuous-Time
Policy Caches with Successor Features
Policy Gradient Bayesian Robust Optimization for Imitation Learning
Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning
Poolingformer: Long Document Modeling with Pooling Attention
PopSkipJump: Decision-Based Attack for Probabilistic Classifiers
Positive-Negative Momentum: Manipulating Stochastic Gradient Noise to Improve Generalization
Positive-Negative Momentum: Manipulating Stochastic Gradient Noise to Improve Generalization
Posterior Value Functions: Hindsight Baselines for Policy Gradient Methods
Post-selection inference with HSIC-Lasso
Practical and Private (Deep) Learning Without Sampling or Shuffling
Prediction-Centric Learning of Independent Cascade Dynamics from Partial Observations
Predict then Interpolate: A Simple Algorithm to Learn Stable Classifiers
Preferential Temporal Difference Learning
Principal Bit Analysis: Autoencoding with Schur-Concave Loss
Principal Component Hierarchy for Sparse Quadratic Programs
Principled Exploration via Optimistic Bootstrapping and Backward Induction
Principled Simplicial Neural Networks for Trajectory Prediction
Prior Image-Constrained Reconstruction using Style-Based Generative Models
Prioritized Level Replay
Privacy in learning: Basics and the interplay
Privacy-Preserving Feature Selection with Secure Multiparty Computation
Privacy-Preserving Video Classification with Convolutional Neural Networks
Private Adaptive Gradient Methods for Convex Optimization
Private Alternating Least Squares: Practical Private Matrix Completion with Tighter Rates
Private Stochastic Convex Optimization: Optimal Rates in L1 Geometry
Probabilistic Generating Circuits
Probabilistic Programs with Stochastic Conditioning
Probabilistic Sequential Shrinking: A Best Arm Identification Algorithm for Stochastic Bandits with Corruptions
Problem Dependent View on Structured Thresholding Bandit Problems
ProGraML: A Graph-based Program Representation for Data Flow Analysis and Compiler Optimizations
Progressive-Scale Boundary Blackbox Attack via Projective Gradient Estimation
Projection Robust Wasserstein Barycenters
Projection techniques to update the truncated SVD of evolving matrices with applications
Provable Generalization of SGD-trained Neural Networks of Any Width in the Presence of Adversarial Label Noise
Provable Lipschitz Certification for Generative Models
Provable Meta-Learning of Linear Representations
Provable Robustness of Adversarial Training for Learning Halfspaces with Noise
Provably Correct Optimization and Exploration with Non-linear Policies
Provably Efficient Algorithms for Multi-Objective Competitive RL
Provably Efficient Fictitious Play Policy Optimization for Zero-Sum Markov Games with Structured Transitions
Provably Efficient Learning of Transferable Rewards
Provably Efficient Reinforcement Learning for Discounted MDPs with Feature Mapping
Provably End-to-end Label-noise Learning without Anchor Points
Provably Strict Generalisation Benefit for Equivariant Models
Proximal Causal Learning with Kernels: Two-Stage Estimation and Moment Restriction
PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning
Pure Exploration and Regret Minimization in Matching Bandits
Putting the ``Learning" into Learning-Augmented Algorithms for Frequency Estimation
Quantifying and Reducing Bias in Maximum Likelihood Estimation of Structured Anomalies
Quantifying Availability and Discovery in Recommender Systems via Stochastic Reachability
Quantifying Ignorance in Individual-Level Causal-Effect Estimates under Hidden Confounding
Quantifying the Benefit of Using Differentiable Learning over Tangent Kernels
Quantile Bandits for Best Arms Identification
Quantitative Understanding of VAE as a Non-linearly Scaled Isometric Embedding
Quantization Algorithms for Random Fourier Features
Quantum algorithms for reinforcement learning with a generative model
Quasi-global Momentum: Accelerating Decentralized Deep Learning on Heterogeneous Data
Query Complexity of Adversarial Attacks
Randomized Algorithms for Submodular Function Maximization with a $k$-System Constraint
Randomized Dimensionality Reduction for Facility Location and Single-Linkage Clustering
Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning
Randomized Exploration in Reinforcement Learning with General Value Function Approximation
Random Matrix Theory and ML (RMT+ML)
Rate-Distortion Analysis of Minimum Excess Risk in Bayesian Learning
RATT: Leveraging Unlabeled Data to Guarantee Generalization
Reasoning Over Virtual Knowledge Bases With Open Predicate Relations
Recomposing the Reinforcement Learning Building Blocks with Hypernetworks
Recovering AES Keys with a Deep Cold Boot Attack
Regret and Cumulative Constraint Violation Analysis for Online Convex Optimization with Long Term Constraints
Regret Minimization in Stochastic Non-Convex Learning via a Proximal-Gradient Approach
Regularized Online Allocation Problems: Fairness and Beyond
Regularized Submodular Maximization at Scale
Regularizing towards Causal Invariance: Linear Models with Proxies
Reinforcement Learning for Cost-Aware Markov Decision Processes
Reinforcement Learning for Real Life
Reinforcement Learning of Implicit and Explicit Control Flow Instructions
Reinforcement Learning Under Moral Uncertainty
Reinforcement Learning with Prototypical Representations
Relative Deviation Margin Bounds
Relative Positional Encoding for Transformers with Linear Complexity
REPAINT: Knowledge Transfer in Deep Reinforcement Learning
Representational aspects of depth and conditioning in normalizing flows
Representation Matters: Assessing the Importance of Subgroup Allocations in Training Data
Representation Matters: Offline Pretraining for Sequential Decision Making
Representation Subspace Distance for Domain Adaptation Regression
Reserve Price Optimization for First Price Auctions in Display Advertising
Resource Allocation in Multi-armed Bandit Exploration: Overcoming Sublinear Scaling with Adaptive Parallelism
Responsible AI in Industry: Practical Challenges and Lessons Learned
Rethinking Drug Discovery in the Era of Digital Biology
Rethinking Neural vs. Matrix-Factorization Collaborative Filtering: the Theoretical Perspectives
Rethinking Rotated Object Detection with Gaussian Wasserstein Distance Loss
Re-understanding Finite-State Representations of Recurrent Policy Networks
Revealing the Structure of Deep Neural Networks via Convex Duality
Revenue-Incentive Tradeoffs in Dynamic Reserve Pricing
Revisiting Peng's Q($\lambda$) for Modern Reinforcement Learning
Revisiting Point Cloud Shape Classification with a Simple and Effective Baseline
Revisiting Rainbow: Promoting more insightful and inclusive deep reinforcement learning research
Reward Identification in Inverse Reinforcement Learning
Riemannian Convex Potential Maps
Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning
Risk-Sensitive Reinforcement Learning with Function Approximation: A Debiasing Approach
Rissanen Data Analysis: Examining Dataset Characteristics via Description Length
RNNRepair: Automatic RNN Repair via Model-based Analysis
RNN with Particle Flow for Probabilistic Spatio-temporal Forecasting
Robust Asymmetric Learning in POMDPs
Robust Density Estimation from Batches: The Best Things in Life are (Nearly) Free
Robust Inference for High-Dimensional Linear Models via Residual Randomization
Robust Learning-Augmented Caching: An Experimental Study
Robust Learning for Data Poisoning Attacks
Robust Policy Gradient against Strong Data Corruption
Robust Pure Exploration in Linear Bandits with Limited Budget
Robust Reinforcement Learning using Least Squares Policy Iteration with Provable Performance Guarantees
Robust Representation Learning via Perceptual Similarity Metrics
Robust Testing and Estimation under Manipulation Attacks
Robust Unsupervised Learning via L-statistic Minimization
RRL: Resnet as representation for Reinforcement Learning
Run-Sort-ReRun: Escaping Batch Size Limitations in Sliced Wasserstein Generative Models
Safe Reinforcement Learning Using Advantage-Based Intervention
Safe Reinforcement Learning with Linear Function Approximation
SagaNet: A Small Sample Gated Network for Pediatric Cancer Diagnosis
SAINT-ACC: Safety-Aware Intelligent Adaptive Cruise Control for Autonomous Vehicles Using Deep Reinforcement Learning
Sample Complexity of Robust Linear Classification on Separated Data
Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity
Sample-Optimal PAC Learning of Halfspaces with Malicious Noise
Sawtooth Factorial Topic Embeddings Guided Gamma Belief Network
Scalable Certified Segmentation via Randomized Smoothing
Scalable Computations of Wasserstein Barycenter via Input Convex Neural Networks
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot
Scalable Marginal Likelihood Estimation for Model Selection in Deep Learning
Scalable Normalizing Flows for Permutation Invariant Densities
Scalable Optimal Transport in High Dimensions for Graph Distances, Embedding Alignment, and More
Scalable Variational Gaussian Processes via Harmonic Kernel Decomposition
Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing
Scaling Properties of Deep Residual Networks
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
SCC: an efficient deep reinforcement learning agent mastering the game of StarCraft II
SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual Policies
Segmenting Hybrid Trajectories using Latent ODEs
Selecting Data Augmentation for Simulating Interventions
Self-Attention for Computer Vision
Self-Damaging Contrastive Learning
Self-Improved Retrosynthetic Planning
Selfish Sparse RNN Training
Self Normalizing Flows
Self-Paced Context Evaluation for Contextual Reinforcement Learning
Self-supervised and Supervised Joint Training for Resource-rich Machine Translation
Self-supervised Graph-level Representation Learning with Local and Global Structure
Self-Supervised Learning for Reasoning and Perception
Self-Tuning for Data-Efficient Deep Learning
Sequential Domain Adaptation by Synthesizing Distributionally Robust Experts
SGA: A Robust Algorithm for Partial Recovery of Tree-Structured Graphical Models with Noisy Samples
SGLB: Stochastic Gradient Langevin Boosting
SG-PALM: a Fast Physically Interpretable Tensor Graphical Model
Sharf: Shape-conditioned Radiance Fields from a Single View
Sharing Less is More: Lifelong Learning in Deep Networks with Selective Layer Transfer
Sharper Generalization Bounds for Clustering
Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks
SiameseXML: Siamese Networks meet Extreme Classifiers with 100M Labels
SigGPDE: Scaling Sparse Gaussian Processes on Sequential Data
Signatured Deep Fictitious Play for Mean Field Games with Common Noise
SimAM: A Simple, Parameter-Free Attention Module for Convolutional Neural Networks
Simple and Effective VAE Training with Calibrated Decoders
Simultaneous Similarity-based Self-Distillation for Deep Metric Learning
Single Pass Entrywise-Transformed Low Rank Approximation
SinIR: Efficient General Image Manipulation with Single Image Reconstruction
Sinkhorn Label Allocation: Semi-Supervised Classification via Annealed Self-Training
Size-Invariant Graph Representations for Graph Classification Extrapolations
SketchEmbedNet: Learning Novel Concepts by Imitating Drawings
Skew Orthogonal Convolutions
SKIing on Simplices: Kernel Interpolation on the Permutohedral Lattice for Scalable Gaussian Processes
Skill Discovery for Exploration and Planning using Deep Skill Graphs
Sliced Iterative Normalizing Flows
Slot Machines: Discovering Winning Combinations of Random Weights in Neural Networks
SMG: A Shuffling Gradient-Based Method with Momentum
Smooth $p$-Wasserstein Distance: Structure, Empirical Approximation, and Statistical Applications
Social Implications of Large Language Models
Soft then Hard: Rethinking the Quantization in Neural Image Compression
Solving Challenging Dexterous Manipulation Tasks With Trajectory Optimisation and Reinforcement Learning
Solving high-dimensional parabolic PDEs using the tensor train format
Solving Inverse Problems with a Flow-based Noise Model
SoundDet: Polyphonic Moving Sound Event Detection and Localization from Raw Waveform
SPADE: A Spectral Method for Black-Box Adversarial Robustness Evaluation
Sparse and Imperceptible Adversarial Attack via a Homotopy Algorithm
Sparse Bayesian Learning via Stepwise Regression
SparseBERT: Rethinking the Importance Analysis in Self-attention
Sparse Feature Selection Makes Batch Reinforcement Learning More Sample Efficient
Sparse within Sparse Gaussian Processes using Neighbor Information
Sparsifying Networks via Subdifferential Inclusion
Sparsity-Agnostic Lasso Bandit
Spectral Normalisation for Deep Reinforcement Learning: An Optimisation Perspective
Spectral Smoothing Unveils Phase Transitions in Hierarchical Variational Autoencoders
Spectral vertex sparsifiers and pair-wise spanners over distributed graphs
SpreadsheetCoder: Formula Prediction from Semi-structured Context
Stability and Convergence of Stochastic Gradient Clipping: Beyond Lipschitz Continuity and Smoothness
Stability and Generalization of Stochastic Gradient Methods for Minimax Problems
Stabilizing Equilibrium Models by Jacobian Regularization
State Entropy Maximization with Random Encoders for Efficient Exploration
State Relevance for Off-Policy Evaluation
Statistical Estimation from Dependent Data
Stochastic Iterative Graph Matching
Stochastic Multi-Armed Bandits with Unrestricted Delay Distributions
Stochastic Sign Descent Methods: New Algorithms and Better Theory
Straight to the Gradient: Learning to Use Novel Tokens for Neural Text Generation
Strategic Classification in the Dark
Strategic Classification Made Practical
Streaming and Distributed Algorithms for Robust Column Subset Selection
Streaming Bayesian Deep Tensor Factorization
STRODE: Stochastic Boundary Ordinary Differential Equation
Structured Convolutional Kernel Networks for Airline Crew Scheduling
Structured World Belief for Reinforcement Learning in POMDP
Submodular Maximization subject to a Knapsack Constraint: Combinatorial Algorithms with Near-optimal Adaptive Complexity
Subset Selection in Machine Learning: From Theory to Applications
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
Supervised Tree-Wasserstein Distance
Symmetric Spaces for Graph Embeddings: A Finsler-Riemannian Approach
Synthesizer: Rethinking Self-Attention for Transformer Models
Synthetic Healthcare Data Generation and Assessment: Challenges, Methods, and Impact on Machine Learning
Systematic Analysis of Cluster Similarity Indices: How to Validate Validation Measures
Tackling Climate Change with Machine Learning
Targeted Data Acquisition for Evolving Negotiation Agents
Task-Optimal Exploration in Linear Dynamical Systems
Taylor Expansion of Discount Factors
TeachMyAgent: a Benchmark for Automatic Curriculum Learning in Deep RL
Temporal Difference Learning as Gradient Splitting
Temporally Correlated Task Scheduling for Sequence Learning
Temporal Predictive Coding For Model-Based Planning In Latent Space
TempoRL: Learning When to Act
Tensor Programs IIb: Architectural Universality Of Neural Tangent Kernel Training Dynamics
Tensor Programs IV: Feature Learning in Infinite-Width Neural Networks
TeraPipe: Token-Level Pipeline Parallelism for Training Large-Scale Language Models
Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning
Testing DNN-based Autonomous Driving Systems under Critical Environmental Conditions
Testing Group Fairness via Optimal Transport Projections
TFix: Learning to Fix Coding Errors with a Text-to-Text Transformer
The Distributed Discrete Gaussian Mechanism for Federated Learning with Secure Aggregation
The Earth Mover's Pinball Loss: Quantiles for Histogram-Valued Regression
The Emergence of Individuality
The Heavy-Tail Phenomenon in SGD
The Hintons in your Neural Network: a Quantum Field Theory View of Deep Learning
The Impact of Record Linkage on Learning from Feature Partitioned Data
The Implicit Bias for Adaptive Optimization Algorithms on Homogeneous Neural Networks
The Limits of Min-Max Optimization Algorithms: Convergence to Spurious Non-Critical Sets
The Lipschitz Constant of Self-Attention
The Logical Options Framework
The Neglected Assumptions In Causal Inference
Theory and Foundation of Continual Learning
Theory and Practice of Differential Privacy
Theory of Spectral Method for Union of Subspaces-Based Random Geometry Graph
The Power of Adaptivity for Stochastic Submodular Cover
The Power of Log-Sum-Exp: Sequential Density Ratio Matrix Estimation for Speed-Accuracy Optimization
The Symmetry between Arms and Knapsacks: A Primal-Dual Approach for Bandits with Knapsacks
Think Global and Act Local: Bayesian Optimisation over High-Dimensional Categorical and Mixed Search Spaces
Thinking Like Transformers
Three Operator Splitting with a Nonconvex Loss Function
Tight Bounds on the Smallest Eigenvalue of the Neural Tangent Kernel for Deep ReLU Networks
Tightening the Dependence on Horizon in the Sample Complexity of Q-Learning
Tighter Bounds on the Log Marginal Likelihood of Gaussian Process Regression Using Conjugate Gradients
Tilting the playing field: Dynamical loss functions for machine learning
Time Series Workshop
To be Robust or to be Fair: Towards Fairness in Adversarial Training
Top-k eXtreme Contextual Bandits with Arm Hierarchy
Toward Better Generalization Bounds with Locally Elastic Stability
Towards Better Laplacian Representation in Reinforcement Learning with Generalized Graph Drawing
Towards Better Robust Generalization with Shift Consistency Regularization
Towards Certifying L-infinity Robustness using Neural Networks with L-inf-dist Neurons
Towards Defending against Adversarial Examples via Attack-Invariant Features
Towards Distraction-Robust Active Visual Tracking
Towards Domain-Agnostic Contrastive Learning
Towards Open Ad Hoc Teamwork Using Graph-based Policy Learning
Towards Open-World Recommendation: An Inductive Model-based Collaborative Filtering Approach
Towards Practical Mean Bounds for Small Samples
Towards Rigorous Interpretations: a Formalisation of Feature Attribution
Towards the Unification and Robustness of Perturbation and Gradient Based Explanations
Towards Tight Bounds on the Sample Complexity of Average-reward MDPs
Towards Understanding and Mitigating Social Biases in Language Models
Towards Understanding Learning in Neural Networks with Linear Teachers
Toward Understanding the Feature Learning Process of Self-supervised Contrastive Learning
Tractable structured natural-gradient descent using local parameterizations
Training Adversarially Robust Sparse Networks via Bayesian Connectivity Sampling
Training data-efficient image transformers & distillation through attention
Training Data Subset Selection for Regression with Controlled Generalization Error
Training Graph Neural Networks with 1000 Layers
Training Quantized Neural Networks to Global Optimality via Semidefinite Programming
Training Recurrent Neural Networks via Forward Propagation Through Time
Train simultaneously, generalize better: Stability of gradient-based minimax learners
Trajectory Diversity for Zero-Shot Coordination
Transfer-Based Semantic Anomaly Detection
Trees with Attention for Set Prediction Tasks
T-SCI: A Two-Stage Conformal Inference Algorithm with Guaranteed Coverage for Cox-MLP
Two Heads are Better Than One: Hypergraph-Enhanced Graph Reasoning for Visual Event Ratiocination
Two-way kernel matrix puncturing: towards resource-efficient PCA and spectral clustering
UCB Momentum Q-learning: Correcting the bias without forgetting
Unbalanced minibatch Optimal Transport; applications to Domain Adaptation
Unbiased Gradient Estimation in Unrolled Computation Graphs with Persistent Evolution Strategies
Uncertainty and Robustness in Deep Learning
Uncertainty Principles of Encoding GANs
Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning
Uncovering the Connections Between Adversarial Transferability and Knowledge Transferability
Understanding and Mitigating Accuracy Disparity in Regression
Understanding Failures in Out-of-Distribution Detection with Deep Generative Models
Understanding Instance-Level Label Noise: Disparate Impacts and Treatments
Understanding Invariance via Feedforward Inversion of Discriminatively Trained Classifiers
Understanding Noise Injection in GANs
Understanding self-supervised learning dynamics without contrastive pairs
Understanding the Dynamics of Gradient Flow in Overparameterized Linear models
UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning
UnICORNN: A recurrent model for learning very long time dependencies
Unified Robust Semi-Supervised Variational Autoencoder
Uniform Convergence, Adversarial Spheres and a Simple Remedy
Unifying Vision-and-Language Tasks via Text Generation
UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data
Unitary Branching Programs: Learnability and Lower Bounds
Unsupervised Co-part Segmentation through Assembly
Unsupervised Embedding Adaptation via Early-Stage Feature Reconstruction for Few-Shot Classification
Unsupervised Learning for Reinforcement Learning
Unsupervised Learning of Visual 3D Keypoints for Control
Unsupervised Part Representation by Flow Capsules
Unsupervised Representation Learning via Neural Activation Coding
Unsupervised Skill Discovery with Bottleneck Option Learning
Valid Causal Inference with (Some) Invalid Instruments
Value Alignment Verification
Value-at-Risk Optimization with Gaussian Processes
Value Iteration in Continuous Actions, States and Time
Variance Reduced Training with Stratified Sampling for Forecasting Models
Variance Reduction via Primal-Dual Accelerated Dual Averaging for Nonsmooth Convex Finite-Sums
Variational Auto-Regressive Gaussian Processes for Continual Learning
Variational Data Assimilation with a Learned Inverse Observation Operator
Variational Empowerment as Representation Learning for Goal-Conditioned Reinforcement Learning
Variational (Gradient) Estimate of the Score Function in Energy-based Latent Variable Models
Vector Quantized Models for Planning
Versatile Verification of Tree Ensembles
ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision
Voice2Series: Reprogramming Acoustic Models for Time Series Classification
Wasserstein Distributional Normalization For Robust Distributional Certification of Noisy Labeled Data
Watermarking Deep Neural Networks with Greedy Residuals
Weight-covariance alignment for adversarially robust neural networks
Weisfeiler and Lehman Go Topological: Message Passing Simplicial Networks
WGAN with an Infinitely Wide Generator Has No Spurious Stationary Points
What Are Bayesian Neural Network Posteriors Really Like?
What does LIME really see in images?
What Does Rotation Prediction Tell Us about Classifier Accuracy under Varying Testing Environments?
What Makes for End-to-End Object Detection?
What's in the Box? Exploring the Inner Life of Neural Networks with Robust Rules
When All We Need is a Piece of the Pie: A Generic Framework for Optimizing Two-way Partial AUC
When Does Data Augmentation Help With Membership Inference Attacks?
Which transformer architecture fits my data? A vocabulary bottleneck in self-attention
Whitening and Second Order Optimization Both Make Information in the Dataset Unusable During Training, and Can Reduce or Prevent Generalization
Whitening for Self-Supervised Representation Learning
Whittle Networks: A Deep Likelihood Model for Time Series
WILDS: A Benchmark of in-the-Wild Distribution Shifts
Winograd Algorithm for AdderNet
Workshop on Computational Approaches to Mental Health @ ICML 2021
Workshop on Distribution-Free Uncertainty Quantification
Workshop on Reinforcement Learning Theory
Workshop on Socially Responsible Machine Learning
World Model as a Graph: Learning Latent Landmarks for Planning
XOR-CD: Linearly Convergent Constrained Structure Generation
You Only Sample (Almost) Once: Linear Cost Self-Attention Via Bernoulli Sampling
Zero-Shot Knowledge Distillation from a Decision-Based Black-Box Model
Zero-Shot Text-to-Image Generation
Zeroth-Order Non-Convex Learning via Hierarchical Dual Averaging
Z-GCNETs: Time Zigzags at Graph Convolutional Networks for Time Series Forecasting
Zoo-Tuning: Adaptive Transfer from A Zoo of Models
Sparsity in Deep Learning: Pruning and growth for efficient inference and training