78 Results

Poster
Tue 7:00 Random Hypervolume Scalarizations for Provable Multi-Objective Black Box Optimization
Richard Zhang, Daniel Golovin
Poster
Tue 7:00 Concentration bounds for CVaR estimation: The cases of light-tailed and heavy-tailed distributions
Prashanth L.A., Krishna Jagannathan, Ravi Kolla
Poster
Tue 7:00 Online Learning for Active Cache Synchronization
Andrey Kolobov, Sebastien Bubeck, Julian Zimmert
Poster
Tue 7:00 Combinatorial Pure Exploration for Dueling Bandit
Wei Chen, Yihan Du, Longbo Huang, Haoyu Zhao
Poster
Tue 7:00 Distributionally Robust Policy Evaluation and Learning in Offline Contextual Bandits
Nian Si, Fan Zhang, Zhengyuan Zhou, Jose Blanchet
Poster
Tue 8:00 Online Pricing with Offline Data: Phase Transition and Inverse Square Law
Jinzhi Bu, David Simchi-Levi, Yunzong Xu
Poster
Tue 8:00 FormulaZero: Distributionally Robust Online Adaptation via Offline Population Synthesis
Aman Sinha, Matthew O'Kelly, Hongrui Zheng, Rahul Mangharam, John Duchi, Russ Tedrake
Poster
Tue 8:00 Boosting for Control of Dynamical Systems
Naman Agarwal, Nataly Brukhim, Elad Hazan, Zhou Lu
Poster
Tue 8:00 Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition
Chi Jin, Tiancheng Jin, Haipeng Luo, Suvrit Sra, Tiancheng Yu
Poster
Tue 9:00 Neural Contextual Bandits with UCB-based Exploration
Dongruo Zhou, Lihong Li, Quanquan Gu
Poster
Tue 9:00 Beyond UCB: Optimal and Efficient Contextual Bandits with Regression Oracles
Dylan Foster, Alexander Rakhlin
Poster
Tue 10:00 A simpler approach to accelerated optimization: iterative averaging meets optimism
Pooria Joulani, Anant Raj, András György, Csaba Szepesvari
Poster
Tue 10:00 Logarithmic Regret for Adversarial Online Control
Dylan Foster, Max Simchowitz
Poster
Tue 10:00 Influence Diagram Bandits: Variational Thompson Sampling for Structured Bandit Problems
Tong Yu, Branislav Kveton, Zheng Wen, Ruiyi Zhang, Ole J. Mengshoel
Poster
Tue 10:00 Online Control of the False Coverage Rate and False Sign Rate
Asaf Weinstein, Aaditya Ramdas
Poster
Tue 11:00 Bandits with Adversarial Scaling
Thodoris Lykouris, Vahab Mirrokni, Renato Leme
Poster
Tue 11:00 Kernel Methods for Cooperative Multi-Agent Contextual Bandits
Abhimanyu Dubey, Alex `Sandy' Pentland
Poster
Tue 11:00 Optimal Non-parametric Learning in Repeated Contextual Auctions with Strategic Buyer
Alexey Drutsa
Poster
Tue 11:00 Near-linear time Gaussian process optimization with adaptive batching and resparsification
Daniele Calandriello, Luigi Carratino, Alessandro Lazaric, Michal Valko, Lorenzo Rosasco
Poster
Tue 12:00 Restarted Bayesian Online Change-point Detector achieves Optimal Detection Delay
REDA ALAMI, Odalric-Ambrym Maillard, Raphaël Féraud
Poster
Tue 13:00 Non-Stationary Delayed Bandits with Intermediate Observations
Claire Vernade, András György, King Tim Mann
Poster
Tue 13:00 Meta-learning with Stochastic Linear Bandits
Leonardo Cella, Alessandro Lazaric, Massimiliano Pontil
Poster
Tue 13:00 Online Convex Optimization in the Random Order Model
Dan Garber, Gal Korcia, Kfir Levy
Poster
Tue 13:00 Learning with Good Feature Representations in Bandits and in RL with a Generative Model
Tor Lattimore, Csaba Szepesvari, Gellért Weisz
Poster
Tue 13:00 Adaptive Sampling for Estimating Probability Distributions
Shubhanshu Shekhar, Tara Javidi, Mohammad Ghavamzadeh
Poster
Tue 14:00 Logarithmic Regret for Learning Linear Quadratic Regulators Efficiently
Asaf Cassel, Alon Cohen, Tomer Koren
Poster
Tue 15:00 Thompson Sampling Algorithms for Mean-Variance Bandits
Qiuyu Zhu, Vincent Tan
Poster
Wed 5:00 Best Arm Identification for Cascading Bandits in the Fixed Confidence Setting
Zixin Zhong, Wang Chi Cheung, Vincent Tan
Poster
Wed 5:00 Fast and Three-rious: Speeding Up Weak Supervision with Triplet Methods
Dan Fu, Mayee Chen, Fred Sala, Sarah Hooper, Kayvon Fatahalian, Christopher Re
Poster
Wed 5:00 Finite-Time Last-Iterate Convergence for Multi-Agent Learning in Games
Darren Lin, Zhengyuan Zhou, Panayotis Mertikopoulos, Michael Jordan
Poster
Wed 5:00 Online Bayesian Moment Matching based SAT Solver Heuristics
Haonan Duan, Saeed Nejati, George Trimponias, Pascal Poupart, Vijay Ganesh
Poster
Wed 8:00 Structure Adaptive Algorithms for Stochastic Bandits
Rémy Degenne, Han Shao, Wouter Koolen
Poster
Wed 8:00 Budgeted Online Influence Maximization
Pierre Perrault, Jen Healey, Zheng Wen, Michal Valko
Poster
Wed 8:00 Robust Outlier Arm Identification
Yinglun Zhu, Sumeet Katariya, Robert Nowak
Poster
Wed 8:00 Online mirror descent and dual averaging: keeping pace in the dynamic case
Huang Fang, Nick Harvey, Victor Sanches Portella, Michael Friedlander
Poster
Wed 8:00 Fine-Grained Analysis of Stability and Generalization for Stochastic Gradient Descent
Yunwen Lei, Yiming Ying
Poster
Wed 8:00 On conditional versus marginal bias in multi-armed bandits
Jaehyeok Shin, Aaditya Ramdas, Alessandro Rinaldo
Poster
Wed 8:00 Exploration Through Reward Biasing: Reward-Biased Maximum Likelihood Estimation for Stochastic Multi-Armed Bandits
Xi Liu, Ping-Chun Hsieh, Yu Heng Hung, Anirban Bhattacharya, P. Kumar
Poster
Wed 8:00 (Locally) Differentially Private Combinatorial Semi-Bandits
Xiaoyu Chen, Kai Zheng, Zixin(Jack) Zhou, Yunchang Yang, Wei Chen, Liwei Wang
Poster
Wed 8:00 Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism
Wang Chi Cheung, David Simchi-Levi, Ruihao Zhu
Poster
Wed 8:00 Structured Linear Contextual Bandits: A Sharp and Geometric Smoothed Analysis
Vidyashankar Sivakumar, Steven Wu, Arindam Banerjee
Poster
Wed 9:00 Optimal Sequential Maximization: One Interview is Enough!
Moein Falahatgar, Alon Orlitsky, Venkatadheeraj Pichapati
Poster
Wed 9:00 Dual Mirror Descent for Online Allocation Problems
Santiago Balseiro, Haihao Lu, Vahab Mirrokni
Poster
Wed 9:00 Cooperative Multi-Agent Bandits with Heavy Tails
Abhimanyu Dubey, Alex `Sandy' Pentland
Poster
Wed 10:00 Stochastic bandits with arm-dependent delays
Anne Gael Manegueu, Claire Vernade, Alexandra Carpentier, Michal Valko
Poster
Wed 11:00 A Finite-Time Analysis of Q-Learning with Neural Network Function Approximation
Pan Xu, Quanquan Gu
Poster
Wed 11:00 Gamification of Pure Exploration for Linear Bandits
Rémy Degenne, Pierre Menard, Xuedong Shang, Michal Valko
Poster
Wed 12:00 My Fair Bandit: Distributed Learning of Max-Min Fairness with Multi-player Bandits
Ilai Bistritz, Tavor Z Baharav, Amir Leshem, Nicholas Bambos
Poster
Wed 12:00 Reserve Pricing in Repeated Second-Price Auctions with Strategic Bidders
Alexey Drutsa
Poster
Wed 14:00 Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei, Mehdi Jafarnia, Haipeng Luo, Hiteshi Sharma, Rahul Jain
Poster
Wed 16:00 Online Dense Subgraph Discovery via Blurred-Graph Feedback
Yuko Kuroki, Atsushi Miyauchi, Junya Honda, Masashi Sugiyama
Poster
Wed 16:00 From Chaos to Order: Symmetry and Conservation Laws in Game Dynamics
Sai Ganesh Nagarajan, David Balduzzi, Georgios Piliouras
Poster
Thu 6:00 Online Learning with Dependent Stochastic Feedback Graphs
Corinna Cortes, Giulia DeSalvo, Claudio Gentile, Mehryar Mohri, Ningshan Zhang
Poster
Thu 6:00 Improved Sleeping Bandits with Stochastic Action Sets and Adversarial Rewards
Aadirupa Saha, Pierre Gaillard, Michal Valko
Poster
Thu 6:00 Improved Bounds on Minimax Regret under Logarithmic Loss via Self-Concordance
Blair Bilodeau, Dylan Foster, Daniel Roy
Poster
Thu 6:00 Bandits for BMO Functions
Tianyu Wang, Cynthia Rudin
Poster
Thu 7:00 No-Regret and Incentive-Compatible Online Learning
Rupert Freeman, David Pennock, Chara Podimata, Jenn Wortman Vaughan
Poster
Thu 7:00 The Intrinsic Robustness of Stochastic Bandits to Strategic Manipulation
Zhe Feng, David Parkes, Haifeng Xu
Poster
Thu 7:00 Online Learning with Imperfect Hints
Aditya Bhaskara, Ashok Cutkosky, Ravi Kumar, Manish Purohit
Poster
Thu 7:00 When Demands Evolve Larger and Noisier: Learning and Earning in a Growing Environment
Feng Zhu, Zeyu Zheng
Poster
Thu 7:00 Doubly robust off-policy evaluation with shrinkage
Yi Su, Maria Dimakopoulou, Akshay Krishnamurthy, Miro Dudik
Poster
Thu 7:00 Model-Based Reinforcement Learning with Value-Targeted Regression
Alex Ayoub, Zeyu Jia, Csaba Szepesvari, Mengdi Wang, Lin Yang
Poster
Thu 8:00 Parameter-free, Dynamic, and Strongly-Adaptive Online Learning
Ashok Cutkosky
Poster
Thu 9:00 Improved Optimistic Algorithms for Logistic Bandits
Louis Faury, Marc Abeille, Clément Calauzènes, Olivier Fercoq
Poster
Thu 9:00 From PAC to Instance-Optimal Sample Complexity in the Plackett-Luce Model
Aadirupa Saha, Aditya Gopalan
Poster
Thu 12:00 Near-optimal Regret Bounds for Stochastic Shortest Path
Aviv Rosenberg, Alon Cohen, Yishay Mansour, Haim Kaplan
Poster
Thu 12:00 Bisection-Based Pricing for Repeated Contextual Auctions against Strategic Buyer
Anton Zhiyanov, Alexey Drutsa
Poster
Thu 12:00 Information Particle Filter Tree: An Online Algorithm for POMDPs with Belief-Based Rewards on Continuous Domains
Johannes Fischer, Sahin Tas
Poster
Thu 12:00 Real-Time Optimisation for Online Learning in Auctions
Lorenzo Croissant, Marc Abeille, Clément Calauzènes
Poster
Thu 12:00 Naive Exploration is Optimal for Online LQR
Max Simchowitz, Dylan Foster
Poster
Thu 12:00 On Thompson Sampling with Langevin Algorithms
Eric Mazumdar, Aldo Pacchiano, Yian Ma, Michael Jordan, Peter Bartlett
Poster
Thu 12:00 Gradient-free Online Learning in Continuous Games with Delayed Rewards
Amélie Héliou, Panayotis Mertikopoulos, Zhengyuan Zhou
Poster
Thu 13:00 Towards non-parametric drift detection via Dynamic Adapting Window Independence Drift Detection (DAWIDD)
Fabian Hinder, André Artelt, CITEC Barbara Hammer
Poster
Thu 13:00 Neural Topic Modeling with Continual Lifelong Learning
Pankaj Gupta, Yatin Chaudhary, Thomas Runkler, Hinrich Schuetze
Poster
Thu 13:00 Preselection Bandits
Viktor Bengs, Eyke Hüllermeier
Poster
Thu 14:00 Linear bandits with Stochastic Delayed Feedback
Claire Vernade, Alexandra Carpentier, Tor Lattimore, Giovanni Zappella, Beyza Ermis, Michael Brueckner
Poster
Thu 17:00 Multinomial Logit Bandit with Low Switching Cost
Kefan Dong, Yingkai Li, Qin Zhang, Yuan Zhou
Poster
Thu 17:00 Projection-free Distributed Online Convex Optimization with $O(\sqrt{T})$ Communication Complexity
Yuanyu Wan, Wei-Wei Tu, Lijun Zhang