51 Results

Poster
Tue 7:00 Concentration bounds for CVaR estimation: The cases of light-tailed and heavy-tailed distributions
Prashanth L.A., Krishna Jagannathan, Ravi Kolla
Poster
Tue 7:00 Online Learning for Active Cache Synchronization
Andrey Kolobov, Sebastien Bubeck, Julian Zimmert
Poster
Tue 7:00 Distributionally Robust Policy Evaluation and Learning in Offline Contextual Bandits
Nian Si, Fan Zhang, Zhengyuan Zhou, Jose Blanchet
Poster
Tue 7:00 Combinatorial Pure Exploration for Dueling Bandit
Wei Chen, Yihan Du, Longbo Huang, Haoyu Zhao
Poster
Tue 7:00 Random Hypervolume Scalarizations for Provable Multi-Objective Black Box Optimization
Richard Zhang, Daniel Golovin
Poster
Tue 8:00 Online Pricing with Offline Data: Phase Transition and Inverse Square Law
Jinzhi Bu, David Simchi-Levi, Yunzong Xu
Poster
Tue 9:00 Beyond UCB: Optimal and Efficient Contextual Bandits with Regression Oracles
Dylan Foster, Alexander Rakhlin
Poster
Tue 9:00 Neural Contextual Bandits with UCB-based Exploration
Dongruo Zhou, Lihong Li, Quanquan Gu
Poster
Tue 10:00 Influence Diagram Bandits: Variational Thompson Sampling for Structured Bandit Problems
Tong Yu, Branislav Kveton, Zheng Wen, Ruiyi Zhang, Ole J. Mengshoel
Poster
Tue 10:00 A simpler approach to accelerated optimization: iterative averaging meets optimism
Pooria Joulani, Anant Raj, András György, Csaba Szepesvari
Poster
Tue 10:00 Online Control of the False Coverage Rate and False Sign Rate
Asaf Weinstein, Aaditya Ramdas
Poster
Tue 11:00 Near-linear time Gaussian process optimization with adaptive batching and resparsification
Daniele Calandriello, Luigi Carratino, Alessandro Lazaric, Michal Valko, Lorenzo Rosasco
Poster
Tue 11:00 Bandits with Adversarial Scaling
Thodoris Lykouris, Vahab Mirrokni, Renato Leme
Poster
Tue 12:00 Restarted Bayesian Online Change-point Detector achieves Optimal Detection Delay
REDA ALAMI, Odalric-Ambrym Maillard, Raphaël Féraud
Poster
Tue 13:00 Adaptive Sampling for Estimating Probability Distributions
Shubhanshu Shekhar, Tara Javidi, Mohammad Ghavamzadeh
Poster
Tue 13:00 Non-Stationary Delayed Bandits with Intermediate Observations
Claire Vernade, András György, King Tim Mann
Poster
Tue 13:00 Online Convex Optimization in the Random Order Model
Dan Garber, Gal Korcia, Kfir Levy
Poster
Tue 15:00 Thompson Sampling Algorithms for Mean-Variance Bandits
Qiuyu Zhu, Vincent Tan
Poster
Wed 5:00 Best Arm Identification for Cascading Bandits in the Fixed Confidence Setting
Zixin Zhong, Wang Chi Cheung, Vincent Tan
Poster
Wed 8:00 Structured Linear Contextual Bandits: A Sharp and Geometric Smoothed Analysis
Vidyashankar Sivakumar, Steven Wu, Arindam Banerjee
Poster
Wed 8:00 Robust Outlier Arm Identification
Yinglun Zhu, Sumeet Katariya, Robert Nowak
Poster
Wed 8:00 Structure Adaptive Algorithms for Stochastic Bandits
Rémy Degenne, Han Shao, Wouter Koolen
Poster
Wed 8:00 On conditional versus marginal bias in multi-armed bandits
Jaehyeok Shin, Aaditya Ramdas, Alessandro Rinaldo
Poster
Wed 8:00 Exploration Through Reward Biasing: Reward-Biased Maximum Likelihood Estimation for Stochastic Multi-Armed Bandits
Xi Liu, Ping-Chun Hsieh, Yu Heng Hung, Anirban Bhattacharya, P. Kumar
Poster
Wed 8:00 Online mirror descent and dual averaging: keeping pace in the dynamic case
Huang Fang, Nick Harvey, Victor Sanches Portella, Michael Friedlander
Poster
Wed 8:00 Budgeted Online Influence Maximization
Pierre Perrault, Jen Healey, Zheng Wen, Michal Valko
Poster
Wed 8:00 Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism
Wang Chi Cheung, David Simchi-Levi, Ruihao Zhu
Poster
Wed 9:00 Dual Mirror Descent for Online Allocation Problems
Santiago Balseiro, Haihao Lu, Vahab Mirrokni
Poster
Wed 10:00 Stochastic bandits with arm-dependent delays
Anne Gael Manegueu, Claire Vernade, Alexandra Carpentier, Michal Valko
Poster
Wed 11:00 Gamification of Pure Exploration for Linear Bandits
Rémy Degenne, Pierre Menard, Xuedong Shang, Michal Valko
Poster
Wed 12:00 My Fair Bandit: Distributed Learning of Max-Min Fairness with Multi-player Bandits
Ilai Bistritz, Tavor Z Baharav, Amir Leshem, Nicholas Bambos
Poster
Thu 6:00 Bandits for BMO Functions
Tianyu Wang, Cynthia Rudin
Poster
Thu 6:00 Improved Sleeping Bandits with Stochastic Action Sets and Adversarial Rewards
Aadirupa Saha, Pierre Gaillard, Michal Valko
Poster
Thu 6:00 Adaptive Region-Based Active Learning
Corinna Cortes, Giulia DeSalvo, Claudio Gentile, Mehryar Mohri, Ningshan Zhang
Poster
Thu 6:00 Online Learning with Dependent Stochastic Feedback Graphs
Corinna Cortes, Giulia DeSalvo, Claudio Gentile, Mehryar Mohri, Ningshan Zhang
Poster
Thu 6:00 Active Learning on Attributed Graphs via Graph Cognizant Logistic Regression and Preemptive Query Generation
Florence Regol, Soumyasundar Pal, Yingxue Zhang, Mark Coates
Poster
Thu 6:00 Improved Bounds on Minimax Regret under Logarithmic Loss via Self-Concordance
Blair Bilodeau, Dylan Foster, Daniel Roy
Poster
Thu 7:00 When Demands Evolve Larger and Noisier: Learning and Earning in a Growing Environment
Feng Zhu, Zeyu Zheng
Poster
Thu 7:00 Online Learning with Imperfect Hints
Aditya Bhaskara, Ashok Cutkosky, Ravi Kumar, Manish Purohit
Poster
Thu 7:00 Doubly robust off-policy evaluation with shrinkage
Yi Su, Maria Dimakopoulou, Akshay Krishnamurthy, Miro Dudik
Poster
Thu 8:00 Parameter-free, Dynamic, and Strongly-Adaptive Online Learning
Ashok Cutkosky
Poster
Thu 9:00 Improved Optimistic Algorithms for Logistic Bandits
Louis Faury, Marc Abeille, Clément Calauzènes, Olivier Fercoq
Poster
Thu 9:00 From PAC to Instance-Optimal Sample Complexity in the Plackett-Luce Model
Aadirupa Saha, Aditya Gopalan
Poster
Thu 12:00 Near-optimal Regret Bounds for Stochastic Shortest Path
Aviv Rosenberg, Alon Cohen, Yishay Mansour, Haim Kaplan
Poster
Thu 12:00 On Thompson Sampling with Langevin Algorithms
Eric Mazumdar, Aldo Pacchiano, Yian Ma, Michael Jordan, Peter Bartlett
Poster
Thu 12:00 Information Particle Filter Tree: An Online Algorithm for POMDPs with Belief-Based Rewards on Continuous Domains
Johannes Fischer, Sahin Tas
Poster
Thu 12:00 Bisection-Based Pricing for Repeated Contextual Auctions against Strategic Buyer
Anton Zhiyanov, Alexey Drutsa
Poster
Thu 13:00 Preselection Bandits
Viktor Bengs, Eyke Hüllermeier
Poster
Thu 14:00 Linear bandits with Stochastic Delayed Feedback
Claire Vernade, Alexandra Carpentier, Tor Lattimore, Giovanni Zappella, Beyza Ermis, Michael Brueckner
Poster
Thu 17:00 Multinomial Logit Bandit with Low Switching Cost
Kefan Dong, Yingkai Li, Qin Zhang, Yuan Zhou
Poster
Thu 17:00 Projection-free Distributed Online Convex Optimization with $O(\sqrt{T})$ Communication Complexity
Yuanyu Wan, Wei-Wei Tu, Lijun Zhang