Filter by Keyword:

32 Results

Expo Workshop
Sun 5:00 Real World RL: Azure Personalizer & Vowpal Wabbit
Sheetal Lahabar, Etienne Kintzler, Mark Rucker, Bogdan Mazoure, Qingyun Wu, Pavithra Srinath, Jack Gerrits, Olga Vrousgou, John Langford, Eduardo Salinas
Spotlight
Tue 5:40 On the Optimality of Batch Policy Optimization Algorithms
Chenjun Xiao, Yifan Wu, Jincheng Mei, Bo Dai, Tor Lattimore, Lihong Li, Csaba Szepesvari, Dale Schuurmans
Poster
Tue 9:00 On the Optimality of Batch Policy Optimization Algorithms
Chenjun Xiao, Yifan Wu, Jincheng Mei, Bo Dai, Tor Lattimore, Lihong Li, Csaba Szepesvari, Dale Schuurmans
Oral
Wed 7:00 High-dimensional Experimental Design and Kernel Bandits
Romain Camilleri, Kevin Jamieson, Julian Katz-Samuels
Poster
Wed 9:00 High-dimensional Experimental Design and Kernel Bandits
Romain Camilleri, Kevin Jamieson, Julian Katz-Samuels
Oral
Wed 17:00 The Symmetry between Arms and Knapsacks: A Primal-Dual Approach for Bandits with Knapsacks
Xiaocheng Li, Chunlin Sun, Yinyu Ye
Spotlight
Wed 17:30 Achieving Near Instance-Optimality and Minimax-Optimality in Stochastic and Adversarial Linear Bandits Simultaneously
Chung-Wei Lee, Haipeng Luo, Chen-Yu Wei, Mengxiao Zhang, Xiaojin Zhang
Oral
Wed 18:00 Cyclically Equivariant Neural Decoders for Cyclic Codes
Xiangyu Chen, Min Ye
Spotlight
Wed 18:40 Almost Optimal Anytime Algorithm for Batched Multi-Armed Bandits
Tianyuan Jin, Jing Tang, Pan Xu, Keke Huang, Xiaokui Xiao, Quanquan Gu
Spotlight
Wed 19:45 Adapting to misspecification in contextual bandits with offline regression oracles
Sanath Kumar Krishnamurthy, Vitor Hadad, Susan Athey
Poster
Wed 21:00 Almost Optimal Anytime Algorithm for Batched Multi-Armed Bandits
Tianyuan Jin, Jing Tang, Pan Xu, Keke Huang, Xiaokui Xiao, Quanquan Gu
Poster
Wed 21:00 The Symmetry between Arms and Knapsacks: A Primal-Dual Approach for Bandits with Knapsacks
Xiaocheng Li, Chunlin Sun, Yinyu Ye
Poster
Wed 21:00 Achieving Near Instance-Optimality and Minimax-Optimality in Stochastic and Adversarial Linear Bandits Simultaneously
Chung-Wei Lee, Haipeng Luo, Chen-Yu Wei, Mengxiao Zhang, Xiaojin Zhang
Poster
Wed 21:00 Adapting to misspecification in contextual bandits with offline regression oracles
Sanath Kumar Krishnamurthy, Vitor Hadad, Susan Athey
Poster
Wed 21:00 Cyclically Equivariant Neural Decoders for Cyclic Codes
Xiangyu Chen, Min Ye
Spotlight
Thu 6:40 Decoupling Representation Learning from Reinforcement Learning
Adam Stooke, Kimin Lee, Pieter Abbeel, Michael Laskin
Poster
Thu 9:00 Decoupling Representation Learning from Reinforcement Learning
Adam Stooke, Kimin Lee, Pieter Abbeel, Michael Laskin
Spotlight
Thu 19:25 Link Prediction with Persistent Homology: An Interactive View
Zuoyu Yan, Tengfei Ma, Liangcai Gao, Zhi Tang, Chao Chen
Spotlight
Thu 20:40 On Limited-Memory Subsampling Strategies for Bandits
Dorian Baudry, Yoan Russac, Olivier Cappé
Spotlight
Thu 20:45 Problem Dependent View on Structured Thresholding Bandit Problems
James Cheshire, Pierre MENARD, Alexandra Carpentier
Spotlight
Thu 20:45 CURI: A Benchmark for Productive Concept Learning Under Uncertainty
Rama Vedantam, Arthur Szlam, Max Nickel, Ari Morcos, Brenden Lake
Poster
Thu 21:00 On Limited-Memory Subsampling Strategies for Bandits
Dorian Baudry, Yoan Russac, Olivier Cappé
Poster
Thu 21:00 Link Prediction with Persistent Homology: An Interactive View
Zuoyu Yan, Tengfei Ma, Liangcai Gao, Zhi Tang, Chao Chen
Poster
Thu 21:00 CURI: A Benchmark for Productive Concept Learning Under Uncertainty
Rama Vedantam, Arthur Szlam, Max Nickel, Ari Morcos, Brenden Lake
Poster
Thu 21:00 Problem Dependent View on Structured Thresholding Bandit Problems
James Cheshire, Pierre MENARD, Alexandra Carpentier
Workshop
Fri 9:55 On-the-fly learning of adaptive strategies with bandit algorithms
Rashid Bakirov
Workshop
On-the-fly learning of adaptive strategies with bandit algorithms
Rashid Bakirov, Damien Fay, Bogdan Gabrys
Workshop
Statistical Inference with M-Estimators on Adaptively Collected Data
Kelly Zhang, Lucas Janson, Susan Murphy