firstbacksecondback
Filter by Keyword:
131 Results
Poster
|
Thu 21:00 |
Optimal Thompson Sampling strategies for support-aware CVaR bandits Dorian Baudry · Romain Gautron · Emilie Kaufmann · Odalric-Ambrym Maillard |
|
Spotlight
|
Tue 17:25 |
Adapting to Delays and Data in Adversarial Multi-Armed Bandits András György · Pooria Joulani |
|
Oral Session
|
Wed 17:00 |
Bandits 2 |
|
Poster
|
Thu 9:00 |
Directional Bias Amplification Angelina Wang · Olga Russakovsky |
|
Spotlight
|
Wed 7:35 |
Deciding What to Learn: A Rate-Distortion Approach Dilip Arumugam · Benjamin Van Roy |
|
Spotlight
|
Wed 18:25 |
Top-k eXtreme Contextual Bandits with Arm Hierarchy Rajat Sen · Alexander Rakhlin · Lexing Ying · Rahul Kidambi · Dean Foster · Daniel Hill · Inderjit Dhillon |
|
Poster
|
Wed 21:00 |
Adapting to misspecification in contextual bandits with offline regression oracles Sanath Kumar Krishnamurthy · Vitor Hadad · Susan Athey |
|
Spotlight
|
Thu 6:40 |
Decoupling Representation Learning from Reinforcement Learning Adam Stooke · Kimin Lee · Pieter Abbeel · Michael Laskin |
|
Poster
|
Thu 9:00 |
Decoupling Representation Learning from Reinforcement Learning Adam Stooke · Kimin Lee · Pieter Abbeel · Michael Laskin |
|
Spotlight
|
Wed 17:40 |
Approximation Theory Based Methods for RKHS Bandits Sho Takemori · Masahiro Sato |
|
Spotlight
|
Wed 17:20 |
Dynamic Planning and Learning under Recovering Rewards David Simchi-Levi · Zeyu Zheng · Feng Zhu |
|
Oral Session
|
Wed 19:00 |
Bandits 4 |