Oral
Thu Jun 13 04:00 PM -- 04:20 PM (PDT) @ Hall B
Decentralized Exploration in Multi-Armed Bandits
[Slides]
Oral
Thu Jun 13 04:20 PM -- 04:25 PM (PDT) @ Hall B
Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback
[Slides]
Oral
Thu Jun 13 04:25 PM -- 04:30 PM (PDT) @ Hall B
Exploiting structure of uncertainty for efficient matroid semi-bandits
[Slides]
Oral
Thu Jun 13 04:30 PM -- 04:35 PM (PDT) @ Hall B
PAC Identification of Many Good Arms in Stochastic Multi-Armed Bandits
[Slides]
Oral
Thu Jun 13 04:35 PM -- 04:40 PM (PDT) @ Hall B
Contextual Multi-armed Bandit Algorithm for Semiparametric Reward Model
[Slides]
Oral
Thu Jun 13 04:40 PM -- 05:00 PM (PDT) @ Hall B
Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning
Oral
Thu Jun 13 05:00 PM -- 05:05 PM (PDT) @ Hall B
TarMAC: Targeted Multi-Agent Communication
[Slides]
Oral
Thu Jun 13 05:05 PM -- 05:10 PM (PDT) @ Hall B
QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning
[Slides]
Oral
Thu Jun 13 05:10 PM -- 05:15 PM (PDT) @ Hall B
Actor-Attention-Critic for Multi-Agent Reinforcement Learning
[Slides]
Oral
Thu Jun 13 05:15 PM -- 05:20 PM (PDT) @ Hall B
Finite-Time Analysis of Distributed TD(0) with Linear Function Approximation on Multi-Agent Reinforcement Learning
[Slides]