Toggle Poster Visibility
Oral
Fri Jun 14 08:00 AM -- 08:20 AM (KST) @ Hall B
Decentralized Exploration in Multi-Armed Bandits
[
Slides]
Oral
Fri Jun 14 08:20 AM -- 08:25 AM (KST) @ Hall B
Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback
[
Slides]
Oral
Fri Jun 14 08:25 AM -- 08:30 AM (KST) @ Hall B
Exploiting structure of uncertainty for efficient matroid semi-bandits
[
Slides]
Oral
Fri Jun 14 08:30 AM -- 08:35 AM (KST) @ Hall B
PAC Identification of Many Good Arms in Stochastic Multi-Armed Bandits
[
Slides]
Oral
Fri Jun 14 08:35 AM -- 08:40 AM (KST) @ Hall B
Contextual Multi-armed Bandit Algorithm for Semiparametric Reward Model
[
Slides]
Oral
Fri Jun 14 08:40 AM -- 09:00 AM (KST) @ Hall B
Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning
Oral
Fri Jun 14 09:00 AM -- 09:05 AM (KST) @ Hall B
TarMAC: Targeted Multi-Agent Communication
[
Slides]
Oral
Fri Jun 14 09:05 AM -- 09:10 AM (KST) @ Hall B
QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning
[
Slides]
Oral
Fri Jun 14 09:10 AM -- 09:15 AM (KST) @ Hall B
Actor-Attention-Critic for Multi-Agent Reinforcement Learning
[
Slides]
Oral
Fri Jun 14 09:15 AM -- 09:20 AM (KST) @ Hall B
Finite-Time Analysis of Distributed TD(0) with Linear Function Approximation on Multi-Agent Reinforcement Learning
[
Slides]
Successful Page Load