Skip to yearly menu bar Skip to main content


(10 events)   Timezone:  
Show all
Toggle Poster Visibility
Oral
Fri Jun 14 08:00 AM -- 08:20 AM (KST) @ Hall B
Decentralized Exploration in Multi-Armed Bandits
Raphaël Féraud · REDA ALAMI · Romain Laroche
[ Slides
Oral
Fri Jun 14 08:20 AM -- 08:25 AM (KST) @ Hall B
Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback
Chicheng Zhang · Alekh Agarwal · Hal Daumé III · John Langford · Sahand Negahban
[ Slides
Oral
Fri Jun 14 08:25 AM -- 08:30 AM (KST) @ Hall B
Exploiting structure of uncertainty for efficient matroid semi-bandits
Pierre Perrault · Vianney Perchet · Michal Valko
[ Slides
Oral
Fri Jun 14 08:30 AM -- 08:35 AM (KST) @ Hall B
PAC Identification of Many Good Arms in Stochastic Multi-Armed Bandits
Arghya Roy Chaudhuri · Shivaram Kalyanakrishnan
[ Slides
Oral
Fri Jun 14 08:35 AM -- 08:40 AM (KST) @ Hall B
Contextual Multi-armed Bandit Algorithm for Semiparametric Reward Model
Gi-Soo Kim · Myunghee Cho Paik
[ Slides
Oral
Fri Jun 14 08:40 AM -- 09:00 AM (KST) @ Hall B
Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning
Jakob Foerster · Francis Song · Edward Hughes · Neil Burch · Iain Dunning · Shimon Whiteson · Matthew Botvinick · Michael Bowling
Oral
Fri Jun 14 09:00 AM -- 09:05 AM (KST) @ Hall B
TarMAC: Targeted Multi-Agent Communication
Abhishek Das · Theophile Gervet · Joshua Romoff · Dhruv Batra · Devi Parikh · Michael Rabbat · Joelle Pineau
[ Slides
Oral
Fri Jun 14 09:05 AM -- 09:10 AM (KST) @ Hall B
QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning
Kyunghwan Son · Daewoo Kim · Wan Ju Kang · David Earl Hostallero · Yung Yi
[ Slides
Oral
Fri Jun 14 09:10 AM -- 09:15 AM (KST) @ Hall B
Actor-Attention-Critic for Multi-Agent Reinforcement Learning
Shariq Iqbal · Fei Sha
[ Slides
Oral
Fri Jun 14 09:15 AM -- 09:20 AM (KST) @ Hall B
Finite-Time Analysis of Distributed TD(0) with Linear Function Approximation on Multi-Agent Reinforcement Learning
Thinh Doan · Siva Maguluri · Justin Romberg
[ Slides