

(10 events)
Oral
Thu Jun 13 04:00 PM -- 04:20 PM (PDT) @ Hall B
Decentralized Exploration in Multi-Armed Bandits
Raphaël Féraud · Reda Alami · Romain Laroche
Oral
Thu Jun 13 04:20 PM -- 04:25 PM (PDT) @ Hall B
Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback
Chicheng Zhang · Alekh Agarwal · Hal Daumé III · John Langford · Sahand Negahban
Oral
Thu Jun 13 04:25 PM -- 04:30 PM (PDT) @ Hall B
Exploiting structure of uncertainty for efficient matroid semi-bandits
Pierre Perrault · Vianney Perchet · Michal Valko
Oral
Thu Jun 13 04:30 PM -- 04:35 PM (PDT) @ Hall B
PAC Identification of Many Good Arms in Stochastic Multi-Armed Bandits
Arghya Roy Chaudhuri · Shivaram Kalyanakrishnan
Oral
Thu Jun 13 04:35 PM -- 04:40 PM (PDT) @ Hall B
Contextual Multi-armed Bandit Algorithm for Semiparametric Reward Model
Gi-Soo Kim · Myunghee Cho Paik
Oral
Thu Jun 13 04:40 PM -- 05:00 PM (PDT) @ Hall B
Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning
Jakob Foerster · Francis Song · Edward Hughes · Neil Burch · Iain Dunning · Shimon Whiteson · Matthew Botvinick · Michael Bowling
Oral
Thu Jun 13 05:00 PM -- 05:05 PM (PDT) @ Hall B
TarMAC: Targeted Multi-Agent Communication
Abhishek Das · Theophile Gervet · Joshua Romoff · Dhruv Batra · Devi Parikh · Michael Rabbat · Joelle Pineau
Oral
Thu Jun 13 05:05 PM -- 05:10 PM (PDT) @ Hall B
QTRAN: Learning to Factorize with Transformation for Cooperative Multi-Agent Reinforcement Learning
Kyunghwan Son · Daewoo Kim · Wan Ju Kang · David Earl Hostallero · Yung Yi
Oral
Thu Jun 13 05:10 PM -- 05:15 PM (PDT) @ Hall B
Actor-Attention-Critic for Multi-Agent Reinforcement Learning
Shariq Iqbal · Fei Sha
Oral
Thu Jun 13 05:15 PM -- 05:20 PM (PDT) @ Hall B
Finite-Time Analysis of Distributed TD(0) with Linear Function Approximation on Multi-Agent Reinforcement Learning
Thinh Doan · Siva Maguluri · Justin Romberg