58 Results
Spotlight
|
Wed 14:45 |
Reachability Constrained Reinforcement Learning Dongjie Yu · Haitong Ma · Shengbo Li · Jianyu Chen |
|
Poster
|
Wed 15:30 |
REvolveR: Continuous Evolutionary Models for Robot-to-robot Policy Transfer Xingyu Liu · Deepak Pathak · Kris Kitani |
|
Poster
|
Thu 15:00 |
Cliff Diving: Exploring Reward Surfaces in Reinforcement Learning Environments Ryan Sullivan · Jordan Terry · Benjamin Black · John P Dickerson |
|
Spotlight
|
Wed 10:20 |
Analysis of Stochastic Processes through Replay Buffers Shirli Di-Castro Shashua · Shie Mannor · Dotan Di Castro |
|
Poster
|
Wed 15:30 |
Reachability Constrained Reinforcement Learning Dongjie Yu · Haitong Ma · Shengbo Li · Jianyu Chen |
|
Spotlight
|
Thu 11:50 |
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks Litian Liang · Yaosheng Xu · Stephen McAleer · Dailin Hu · Alexander Ihler · Pieter Abbeel · Roy Fox |
|
Poster
|
Thu 15:00 |
Do Differentiable Simulators Give Better Policy Gradients? Hyung Ju Suh · Max Simchowitz · Kaiqing Zhang · Russ Tedrake |
|
Oral
|
Thu 11:05 |
Do Differentiable Simulators Give Better Policy Gradients? Hyung Ju Suh · Max Simchowitz · Kaiqing Zhang · Russ Tedrake |
|
Poster
|
Wed 15:30 |
Analysis of Stochastic Processes through Replay Buffers Shirli Di-Castro Shashua · Shie Mannor · Dotan Di Castro |
|
Poster
|
Thu 15:00 |
Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks Litian Liang · Yaosheng Xu · Stephen McAleer · Dailin Hu · Alexander Ihler · Pieter Abbeel · Roy Fox |
|
Spotlight
|
Wed 8:25 |
LeNSE: Learning To Navigate Subgraph Embeddings for Large-Scale Combinatorial Optimisation David Ireland · Giovanni Montana |
|
Spotlight
|
Tue 13:35 |
IDYNO: Learning Nonparametric DAGs from Interventional Dynamic Data Tian Gao · Debarun Bhattacharjya · Elliot Nelson · Miao Liu · Yue Yu |