Toggle Poster Visibility
Oral
Tue Jul 25 08:30 PM -- 08:38 PM (PDT) @ Ballroom B None
Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the Machiavelli Benchmark
[
PDF]
Oral
Tue Jul 25 08:38 PM -- 08:46 PM (PDT) @ Ballroom B None
Information-Theoretic State Space Model for Multi-View Reinforcement Learning
[
PDF]
Oral
Tue Jul 25 08:46 PM -- 08:54 PM (PDT) @ Ballroom B None
Reparameterized Policy Learning for Multimodal Trajectory Optimization
[
PDF]
Oral
Tue Jul 25 08:54 PM -- 09:02 PM (PDT) @ Ballroom B None
Representation Learning with Multi-Step Inverse Kinematics: An Efficient and Optimal Approach to Rich-Observation RL
[
PDF]
Oral
Tue Jul 25 09:10 PM -- 09:18 PM (PDT) @ Ballroom B None
A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs
[
PDF]
Oral
Tue Jul 25 09:18 PM -- 09:26 PM (PDT) @ Ballroom B None
Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap
[
PDF]
Oral
Tue Jul 25 09:26 PM -- 09:34 PM (PDT) @ Ballroom B None
Efficient RL via Disentangled Environment and Agent Representations
[
PDF]
Oral
Tue Jul 25 09:34 PM -- 09:42 PM (PDT) @ Ballroom B None
Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning
[
PDF]
Oral
Tue Jul 25 09:42 PM -- 09:50 PM (PDT) @ Ballroom B None
On the Statistical Benefits of Temporal Difference Learning
[
PDF]