Skip to yearly menu bar Skip to main content


(10 events)   Timezone:  
Show all
Toggle Poster Visibility
Oral
Tue Jul 25 08:30 PM -- 08:38 PM (PDT) @ Ballroom B None
Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the Machiavelli Benchmark
Alexander Pan · Jun Shern Chan · Andy Zou · Nathaniel Li · Steven Basart · Thomas Woodside · Hanlin Zhang · Scott Emmons · Dan Hendrycks
[ PDF
Oral
Tue Jul 25 08:38 PM -- 08:46 PM (PDT) @ Ballroom B None
Information-Theoretic State Space Model for Multi-View Reinforcement Learning
HyeongJoo Hwang · Seokin Seo · Youngsoo Jang · Sungyoon Kim · Geon-Hyeong Kim · Seunghoon Hong · Kee-Eung Kim
[ PDF
Oral
Tue Jul 25 08:46 PM -- 08:54 PM (PDT) @ Ballroom B None
Reparameterized Policy Learning for Multimodal Trajectory Optimization
Zhiao Huang · Litian Liang · Zhan Ling · Xuanlin Li · Chuang Gan · Hao Su
[ PDF
Oral
Tue Jul 25 08:54 PM -- 09:02 PM (PDT) @ Ballroom B None
Representation Learning with Multi-Step Inverse Kinematics: An Efficient and Optimal Approach to Rich-Observation RL
Zakaria Mhammedi · Dylan Foster · Alexander Rakhlin
[ PDF
Oral
Tue Jul 25 09:02 PM -- 09:10 PM (PDT) @ Ballroom B None
Subequivariant Graph Reinforcement Learning in 3D Environments
Runfa Chen · Jiaqi Han · Fuchun Sun · Wenbing Huang
[ Slides [ PDF
Oral
Tue Jul 25 09:10 PM -- 09:18 PM (PDT) @ Ballroom B None
A Study of Global and Episodic Bonuses for Exploration in Contextual MDPs
Mikael Henaff · Minqi Jiang · Roberta Raileanu
[ PDF
Oral
Tue Jul 25 09:18 PM -- 09:26 PM (PDT) @ Ballroom B None
Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap
Hang Wang · Sen Lin · Junshan Zhang
[ PDF
Oral
Tue Jul 25 09:26 PM -- 09:34 PM (PDT) @ Ballroom B None
Efficient RL via Disentangled Environment and Agent Representations
Kevin Gmelin · Shikhar Bahl · Russell Mendonca · Deepak Pathak
[ PDF
Oral
Tue Jul 25 09:34 PM -- 09:42 PM (PDT) @ Ballroom B None
Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning
Sam Lobel · Akhil Bagaria · George Konidaris
[ PDF
Oral
Tue Jul 25 09:42 PM -- 09:50 PM (PDT) @ Ballroom B None
On the Statistical Benefits of Temporal Difference Learning
David Cheikhi · Daniel Russo
[ PDF