Sat 12:00 p.m. - 12:10 p.m. | Organizers: Introductory Remarks (Remarks)
Sat 12:10 p.m. - 12:35 p.m. | Dorsa Sadigh: Interactive Learning in the Era of Large Models (Invited talk)
Sat 12:35 p.m. - 1:00 p.m. | Jesse Thomason: Considering The Role of Language in Embodied Systems (Invited talk)
Sat 1:00 p.m. - 1:30 p.m. | Coffee break + Poster Session
Sat 1:30 p.m. - 1:55 p.m. | Jonathan Grizou: Aiming for internal consistency, the 4th pillar of interactive learning (Invited talk)
Sat 1:55 p.m. - 2:20 p.m. | Daniel Brown: Pitfalls and paths forward when learning rewards from human feedback (Invited talk)
Sat 2:20 p.m. - 3:10 p.m. | Panel Session 1 (Panel)
Sat 3:10 p.m. - 4:10 p.m. | Lunch break
Sat 4:10 p.m. - 4:35 p.m. | Bradley Knox: The EMPATHIC Framework for Task Learning from Implicit Human Feedback (Invited talk)
Sat 4:35 p.m. - 5:00 p.m. | David Abel: Three Dogmas of Reinforcement Learning (Invited talk)
Sat 5:00 p.m. - 5:25 p.m. | Paul Mineiro: Contextual Bandits without Rewards (Invited talk)
Sat 5:25 p.m. - 6:00 p.m. | Contributed Talks (Talks)
Sat 6:00 p.m. - 6:30 p.m. | Coffee break + Poster Session
Sat 6:30 p.m. - 7:00 p.m. | Taylor Kessler Faulkner: Robots Learning from Real People (Invited talk)
Sat 7:00 p.m. - 7:50 p.m. | Panel Session 2 (Panel)
Sat 7:50 p.m. - 8:00 p.m. | Organizers: Concluding Remarks (Remarks)
Legible Robot Motion from Conditional Generative Models (Poster) | Matthew Bronars · Danfei Xu
Asymptotically Optimal Fixed-Budget Best Arm Identification with Variance-Dependent Bounds (Poster) | Masahiro Kato · Masaaki Imaizumi · Takuya Ishihara · Toru Kitagawa
RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback (Poster) | Yannick Metz · David Lindner · Raphaël Baur · Daniel Keim · Mennatallah El-Assady
Bandits Meet Mechanism Design to Combat Clickbait in Online Recommendation (Poster) | Thomas Kleine Büning · Aadirupa Saha · Christos Dimitrakakis · Haifeng Xu
A Generative Model for Text Control in Minecraft (Poster) | Shalev Lifshitz · Keiran Paster · Harris Chan · Jimmy Ba · Sheila McIlraith
Follow-ups Also Matter: Improving Contextual Bandits via Post-serving Contexts (Poster) | Chaoqi Wang · Ziyu Ye · Zhe Feng · Ashwinkumar Badanidiyuru · Haifeng Xu
Bayesian Inverse Transition Learning for Offline Settings (Poster) | Leo Benac · Sonali Parbhoo · Finale Doshi-Velez
Imitation Learning with Human Eye Gaze via Multi-Objective Prediction (Spotlight + Poster) | Ravi Thakur · MD Sunbeam · Vinicius G. Goecks · Ellen Novoseller · Ritwik Bera · Vernon Lawhern · Greg Gremillion · John Valasek · Nicholas Waytowich
Learning from a Learning User for Optimal Recommendations (Spotlight + Poster) | Fan Yao · Chuanhao Li · Denis Nekipelov · Hongning Wang · Haifeng Xu
Principal-Driven Reward Design and Agent Policy Alignment via Bilevel-RL (Poster) | Souradip Chakraborty · Amrit Bedi · Alec Koppel · Furong Huang · Mengdi Wang
Temporally-Extended Prompts Optimization for SAM in Interactive Medical Image Segmentation (Poster) | Chuyun Shen · Wenhao Li · Ya Zhang · Xiangfeng Wang
Survival Instinct in Offline Reinforcement Learning and Implicit Human Bias in Data (Spotlight + Poster) | Anqi Li · Dipendra Misra · Andrey Kolobov · Ching-An Cheng
Relative Behavioral Attributes: Filling the Gap between Symbolic Goal Specification and Reward Learning from Human Preferences (Poster) | Lin Guan · Karthik Valmeekam · Subbarao Kambhampati
Complementing a Policy with a Different Observation Space (Poster) | Gokul Swamy · Sanjiban Choudhury · J. Bagnell · Steven Wu
Cognitive Models as Simulators: Using Cognitive Models to Tap into Implicit Human Feedback (Poster) | Ardavan S. Nobandegani · Thomas Shultz · Irina Rish
Unraveling the ARC Puzzle: Mimicking Human Solutions with Object-Centric Decision Transformer (Poster) | Jaehyun Park · Jaegyun Im · Sanha Hwang · Mintaek Lim · Sabina Ualibekova · Sejin Kim · Sundong Kim
Selective Sampling and Imitation Learning via Online Regression (Poster) | Ayush Sekhari · Karthik Sridharan · Wen Sun · Runzhe Wu
Learning Shared Safety Constraints from Multi-task Demonstrations (Poster) | Konwoo Kim · Gokul Swamy · Zuxin Liu · Ding Zhao · Sanjiban Choudhury · Steven Wu
Strategic Apple Tasting (Poster) | Keegan Harris · Chara Podimata · Steven Wu
Discovering User Types: Characterization of User Traits by Task-Specific Behaviors in Reinforcement Learning (Poster) | Lars L. Ankile · Brian Ham · Kevin Mao · Eura Shin · Siddharth Swaroop · Finale Doshi-Velez · Weiwei Pan
Provable Offline Reinforcement Learning with Human Feedback (Poster) | Wenhao Zhan · Masatoshi Uehara · Nathan Kallus · Jason Lee · Wen Sun
SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks (Poster) | Yuchen Lin · Yicheng Fu · Karina Yang · Prithviraj Ammanabrolu · Faeze Brahman · Shiyu Huang · Chandra Bhagavatula · Yejin Choi · Xiang Ren
How to Query Human Feedback Efficiently in RL? (Poster) | Wenhao Zhan · Masatoshi Uehara · Wen Sun · Jason Lee
Contextual Bandits and Imitation Learning with Preference-Based Active Queries (Poster) | Ayush Sekhari · Karthik Sridharan · Wen Sun · Runzhe Wu
Bayesian Active Meta-Learning under Prior Misspecification (Poster) | Sabina Sloman · Ayush Bharti · Samuel Kaski
UCB Provably Learns From Inconsistent Human Feedback (Spotlight + Poster) | Shuo Yang · Tongzheng Ren · Inderjit Dhillon · Sujay Sanghavi
Contextual Set Selection Under Human Feedback With Model Misspecification (Poster) | Shuo Yang · Rajat Sen · Sujay Sanghavi
Building Community Driven Libraries of Natural Programs (Poster) | Leonardo Hernandez Cano · Yewen Pu · Robert Hawkins · Josh Tenenbaum · Armando Solar-Lezama
Modeled Cognitive Feedback to Calibrate Uncertainty for Interactive Learning (Poster) | Jaelle Scheuerman · Zachary Bishof · Chris Michael
Improving Bionic Limb Control through Reinforcement Learning in an Interactive Game Environment (Poster) | Kilian Freitag · Rita Laezza · Jan Zbinden · Max Ortiz-Catalan
Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards (Poster) | Alexandre Rame · Guillaume Couairon · Corentin Dancette · Mustafa Shukor · Jean-Baptiste Gaya · Laure Soulier · Matthieu Cord
Inverse Preference Learning: Preference-based RL without a Reward Function (Poster) | Joey Hejna · Dorsa Sadigh
Interactive-Chain-Prompting: Ambiguity Resolution for Crosslingual Conditional Generation with Interaction (Poster) | Jonathan Pilault · Xavier Garcia · Arthur Brazinskas · Orhan Firat
Reinforcement learning with Human Feedback: Learning Dynamic Choices via Pessimism (Poster) | Zihao Li · Zhuoran Yang · Mengdi Wang
Accelerating exploration and representation learning with offline pre-training (Poster) | Bogdan Mazoure · Jake Bruce · Doina Precup · Rob Fergus · Ankit Anand
Active Learning with Crowd Sourcing Improves Information Retrieval (Poster) | Zhuotong Chen · Yifei Ma · Branislav Kveton · Anoop Deoras
Visual-based Policy Learning with Latent Language Encoding (Spotlight + Poster) | Jielin Qiu · Mengdi Xu · William Han · Bo Li · Ding Zhao
Guided Policy Search for Parameterized Skills using Adverbs (Poster) | Benjamin Spiegel · George Konidaris