Interactive Learning with Implicit Human Feedback

Workshop

Interactive Learning with Implicit Human Feedback

Andi Peng · Akanksha Saran · Andreea Bobu · Tengyang Xie · Pierre-Yves Oudeyer · Anca Dragan · John Langford

Meeting Room 315

Sat 29 Jul, noon PDT

[ Abstract ] Workshop Website

Systems that can learn interactively from their end-users are quickly becoming widespread in real-world applications. Typically humans provide tagged rewards or scalar feedback for such interactive learning systems. However, humans offer a wealth of implicit information (such as multimodal cues in the form of natural language, speech, eye movements, facial expressions, gestures etc.) which interactive learning algorithms can leverage during the process of human-machine interaction to create a grounding for human intent, and thereby better assist end-users. A closed-loop sequential decision-making domain offers unique challenges when learning from humans -– (1) the data distribution may be influenced by the choices of the algorithm itself, and thus interactive ML algorithms need to adaptively learn from human feedback, (2) the nature of the environment itself changes rapidly, (3) humans may express their intent in various forms of feedback amenable to naturalistic real-world settings, going beyond tagged rewards or demonstrations. By organizing this workshop, we attempt to bring together interdisciplinary experts in interactive machine learning, reinforcement learning, human-computer interaction, cognitive science, and robotics to explore and foster discussions on such challenges. We envision that this exchange of ideas within and across disciplines can build new bridges, address some of the most valuable challenges in interactive learning with implicit human feedback, and also provide guidance to young researchers interested in growing their careers in this space.

Chat is not available.

Timezone: America/Los_Angeles

Schedule

Sat 12:00 p.m. - 12:10 p.m.	Organizers: Introductory Remarks ( Remarks ) > SlidesLive Video	🔗
Sat 12:10 p.m. - 12:35 p.m.	Dorsa Sadigh: Interactive Learning in the Era of Large Models ( Invited talk ) > SlidesLive Video	🔗
Sat 12:35 p.m. - 1:00 p.m.	Jesse Thomason: Considering The Role of Language in Embodied Systems ( Invited talk ) > SlidesLive Video	🔗
Sat 1:00 p.m. - 1:30 p.m.	Coffee break + Poster Session	🔗
Sat 1:30 p.m. - 1:55 p.m.	Jonathan Grizou: Aiming for internal consistency, the 4th pillar of interactive learning ( Invited talk ) > SlidesLive Video	🔗
Sat 1:55 p.m. - 2:20 p.m.	Daniel Brown: Pitfalls and paths forward when learning rewards from human feedback ( Invited talk ) > SlidesLive Video	🔗
Sat 2:20 p.m. - 3:10 p.m.	Panel Session 1 ( Panel ) > SlidesLive Video	🔗
Sat 3:10 p.m. - 4:10 p.m.	Lunch break	🔗
Sat 4:10 p.m. - 4:35 p.m.	Bradley Knox: The EMPATHIC Framework for Task Learning from Implicit Human Feedback ( Invited talk ) > SlidesLive Video	🔗
Sat 4:35 p.m. - 5:00 p.m.	David Abel: Three Dogmas of Reinforcement Learning ( Invited talk ) > SlidesLive Video	🔗
Sat 5:00 p.m. - 5:25 p.m.	Paul Mineiro: Contextual Bandits without Rewards ( Invited talk ) > SlidesLive Video	🔗
Sat 5:25 p.m. - 6:00 p.m.	Contributed Talks ( Talks ) > SlidesLive Video	🔗
Sat 6:00 p.m. - 6:30 p.m.	Coffee break + Poster Session	🔗
Sat 6:30 p.m. - 7:00 p.m.	Taylor Kessler Faulkner: Robots Learning from Real People ( Invited talk ) > SlidesLive Video	🔗
Sat 7:00 p.m. - 7:50 p.m.	Panel Session 2 ( Panel ) > SlidesLive Video	🔗
Sat 7:50 p.m. - 8:00 p.m.	Organizers: Concluding Remarks ( Remarks ) > SlidesLive Video	🔗
-	Legible Robot Motion from Conditional Generative Models ( Poster ) > link Link	Matthew Bronars · Danfei Xu 🔗
-	Asymptotically Optimal Fixed-Budget Best Arm Identification with Variance-Dependent Bounds ( Poster ) > link Link	Masahiro Kato · Masaaki Imaizumi · Takuya Ishihara · Toru Kitagawa 🔗
-	RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback ( Poster ) > link Link	Yannick Metz · David Lindner · Raphaël Baur · Daniel Keim · Mennatallah El-Assady 🔗
-	Bandits Meet Mechanism Design to Combat Clickbait in Online Recommendation ( Poster ) > link Link	Thomas Kleine Büning · Aadirupa Saha · Christos Dimitrakakis · Haifeng Xu 🔗
-	A Generative Model for Text Control in Minecraft ( Poster ) > link Link	Shalev Lifshitz · Keiran Paster · Harris Chan · Jimmy Ba · Sheila McIlraith 🔗
-	Follow-ups Also Matter: Improving Contextual Bandits via Post-serving Contexts ( Poster ) > link Link	Chaoqi Wang · Ziyu Ye · Zhe Feng · Ashwinkumar Badanidiyuru · Haifeng Xu 🔗
-	Bayesian Inverse Transition Learning for Offline Settings ( Poster ) > link Link	Leo Benac · Sonali Parbhoo · Finale Doshi-Velez 🔗
-	Imitation Learning with Human Eye Gaze via Multi-Objective Prediction ( Spotlight + Poster ) > link Link	Ravi Thakur · MD Sunbeam · Vinicius G. Goecks · Ellen Novoseller · Ritwik Bera · Vernon Lawhern · Greg Gremillion · John Valasek · Nicholas Waytowich 🔗
-	Learning from a Learning User for Optimal Recommendations ( Spotlight + Poster ) > link Link	Fan Yao · Chuanhao Li · Denis Nekipelov · Hongning Wang · Haifeng Xu 🔗
-	Principal-Driven Reward Design and Agent Policy Alignment via Bilevel-RL ( Poster ) > link Link	Souradip Chakraborty · Amrit Bedi · Alec Koppel · Furong Huang · Mengdi Wang 🔗
-	Temporally-Extended Prompts Optimization for SAM in Interactive Medical Image Segmentation ( Poster ) > link Link	Chuyun Shen · Wenhao Li · Ya Zhang · Xiangfeng Wang 🔗
-	Survival Instinct in Offline Reinforcement Learning and Implicit Human Bias in Data ( Spotlight + Poster ) > link Link	Anqi Li · Dipendra Misra · Andrey Kolobov · Ching-An Cheng 🔗
-	Relative Behavioral Attributes: Filling the Gap between Symbolic Goal Specification and Reward Learning from Human Preferences ( Poster ) > link Link	Lin Guan · Karthik Valmeekam · Subbarao Kambhampati 🔗
-	Complementing a Policy with a Different Observation Space ( Poster ) > link Link	Gokul Swamy · Sanjiban Choudhury · J. Bagnell · Steven Wu 🔗
-	Cognitive Models as Simulators: Using Cognitive Models to Tap into Implicit Human Feedback ( Poster ) > link Link	Ardavan S. Nobandegani · Thomas Shultz · Irina Rish 🔗
-	Unraveling the ARC Puzzle: Mimicking Human Solutions with Object-Centric Decision Transformer ( Poster ) > link Link	JAEHYUN PARK · Jaegyun Im · Sanha Hwang · Mintaek Lim · Sabina Ualibekova · Sejin Kim · Sundong Kim 🔗
-	Selective Sampling and Imitation Learning via Online Regression ( Poster ) > link Link	Ayush Sekhari · Karthik Sridharan · Wen Sun · Runzhe Wu 🔗
-	Learning Shared Safety Constraints from Multi-task Demonstrations ( Poster ) > link Link	Konwoo Kim · Gokul Swamy · Zuxin Liu · Ding Zhao · Sanjiban Choudhury · Steven Wu 🔗
-	Strategic Apple Tasting ( Poster ) > link Link	Keegan Harris · Chara Podimata · Steven Wu 🔗
-	Discovering User Types: Characterization of User Traits by Task-Specific Behaviors in Reinforcement Learning ( Poster ) > link Link	Lars L. Ankile · Brian Ham · Kevin Mao · Eura Shin · Siddharth Swaroop · Finale Doshi-Velez · Weiwei Pan 🔗
-	Provable Offline Reinforcement Learning with Human Feedback ( Poster ) > link Link	Wenhao Zhan · Masatoshi Uehara · Nathan Kallus · Jason Lee · Wen Sun 🔗
-	SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks ( Poster ) > link Link	Yuchen Lin · Yicheng Fu · Karina Yang · Prithviraj Ammanabrolu · Faeze Brahman · Shiyu Huang · Chandra Bhagavatula · Yejin Choi · Xiang Ren 🔗
-	How to Query Human Feedback Efficiently in RL? ( Poster ) > link Link	Wenhao Zhan · Masatoshi Uehara · Wen Sun · Jason Lee 🔗
-	Contextual Bandits and Imitation Learning with Preference-Based Active Queries ( Poster ) > link Link	Ayush Sekhari · Karthik Sridharan · Wen Sun · Runzhe Wu 🔗
-	Bayesian Active Meta-Learning under Prior Misspecification ( Poster ) > link Link	Sabina Sloman · Ayush Bharti · Samuel Kaski 🔗
-	UCB Provably Learns From Inconsistent Human Feedback ( Spotlight + Poster ) > link Link	Shuo Yang · Tongzheng Ren · Inderjit Dhillon · Sujay Sanghavi 🔗
-	Contextual Set Selection Under Human Feedback With Model Misspecification ( Poster ) > link Link	Shuo Yang · Rajat Sen · Sujay Sanghavi 🔗
-	Building Community Driven Libraries of Natural Programs ( Poster ) > link Link	Leonardo Hernandez Cano · Yewen Pu · Robert Hawkins · Josh Tenenbaum · Armando Solar-Lezama 🔗
-	Modeled Cognitive Feedback to Calibrate Uncertainty for Interactive Learning ( Poster ) > link Link	Jaelle Scheuerman · Zachary Bishof · Chris Michael 🔗
-	Improving Bionic Limb Control through Reinforcement Learning in an Interactive Game Environment ( Poster ) > link Link	Kilian Freitag · Rita Laezza · Jan Zbinden · Max Ortiz-Catalan 🔗
-	Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards ( Poster ) > link Link	Alexandre Rame · Guillaume Couairon · Corentin Dancette · Mustafa Shukor · Jean-Baptiste Gaya · Laure Soulier · Matthieu Cord 🔗
-	Inverse Preference Learning: Preference-based RL without a Reward Function ( Poster ) > link Link	Joey Hejna · Dorsa Sadigh 🔗
-	Interactive-Chain-Prompting: Ambiguity Resolution for Crosslingual Conditional Generation with Interaction ( Poster ) > link Link	Jonathan Pilault · Xavier Garcia · Arthur Brazinskas · Orhan Firat 🔗
-	Reinforcement learning with Human Feedback: Learning Dynamic Choices via Pessimism ( Poster ) > link Link	Zihao Li · Zhuoran Yang · Mengdi Wang 🔗
-	Accelerating exploration and representation learning with offline pre-training ( Poster ) > link Link	Bogdan Mazoure · Jake Bruce · Doina Precup · Rob Fergus · Ankit Anand 🔗
-	Active Learning with Crowd Sourcing Improves Information Retrieval ( Poster ) > link Link	Zhuotong Chen · Yifei Ma · Branislav Kveton · Anoop Deoras 🔗
-	Visual-based Policy Learning with Latent Language Encoding ( Spotlight + Poster ) > link Link	Jielin Qiu · Mengdi Xu · William Han · Bo Li · Ding Zhao 🔗
-	Guided Policy Search for Parameterized Skills using Adverbs ( Poster ) > link Link	Benjamin Spiegel · George Konidaris 🔗