Timezone: »
A key challenge for the successful deployment of many real world human-facing automated sequential decision-making systems is the need for human-AI collaboration. Effective collaboration ensures that the complementary abilities and skills of the human-users and the AI system are leveraged to maximize utility. This is for instance important in applications such as autonomous driving, in which a human user’s skill might be required in safety critical situations, or virtual personal assistants, in which a human user can perform real-world physical interactions which the AI system cannot. Facilitating such collaboration requires cooperation, coordination, and communication, e.g., in the form of accountability, teaching interactions, provision of feedback, etc. Without effective human-AI collaboration, the utility of automated sequential decision-making systems can be severely limited. Thus there is a surge of interest in better facilitating human-AI collaboration in academia and industry. Most existing research has focussed only on basic approaches for human-AI collaboration with little focus on long-term interactions and the breadth needed for next-generation applications. In this workshop we bring together researchers to advance this important topic, focussing on the following three directions: (a) Accountability and trust; (b) Adaptive behavior for long-term collaboration; (c) Robust collaboration under mismatch.
Fri 5:55 a.m. - 6:00 a.m.
|
Introduction
(
Live
)
SlidesLive Video » |
🔗 |
Fri 6:00 a.m. - 6:30 a.m.
|
Scaling up Probabilistic Safe Learning
(
Invited Talk
)
SlidesLive Video » |
Scott Niekum 🔗 |
Fri 6:30 a.m. - 7:00 a.m.
|
Detecting and Influencing the Social Dynamics of a Human-Robot Team
(
Invited Talk
)
SlidesLive Video » |
Sarah Sebo 🔗 |
Fri 7:00 a.m. - 7:10 a.m.
|
Break 1
|
🔗 |
Fri 7:10 a.m. - 8:00 a.m.
|
Poster spotlight presentations 1
(
Poster Spotlights
)
SlidesLive Video » |
Sebastian Tschiatschek · Adish Singla · Besmira Nushi 🔗 |
Fri 8:00 a.m. - 9:00 a.m.
|
Poster session 1
(
Poster Session
)
link »
https://eventhosts.gather.town/rEmR7OowhlDHrLlC/humanai-poster-room-1t A0: Kolby Nottingham, Anand Balakrishnan, Jyotirmoy Deshmukh and David Wingate, "Using Logical Specifications of Objectives in Multi-Objective Reinforcement Learning" A1: Ishaan Shah, David Halpern, Michael Littman and Kavosh Asadi, "Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback" A2: Vahid Balazadeh Meresht, Abir De, Adish Singla and Manuel Gomez Rodriguez, "Learning to Switch Among Agents in a Team" A3: H M Sajjad Hossain, Yash Chandak, Soundararajan Srinivasan, David Koleczek, Weihao Tan, Siddhant Pradhan, Vishal Rohra, Vivek Chettiar, Aaslesha Rajaram, Nicholas Perello and Nan Ma, "Intervention Aware Shared Autonomy" A4: Hengyuan Hu, Adam Lerer, Brandon Cui, David Wu, Luis Pineda, Noam Brown and Jakob Foerster, "Off Belief Learning" A5: Brandon Cui, Hengyuan Hu, Luis Pineda and Jakob Foerster, "K-level Reasoning for Zero-Shot Coordination in Hanabi" A6: Hamsa Bastani, Osbert Bastani and Wichinpong Sinchaisri, "Improving Human Decision-Making with Machine Learning" B0: Interactive Grounded Language Understanding in a Collaborative Environment |
🔗 |
Fri 9:00 a.m. - 9:30 a.m.
|
The Role of Conventions in Adaptive Human-AI Collaboration
(
Invited Talk
)
SlidesLive Video » |
Dorsa Sadigh 🔗 |
Fri 9:30 a.m. - 10:00 a.m.
|
Towards Human-like and Collaborative AI in Video Games
(
Invited Talk
)
SlidesLive Video » |
Katja Hofmann 🔗 |
Fri 10:00 a.m. - 10:10 a.m.
|
Break 2
|
🔗 |
Fri 10:10 a.m. - 11:00 a.m.
|
Poster spotlight presentations 2
(
Poster Spotlights
)
SlidesLive Video » |
Sebastian Tschiatschek · Adish Singla · Besmira Nushi 🔗 |
Fri 11:00 a.m. - 12:00 p.m.
|
Poster session 2
(
Poster session
)
link »
https://eventhosts.gather.town/RBepNDugnMC8rddK/humanai-poster-room-2t A0: Jeevana Inala, Yecheng Ma, Osbert Bastani, Xin Zhang and Armando Solar-Lezama, "Safe Human-Interactive Control via Shielding" A1: Jennifer Suriadinata, William Macke, Reuth Mirsky and Peter Stone, "Reasoning about Human Behavior in Ad Hoc Teamwork" A2: Gottipati Vijaya Sai Krishna, Cloderic Mars, Greg Szriftgiser, Sagar Kurandwad, Francois Chabot and Vincent Robert, "Cogment: Open Source Framework For Distributed Multi-actor Training, Deployment And Operations" A3: Minori Narita, Sandhya Saisubramanian, Roderic A. Grupen and Shlomo Zilberstein, "Identifying Missing Features in State Representation for Safe Decision-Making" A4: Hengyuan Hu, Samuel Sokota, David Wu, Anton Bakhtin, Andrei Lupu, Noam Brown and Jakob Foerster, "Self-Explaining Deviations for Zero-Shot Coordination" A5: Aakriti Kumar, Trisha Patel, Aaron Benjamin and Mark Steyvers, "Explaining Algorithm Aversion with Metacognitive Bandits" A6: Daniel Shin and Daniel Brown, "Offline Preference-Based Apprenticeship Learning" B0: Wenshuo Guo, Kumar Agarwal, Aditya Grover, Vidya Muthukumar and Ashwin Pananjady, "Learning from an Exploring Demonstrator: Optimal Reward Estimation for Bandits" |
🔗 |
Fri 12:00 p.m. - 12:30 p.m.
|
Robust Robot Learning via Human-Guided Intermediate Representations
(
Invited Talk
)
SlidesLive Video » |
Andreea Bobu 🔗 |
Fri 12:30 p.m. - 1:00 p.m.
|
Personalized Preference Learning - from Spinal Cord Stimulation to Exoskeletons
(
Invited Talk
)
SlidesLive Video » |
Yisong Yue 🔗 |
Fri 1:00 p.m. - 1:05 p.m.
|
Concluding remarks
(
Conluding remarks
)
SlidesLive Video » |
🔗 |
-
|
Using Logical Specifications of Objectives in Multi-Objective Reinforcement Learning ( Poster ) link » | Kolby Nottingham 🔗 |
-
|
Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback ( Poster ) link » | Ishaan Shah 🔗 |
-
|
Learning to Switch Among Agents in a Team ( Poster ) link » | Manuel Gomez-Rodriguez · Vahid Balazadeh Meresht 🔗 |
-
|
Intervention Aware Shared Autonomy ( Poster ) link » | Weihao Tan 🔗 |
-
|
Off Belief Learning ( Poster ) link » | Hengyuan Hu 🔗 |
-
|
K-level Reasoning for Zero-Shot Coordination in Hanabi ( Poster ) link » | Brandon Cui 🔗 |
-
|
Improving Human Decision-Making with Machine Learning ( Poster ) link » | Hamsa Bastani 🔗 |
-
|
Safe Human-Interactive Control via Shielding
(
Poster
)
|
Jeevana Priya Inala 🔗 |
-
|
Reasoning about Human Behavior in Ad Hoc Teamwork ( Poster ) link » | Reuth Mirsky 🔗 |
-
|
Cogment: Open Source Framework For Distributed Multi-actor Training, Deployment And Operations
(
Poster
)
|
Sai Krishna Gottipati 🔗 |
-
|
Identifying Missing Features in State Representation for Safe Decision-Making ( Poster ) link » | Minori Narita 🔗 |
-
|
Self-Explaining Deviations for Zero-Shot Coordination ( Poster ) link » | Hengyuan Hu 🔗 |
-
|
Explaining Algorithm Aversion with Metacognitive Bandits ( Poster ) link » | Aakriti Kumar 🔗 |
-
|
Offline Preference-Based Apprenticeship Learning ( Poster ) link » | Daniel Shin 🔗 |
-
|
Learning from an Exploring Demonstrator: Optimal Reward Estimation for Bandits ( Poster ) link » | Wenshuo Guo 🔗 |
-
|
Interactive Grounded Language Understanding in a Collaborative Environment
(
Poster
)
|
Julia Kiseleva 🔗 |
Author Information
Besmira Nushi (Microsoft Research)
Adish Singla (Max Planck Institute (MPI-SWS))

Adish Singla is a faculty member at the Max Planck Institute for Software Systems (MPI-SWS), Germany, where he has been leading the Machine Teaching Group since 2017. He conducts research in the area of Machine Teaching, with a particular focus on open-ended learning and problem-solving domains. In recent years, his research has centered around developing AI-driven educational technology for introductory programming environments. He has received several awards for his research, including an AAAI Outstanding Paper Honorable Mention Award (2022) and an ERC Starting Grant (2021). He also has extensive experience working in the industry and is a recipient of several industry awards, including a research grant from Microsoft Research Ph.D. Scholarship Programme (2018), Facebook Graduate Fellowship (2015), Microsoft Tech Transfer Award (2011), and Microsoft Gold Star Award (2010).
Sebastian Tschiatschek (University of Vienna)
More from the Same Authors
-
2021 : Test Poster (to delete) »
Sebastian Tschiatschek -
2021 : Poster spotlight presentations 2 »
Sebastian Tschiatschek · Adish Singla · Besmira Nushi -
2021 : Poster spotlight presentations 1 »
Sebastian Tschiatschek · Adish Singla · Besmira Nushi -
2020 Poster: Adaptive Reward-Poisoning Attacks against Reinforcement Learning »
Xuezhou Zhang · Yuzhe Ma · Adish Singla · Jerry Zhu -
2020 Poster: Policy Teaching via Environment Poisoning: Training-time Adversarial Attacks against Reinforcement Learning »
Amin Rakhsha · Goran Radanovic · Rati Devidze · Jerry Zhu · Adish Singla -
2019 Poster: Efficient learning of smooth probability functions from Bernoulli tests with guarantees »
Paul Rolland · Ali Kavis · Alexander Niklaus Immer · Adish Singla · Volkan Cevher -
2019 Oral: Efficient learning of smooth probability functions from Bernoulli tests with guarantees »
Paul Rolland · Ali Kavis · Alexander Niklaus Immer · Adish Singla · Volkan Cevher -
2019 Poster: Learning to Collaborate in Markov Decision Processes »
Goran Radanovic · Rati Devidze · David Parkes · Adish Singla -
2019 Oral: Learning to Collaborate in Markov Decision Processes »
Goran Radanovic · Rati Devidze · David Parkes · Adish Singla