[ protected link dropped ]
A0: Kolby Nottingham, Anand Balakrishnan, Jyotirmoy Deshmukh and David Wingate, "Using Logical Specifications of Objectives in Multi-Objective Reinforcement Learning"
A1: Ishaan Shah, David Halpern, Michael Littman and Kavosh Asadi, "Convergence of a Human-in-the-Loop Policy-Gradient Algorithm With Eligibility Trace Under Reward, Policy, and Advantage Feedback"
A2: Vahid Balazadeh Meresht, Abir De, Adish Singla and Manuel Gomez Rodriguez, "Learning to Switch Among Agents in a Team"
A3: H M Sajjad Hossain, Yash Chandak, Soundararajan Srinivasan, David Koleczek, Weihao Tan, Siddhant Pradhan, Vishal Rohra, Vivek Chettiar, Aaslesha Rajaram, Nicholas Perello and Nan Ma, "Intervention Aware Shared Autonomy"
A4: Hengyuan Hu, Adam Lerer, Brandon Cui, David Wu, Luis Pineda, Noam Brown and Jakob Foerster, "Off Belief Learning"
A5: Brandon Cui, Hengyuan Hu, Luis Pineda and Jakob Foerster, "K-level Reasoning for Zero-Shot Coordination in Hanabi"
A6: Hamsa Bastani, Osbert Bastani and Wichinpong Sinchaisri, "Improving Human Decision-Making with Machine Learning"
B0: Interactive Grounded Language Understanding in a Collaborative Environment