Poster
in
Workshop: Human-AI Collaboration in Sequential Decision-Making
Learning from an Exploring Demonstrator: Optimal Reward Estimation for Bandits
Wenshuo Guo
Abstract: