Timezone: »
Guided Policy Search for Parameterized Skills using Adverbs
Benjamin Spiegel · George Konidaris
Event URL: https://openreview.net/forum?id=4cSHQzzFXt »
We present a method for using adverb phrases to adjust skill parameters via learned \textit{adverb-skill groundings}. These groundings allow an agent to use adverb feedback provided by a human to directly update a skill policy in a manner similar to traditional local policy search methods. We show that our method can be used as a drop-in replacement for these policy search methods when dense reward from the environment is not available but human language feedback is. We demonstrate improved sample efficiency over modern policy search methods in two experiments.
Author Information
Benjamin Spiegel (Brown University)
George Konidaris (Brown)
More from the Same Authors
-
2023 Poster: Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning »
Sam Lobel · Akhil Bagaria · George Konidaris -
2023 Oral: Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning »
Sam Lobel · Akhil Bagaria · George Konidaris -
2023 Poster: Meta-learning Parameterized Skills »
Haotian Fu · Shangqun Yu · Saket Tiwari · Michael L. Littman · George Konidaris -
2023 Poster: RLang: A Declarative Language for Describing Partial World Knowledge to Reinforcement Learning Agents »
Rafael A Rodriguez-Sanchez · Benjamin Spiegel · Jennifer Wang · Roma Patel · Stefanie Tellex · George Konidaris -
2021 : RL + Robotics Panel »
George Konidaris · Jan Peters · Martin Riedmiller · Angela Schoellig · Rose Yu · Rupam Mahmood -
2021 Poster: Skill Discovery for Exploration and Planning using Deep Skill Graphs »
Akhil Bagaria · Jason Senthil · George Konidaris -
2021 Oral: Skill Discovery for Exploration and Planning using Deep Skill Graphs »
Akhil Bagaria · Jason Senthil · George Konidaris -
2020 Poster: Learning Portable Representations for High-Level Planning »
Steven James · Benjamin Rosman · George Konidaris -
2019 Poster: Finding Options that Minimize Planning Time »
Yuu Jinnai · David Abel · David Hershkowitz · Michael L. Littman · George Konidaris -
2019 Oral: Finding Options that Minimize Planning Time »
Yuu Jinnai · David Abel · David Hershkowitz · Michael L. Littman · George Konidaris -
2019 Poster: Discovering Options for Exploration by Minimizing Cover Time »
Yuu Jinnai · Jee Won Park · David Abel · George Konidaris -
2019 Oral: Discovering Options for Exploration by Minimizing Cover Time »
Yuu Jinnai · Jee Won Park · David Abel · George Konidaris -
2018 Poster: Policy and Value Transfer in Lifelong Reinforcement Learning »
David Abel · Yuu Jinnai · Sophie Guo · George Konidaris · Michael L. Littman -
2018 Oral: Policy and Value Transfer in Lifelong Reinforcement Learning »
David Abel · Yuu Jinnai · Sophie Guo · George Konidaris · Michael L. Littman