Timezone: »
We introduce an approach for understanding control policies represented as recurrent neural networks. Recent work has approached this problem by transforming such recurrent policy networks into finite-state machines (FSM) and then analyzing the equivalent minimized FSM. While this led to interesting insights, the minimization process can obscure a deeper understanding of a machine's operation by merging states that are semantically distinct. To address this issue, we introduce an analysis approach that starts with an unminimized FSM and applies more-interpretable reductions that preserve the key decision points of the policy. We also contribute an attention tool to attain a deeper understanding of the role of observations in the decisions. Our case studies on 7 Atari games and 3 control benchmarks demonstrate that the approach can reveal insights that have not been previously noticed.
Author Information
Mohamad H Danesh (Oregon State University)
Anurag Koul (Oregon State University)
Deep Reinforcement Learning + Explainable Artificial Intelligence
Alan Fern (Oregon State University)
Saeed Khorram (Oregon State University)
Related Events (a corresponding poster, oral, or spotlight)
-
2021 Spotlight: Re-understanding Finite-State Representations of Recurrent Policy Networks »
Wed. Jul 21st 12:20 -- 12:25 AM Room
More from the Same Authors
-
2021 : Out-of-Distribution Dynamics Detection: RL-Relevant Benchmarks and Results »
Mohamad H Danesh -
2021 : RL Explainability & Interpretability Panel »
Ofra Amir · Finale Doshi-Velez · Alan Fern · Zachary Lipton · Omer Gottesman · Niranjani Prasad -
2018 Poster: Visualizing and Understanding Atari Agents »
Samuel Greydanus · Anurag Koul · Jonathan Dodge · Alan Fern -
2018 Poster: Open Category Detection with PAC Guarantees »
Si Liu · Risheek Garrepalli · Thomas Dietterich · Alan Fern · Dan Hendrycks -
2018 Oral: Open Category Detection with PAC Guarantees »
Si Liu · Risheek Garrepalli · Thomas Dietterich · Alan Fern · Dan Hendrycks -
2018 Oral: Visualizing and Understanding Atari Agents »
Samuel Greydanus · Anurag Koul · Jonathan Dodge · Alan Fern