Poster
in
Workshop: Challenges in Deployable Generative AI
Decision Stacks: Flexible Reinforcement Learning via Modular Generative Models
Siyan Zhao · Aditya Grover
Keywords: [ Generative Models ] [ Reinforcement Learning ] [ offline RL ] [ modularity ] [ sequential decision making ]
Deployment of reinforcement learning algorithms in real-world scenarios often presents numerous challenges such as dealing with complex goals, planning future observations and actions, and critiquing their utilities, demanding a balance between expressivity and flexible modeling for efficient learning and inference.We present Decision Stacks, a generative framework that decomposes goal-conditioned policy agents into 3 generative modules which simulate the temporal evolution of observations, rewards, and actions. Our framework guarantees both expressivity and flexibility in designing individual modules to account for key factors such as architectural bias, optimization objective and dynamics, transferrability across domains, and inference speed. Our empirical results demonstrate the effectiveness of Decision Stacks for offline policy optimization for several MDP and POMDP environments.