Poster
Hierarchical Diffusion for Offline Decision Making
Wenhao Li · Xiangfeng Wang · Bo Jin · Hongyuan Zha
Offline reinforcement learning typically introduces a hierarchical structure to solve long-horizon problems and thereby address the thorny issue of variance accumulation. The problems of the deadly triad, limited data, and reward sparsity nevertheless remain, making the design of effective hierarchical offline RL algorithms for general-purpose policy learning a formidable challenge. In this paper, we first formulate the problem of offline long-horizon decision-$\mathbf{M}$ak$\mathbf{I}$ng from the perspective of conditional generative modeling by incorporating goals into the control-as-inference graphical model. A $\mathbf{H}$ierarchical trajectory-level $\mathbf{D}$iffusion probabilistic model with classifier-free guidance is then proposed. HDMI employs a cascade framework that uses a reward-conditional goal diffuser for subgoal discovery and a goal-conditional trajectory diffuser for generating the action sequence corresponding to the subgoals. Planning-based subgoal extraction and transformer-based diffusion are employed to deal with sub-optimal data pollution and long-range subgoal dependencies in the goal diffusion. Numerical experiments verify the advantages of HDMI on long-horizon decision-making compared to SOTA offline RL methods and conditional generative models.
Author Information
Wenhao Li (The Chinese University of Hong Kong, Shenzhen)
Xiangfeng Wang
Bo Jin (East China Normal University)
Hongyuan Zha (Shenzhen Institute of Artificial Intelligence and Robotics for Society; The Chinese University of Hong Kong, Shenzhen)
More from the Same Authors
- 2023: Temporally-Extended Prompts Optimization for SAM in Interactive Medical Image Segmentation
  Chuyun Shen · Wenhao Li · Ya Zhang · Xiangfeng Wang
- 2023 Poster: SMURF-THP: Score Matching-based UnceRtainty quantiFication for Transformer Hawkes Process
  Zichong Li · Yanbo Xu · Simiao Zuo · Haoming Jiang · Chao Zhang · Tuo Zhao · Hongyuan Zha
- 2022 Poster: Hessian-Free High-Resolution Nesterov Acceleration For Sampling
  Ruilin Li · Hongyuan Zha · Molei Tao
- 2022 Spotlight: Hessian-Free High-Resolution Nesterov Acceleration For Sampling
  Ruilin Li · Hongyuan Zha · Molei Tao
- 2021 Poster: Towards Open-World Recommendation: An Inductive Model-based Collaborative Filtering Approach
  Qitian Wu · Hengrui Zhang · Xiaofeng Gao · Junchi Yan · Hongyuan Zha
- 2021 Spotlight: Towards Open-World Recommendation: An Inductive Model-based Collaborative Filtering Approach
  Qitian Wu · Hengrui Zhang · Xiaofeng Gao · Junchi Yan · Hongyuan Zha