Timezone: »
Probabilistic Circuits (PCs) are a general and unified computational framework for tractable probabilistic models that support efficient computation of various inference tasks (e.g., computing marginal probabilities). Towards enabling such reasoning capabilities in complex real-world tasks, Liu et al. (2022) propose to distill knowledge (through latent variable assignments) from less tractable but more expressive deep generative models. However, it is still unclear what factors make this distillation work well. In this paper, we theoretically and empirically discover that the performance of a PC can exceed that of its teacher model. Therefore, instead of performing distillation from the most expressive deep generative model, we study what properties the teacher model and the PC should have in order to achieve good distillation performance. This leads to a generic algorithmic improvement as well as other data-type-specific ones over the existing latent variable distillation pipeline. Empirically, we outperform SoTA TPMs by a large margin on challenging image modeling benchmarks. In particular, on ImageNet32, PCs achieve 4.06 bits-per-dimension, which is only 0.34 behind variational diffusion models (Kingma et al., 2021).
Author Information
Xuejie Liu (Peking University; Tsinghua University)
Anji Liu
Guy Van den Broeck (University of California, Los Angeles)
Yitao Liang (Peking University)
More from the Same Authors
-
2023 : SQA3D: Situated Question Answering in 3D Scenes »
Xiaojian Ma · Silong Yong · Zilong Zheng · Qing Li · Yitao Liang · Song-Chun Zhu · Siyuan Huang -
2023 : Describe, Explain, Plan and Select: Interactive Planning with LLMs Enables Open-World Multi-Task Agents »
Zihao Wang · Shaofei Cai · Guanzhou Chen · Anji Liu · Xiaojian Ma · Yitao Liang -
2023 : A Pseudo-Semantic Loss for Deep Generative Models with Logical Constraints »
Kareem Ahmed · Kai-Wei Chang · Guy Van den Broeck -
2023 : Describe, Explain, Plan and Select: Interactive Planning with LLMs Enables Open-World Multi-Task Agents »
Zihao Wang · Shaofei Cai · Guanzhou Chen · Anji Liu · Xiaojian Ma · Yitao Liang -
2023 : Collapsed Inference for Bayesian Deep Learning »
Zhe Zeng · Guy Van den Broeck -
2023 : SIMPLE: A Gradient Estimator for $k$-subset Sampling »
Kareem Ahmed · Zhe Zeng · Mathias Niepert · Guy Van den Broeck -
2023 : Probabilistic Task-Adaptive Graph Rewiring »
Chendi Qian · Andrei Manolache · Kareem Ahmed · Zhe Zeng · Guy Van den Broeck · Mathias Niepert · Christopher Morris -
2023 : A Unified Approach to Count-Based Weakly-Supervised Learning »
Vinay Shukla · Zhe Zeng · Kareem Ahmed · Guy Van den Broeck -
2023 : Panel on Reasoning Capabilities of LLMs »
Guy Van den Broeck · Ishita Dasgupta · Subbarao Kambhampati · Jiajun Wu · Xi Victoria Lin · Samy Bengio · Beliz Gunel -
2023 : AI can Learn from Data. But can it Learn to Reason? »
Guy Van den Broeck -
2023 Oral: Tractable Control for Autoregressive Language Generation »
Honghua Zhang · Meihua Dang · Nanyun Peng · Guy Van den Broeck -
2023 Poster: Tractable Control for Autoregressive Language Generation »
Honghua Zhang · Meihua Dang · Nanyun Peng · Guy Van den Broeck -
2022 : Session 3: New Computational Technologies for Reasoning »
Armando Solar-Lezama · Guy Van den Broeck · Jan-Willem van de Meent · Charles Sutton -
2022 : PA-GNN: Parameter-Adaptive Graph Neural Networks »
Yuxin Yang · Yitao Liang · Muhan Zhang -
2021 Poster: Probabilistic Generating Circuits »
Honghua Zhang · Brendan Juba · Guy Van den Broeck -
2021 Oral: Probabilistic Generating Circuits »
Honghua Zhang · Brendan Juba · Guy Van den Broeck -
2020 : On the Relationship Between Probabilistic Circuits and Determinantal Point Processes »
Honghua Zhang · Steven Holtzen · Guy Van den Broeck -
2020 Poster: Einsum Networks: Fast and Scalable Learning of Tractable Probabilistic Circuits »
Robert Peharz · Steven Lang · Antonio Vergari · Karl Stelzner · Alejandro Molina · Martin Trapp · Guy Van den Broeck · Kristian Kersting · Zoubin Ghahramani -
2020 Poster: Scaling up Hybrid Probabilistic Inference with Logical and Arithmetic Constraints via Message Passing »
Zhe Zeng · Paolo Morettin · Fanqi Yan · Antonio Vergari · Guy Van den Broeck -
2018 Poster: Sound Abstraction and Decomposition of Probabilistic Programs »
Steven Holtzen · Guy Van den Broeck · Todd Millstein -
2018 Oral: Sound Abstraction and Decomposition of Probabilistic Programs »
Steven Holtzen · Guy Van den Broeck · Todd Millstein -
2018 Poster: A Semantic Loss Function for Deep Learning with Symbolic Knowledge »
Jingyi Xu · Zilu Zhang · Tal Friedman · Yitao Liang · Guy Van den Broeck -
2018 Oral: A Semantic Loss Function for Deep Learning with Symbolic Knowledge »
Jingyi Xu · Zilu Zhang · Tal Friedman · Yitao Liang · Guy Van den Broeck