Timezone: »
Drug discovery aims to find novel compounds with specified chemical property profiles. In terms of generative modeling, the goal is to learn to sample molecules in the intersection of multiple property constraints. This task becomes increasingly challenging when there are many property constraints. We propose to offset this complexity by composing molecules from a vocabulary of substructures that we call molecular rationales. These rationales are identified from molecules as substructures that are likely responsible for each property of interest. We then learn to expand rationales into a full molecule using graph generative models. Our final generative model composes molecules as mixtures of multiple rationale completions, and this mixture is fine-tuned to preserve the properties of interest. We evaluate our model on various drug design tasks and demonstrate significant improvements over state-of-the-art baselines in terms of accuracy, diversity, and novelty of generated compounds.
Author Information
Wengong Jin (MIT)
Regina Barzilay (MIT CSAIL)

Regina Barzilay is an Israeli-American computer scientist. She is a professor at the Massachusetts Institute of Technology and a faculty lead for artificial intelligence at the MIT Jameel Clinic. Her research interests are in natural language processing and applications of deep learning to chemistry and oncology.
Tommi Jaakkola (MIT)
More from the Same Authors
-
2023 : Optimizing protein fitness using Bi-level Gibbs sampling with Graph-based Smoothing »
Andrew Kirjner · Jason Yim · Raman Samusevich · Tommi Jaakkola · Regina Barzilay · Ila R. Fiete -
2023 : Optimizing protein fitness using Gibbs sampling with Graph-based Smoothing »
Andrew Kirjner · Jason Yim · Raman Samusevich · Tommi Jaakkola · Regina Barzilay · Ila R. Fiete -
2023 : Invited Talk by Tommi Jaakkola »
Tommi Jaakkola -
2023 Poster: PFGM++: Unlocking the Potential of Physics-Inspired Generative Models »
Yilun Xu · Ziming Liu · Yonglong Tian · Shangyuan Tong · Max Tegmark · Tommi Jaakkola -
2023 Poster: Towards Coherent Image Inpainting Using Denoising Diffusion Implicit Models »
Guanhua Zhang · Jiabao Ji · Yang Zhang · Mo Yu · Tommi Jaakkola · Shiyu Chang -
2023 Poster: SE(3) diffusion model with application to protein backbone generation »
Jason Yim · Brian Trippe · Valentin De Bortoli · Emile Mathieu · Arnaud Doucet · Regina Barzilay · Tommi Jaakkola -
2022 Workshop: Workshop on Machine Learning in Computational Design »
Andrew Spielberg · Caitlin Mueller · Lydia Chilton · Rafael Gomez-Bombarelli · Vladimir Kim · Daniel Ritchie · Wengong Jin -
2022 Poster: Learning Stable Classifiers by Transferring Unstable Features »
Yujia Bao · Shiyu Chang · Regina Barzilay -
2022 Poster: Antibody-Antigen Docking and Design via Hierarchical Structure Refinement »
Wengong Jin · Regina Barzilay · Tommi Jaakkola -
2022 Spotlight: Learning Stable Classifiers by Transferring Unstable Features »
Yujia Bao · Shiyu Chang · Regina Barzilay -
2022 Spotlight: Antibody-Antigen Docking and Design via Hierarchical Structure Refinement »
Wengong Jin · Regina Barzilay · Tommi Jaakkola -
2022 Poster: Conformal Prediction Sets with Limited False Positives »
Adam Fisch · Tal Schuster · Tommi Jaakkola · Regina Barzilay -
2022 Poster: EquiBind: Geometric Deep Learning for Drug Binding Structure Prediction »
Hannes Stärk · Octavian Ganea · Lagnajit Pattanaik · Regina Barzilay · Tommi Jaakkola -
2022 Spotlight: Conformal Prediction Sets with Limited False Positives »
Adam Fisch · Tal Schuster · Tommi Jaakkola · Regina Barzilay -
2022 Spotlight: EquiBind: Geometric Deep Learning for Drug Binding Structure Prediction »
Hannes Stärk · Octavian Ganea · Lagnajit Pattanaik · Regina Barzilay · Tommi Jaakkola -
2022 Invited Talk: Solving the Right Problems: Making ML Models Relevant to Healthcare and the Life Sciences »
Regina Barzilay -
2021 Poster: Few-Shot Conformal Prediction with Auxiliary Tasks »
Adam Fisch · Tal Schuster · Tommi Jaakkola · Regina Barzilay -
2021 Poster: Predict then Interpolate: A Simple Algorithm to Learn Stable Classifiers »
Yujia Bao · Shiyu Chang · Regina Barzilay -
2021 Spotlight: Few-Shot Conformal Prediction with Auxiliary Tasks »
Adam Fisch · Tal Schuster · Tommi Jaakkola · Regina Barzilay -
2021 Spotlight: Predict then Interpolate: A Simple Algorithm to Learn Stable Classifiers »
Yujia Bao · Shiyu Chang · Regina Barzilay -
2021 Poster: Information Obfuscation of Graph Neural Networks »
Peiyuan Liao · Han Zhao · Keyulu Xu · Tommi Jaakkola · Geoff Gordon · Stefanie Jegelka · Ruslan Salakhutdinov -
2021 Spotlight: Information Obfuscation of Graph Neural Networks »
Peiyuan Liao · Han Zhao · Keyulu Xu · Tommi Jaakkola · Geoff Gordon · Stefanie Jegelka · Ruslan Salakhutdinov -
2021 Poster: Learning Task Informed Abstractions »
Xiang Fu · Ge Yang · Pulkit Agrawal · Tommi Jaakkola -
2021 Spotlight: Learning Task Informed Abstractions »
Xiang Fu · Ge Yang · Pulkit Agrawal · Tommi Jaakkola -
2020 : Invited Talk: Tommi Jaakkola »
Tommi Jaakkola -
2020 Poster: Generalization and Representational Limits of Graph Neural Networks »
Vikas K Garg · Stefanie Jegelka · Tommi Jaakkola -
2020 Poster: Educating Text Autoencoders: Latent Representation Guidance via Denoising »
Tianxiao Shen · Jonas Mueller · Regina Barzilay · Tommi Jaakkola -
2020 Poster: Invariant Rationalization »
Shiyu Chang · Yang Zhang · Mo Yu · Tommi Jaakkola -
2020 Poster: Predicting deliberative outcomes »
Vikas K Garg · Tommi Jaakkola -
2020 Poster: Hierarchical Generation of Molecular Graphs using Structural Motifs »
Wengong Jin · Regina Barzilay · Tommi Jaakkola -
2020 Poster: Improving Molecular Design by Stochastic Iterative Target Augmentation »
Kevin Yang · Wengong Jin · Kyle Swanson · Regina Barzilay · Tommi Jaakkola -
2019 Poster: Functional Transparency for Structured Data: a Game-Theoretic Approach »
Guang-He Lee · Wengong Jin · David Alvarez-Melis · Tommi Jaakkola -
2019 Oral: Functional Transparency for Structured Data: a Game-Theoretic Approach »
Guang-He Lee · Wengong Jin · David Alvarez-Melis · Tommi Jaakkola -
2018 Poster: Junction Tree Variational Autoencoder for Molecular Graph Generation »
Wengong Jin · Regina Barzilay · Tommi Jaakkola -
2018 Oral: Junction Tree Variational Autoencoder for Molecular Graph Generation »
Wengong Jin · Regina Barzilay · Tommi Jaakkola -
2017 Poster: Learning Sleep Stages from Radio Signals: A Conditional Adversarial Architecture »
Mingmin Zhao · Shichao Yue · Dina Katabi · Tommi Jaakkola · Matt Bianchi -
2017 Talk: Learning Sleep Stages from Radio Signals: A Conditional Adversarial Architecture »
Mingmin Zhao · Shichao Yue · Dina Katabi · Tommi Jaakkola · Matt Bianchi -
2017 Poster: Sequence to Better Sequence: Continuous Revision of Combinatorial Structures »
Jonas Mueller · David Gifford · Tommi Jaakkola -
2017 Talk: Sequence to Better Sequence: Continuous Revision of Combinatorial Structures »
Jonas Mueller · David Gifford · Tommi Jaakkola -
2017 Poster: Deriving Neural Architectures from Sequence and Graph Kernels »
Tao Lei · Wengong Jin · Regina Barzilay · Tommi Jaakkola -
2017 Talk: Deriving Neural Architectures from Sequence and Graph Kernels »
Tao Lei · Wengong Jin · Regina Barzilay · Tommi Jaakkola