Authors: David Dohan, Aitor Lewkowycz, Jacob Austin, Winnie Xu, Yuhuai Wu, David Bieber, Raphael Gontijo-Lopes, Henryk Michalewski, Rif A. Saurous, Jascha Sohl-Dickstein, Kevin Patrick Murphy, Charles Sutton
Abstract: Prompted models have demonstrated impressive few-shot learning abilities. Repeated interactions at test-time with a single model, or the composition of multiple models together, further expand capabilities. These compositions are probabilistic models, and may be expressed in the language of graphical models with random variables whose values are complex data types such as strings. Cases with control flow and dynamic structure require techniques from probabilistic programming, and allow implementing disparate model structures and inference strategies in a unified language. We describe several existing techniques from this perspective, including scratchpads and chain of thought, verifiers, STaR, selection-inference, and tool use. We refer to the resulting programs as language model cascades.
Author Information
David Dohan (Google)
Winnie Xu (University of Toronto)

Winnie recently graduated with an H.BSc from the University of Toronto, where she majored in Computer Science and specialized in Artificial Intelligence. Her research interests span generative models with probabilistic interpretations and differentiable numerical algorithms. As an undergraduate, she researched latent variable models, variational inference, and Neural ODEs / SDEs with David Duvenaud. She is currently a student researcher at Google Brain, collaborating with Stanford University, where she works on efficient methods for training diffusion models and on Bayesian program induction with large language models for reasoning tasks. She has also recently collaborated with Nvidia Research, Oxford (OATML), and Cohere AI on topics in robotics, large language models, and NLP.
More from the Same Authors
- 2022 : [Poster] Self–Similarity Priors: Neural Collages as Differentiable Fractal Representations
  Winnie Xu
- 2023 Poster: Large Language Models Can Be Easily Distracted by Irrelevant Context
  Haoyue Shi · Xinyun Chen · Kanishka Misra · Nathan Scales · David Dohan · Ed Chi · Nathanael Schärli · Denny Zhou
- 2022 : Contributed Spotlight Talks: Part 1
  David Dohan · Winnie Xu · Sugandha Sharma · Tan Zhi-Xuan
- 2022 Poster: Prioritized Training on Points that are Learnable, Worth Learning, and not yet Learnt
  Sören Mindermann · Jan Brauner · Muhammed Razzak · Mrinank Sharma · Andreas Kirsch · Winnie Xu · Benedikt Höltgen · Aidan Gomez · Adrien Morisot · Sebastian Farquhar · Yarin Gal
- 2022 Spotlight: Prioritized Training on Points that are Learnable, Worth Learning, and not yet Learnt
  Sören Mindermann · Jan Brauner · Muhammed Razzak · Mrinank Sharma · Andreas Kirsch · Winnie Xu · Benedikt Höltgen · Aidan Gomez · Adrien Morisot · Sebastian Farquhar · Yarin Gal
- 2022 : Poster Session 2
  Asra Aslam · Sowmya Vijayakumar · Heta Gandhi · Mary Adewunmi · You Cheng · Tong Yang · Kristina Ulicna · Weiwei Zong · Narmada Naik · Akshata Tiwari · Ambreen Hamadani · Mayuree Binjolkar · Charupriya Sharma · Chhavi Yadav · Yu Yang · Winnie Xu · QINGQING ZHAO · Julissa Giuliana Villanueva Llerena · Lilian Mkonyi · Berthine Nyunga Mpinda · Rehema Mwawado · Tooba Imtiaz · Desi Ivanova · Emma Johanna Mikaela Petersson Svensson · Angela Bitto-Nemling · Elisabeth Rumetshofer · Ana Sanchez Fernandez · Garima Giri · Sigrid Passano Hellan · Catherine Ordun · Vasiliki Tassopoulou · Gina Wong
- 2022 : Poster Session 1
  Asra Aslam · Sowmya Vijayakumar · Heta Gandhi · Mary Adewunmi · You Cheng · Tong Yang · Kristina Ulicna · Weiwei Zong · Narmada Naik · Akshata Tiwari · Ambreen Hamadani · Mayuree Binjolkar · Charupriya Sharma · Chhavi Yadav · Yu Yang · Winnie Xu · QINGQING ZHAO · Julissa Giuliana Villanueva Llerena · Lilian Mkonyi · Berthine Nyunga Mpinda · Rehema Mwawado · Tooba Imtiaz · Desi Ivanova · Emma Johanna Mikaela Petersson Svensson · Angela Bitto-Nemling · Elisabeth Rumetshofer · Ana Sanchez Fernandez · Garima Giri · Sigrid Passano Hellan · Catherine Ordun · Vasiliki Tassopoulou · Gina Wong
- 2021 Poster: Latent Programmer: Discrete Latent Codes for Program Synthesis
  Joey Hong · David Dohan · Rishabh Singh · Charles Sutton · Manzil Zaheer
- 2021 Oral: Latent Programmer: Discrete Latent Codes for Program Synthesis
  Joey Hong · David Dohan · Rishabh Singh · Charles Sutton · Manzil Zaheer
- 2020 Poster: Population-Based Black-Box Optimization for Biological Sequence Design
  Christof Angermueller · David Belanger · Andreea Gane · Zelda Mariet · David Dohan · Kevin Murphy · Lucy Colwell · D. Sculley