Skip to yearly menu bar Skip to main content


Oral Talk
in
Workshop: Workshop on Theory of Mind in Communicating Agents

Language Models are Pragmatic Speakers

Khanh Nguyen


Abstract:

How do language models “think”? This paper formulates a probabilistic cognitive model called bounded pragmatic speaker that can characterize the operation of different variants of language models. In particular, we show that large language models fine-tuned with reinforcement learning from human feedback (Ouyang et al., 2022) implements a model of thought that conceptually resembles the well-known fast-and-slow model (Kahneman, 2011) which have been largely attributed to humans. We discuss the limitations of reinforcement learning from human feedback as a fast-and-slow model of thought and propose directions for extending this framework. Overall, our work demonstrates that viewing language models through the lens of modular probabilistic models can offer valuable insights for understanding, evaluating, and developing them.

Chat is not available.