Oral Talk
in
Workshop: Workshop on Theory of Mind in Communicating Agents
Language Models are Pragmatic Speakers
Khanh Nguyen
How do language models “think”? This paper formulates a probabilistic cognitive model called bounded pragmatic speaker that can characterize the operation of different variants of language models. In particular, we show that large language models fine-tuned with reinforcement learning from human feedback (Ouyang et al., 2022) implements a model of thought that conceptually resembles the well-known fast-and-slow model (Kahneman, 2011) which have been largely attributed to humans. We discuss the limitations of reinforcement learning from human feedback as a fast-and-slow model of thought and propose directions for extending this framework. Overall, our work demonstrates that viewing language models through the lens of modular probabilistic models can offer valuable insights for understanding, evaluating, and developing them.