Skip to yearly menu bar Skip to main content


Generalization vs. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data

Antonis Antoniades · Xinyi Wang · Yanai Elazar · Alfonso Amayuelas · Alon Albalak · Kexun Zhang · William Wang

Abstract

Chat is not available.