ICML 2022


Knowledge Retrieval and Language Models

Maithra Raghu · Urvashi Khandelwal · Chiyuan Zhang · Matei Zaharia · Alexander Rush

Hall F

In just the past couple of years, we have seen significant advances in the capabilities of (Large) Language Models. One of the most striking capabilities of these systems is knowledge retrieval: Language Models can answer a diverse set of questions that differ substantially both in the domain knowledge needed for their responses and in their input structure. The precise methods for knowledge retrieval vary, from the language model directly generating a response (parametric approaches), to a combination of generation and referencing an external knowledge corpus (e.g., retrieval-augmented generation), to primarily using an external knowledge corpus with language model embeddings (semi-parametric approaches).

Despite the rapid advances, many pressing open questions remain on the limits of knowledge retrieval with language models and on the connections between these different approaches. How factual are generated responses, and how does this vary with question complexity, model scale, and, importantly, different methods of knowledge retrieval? How important is the role of (self-supervised/supervised) pretraining? What are the tradeoffs between few-shot (prompt-based) approaches and finetuning when adapting to novel domains? And, relatedly, to what extent do different knowledge retrieval approaches generalize to unseen settings?

This workshop seeks to bring together a diverse set of researchers across NLP, Machine Learning, and Theory to discuss these questions. We hope to share current findings and challenges, identify promising directions for future study, and, most importantly, build a community around this topic at this pivotal time.
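The retrieval-augmented pattern described above can be sketched in a few lines. The sketch below is purely illustrative: it stands in a toy bag-of-words similarity for learned embeddings, a three-document list for a real knowledge corpus, and simply returns the top retrieved passage where a real system would condition a language model on it. All names (`embed`, `retrieve`, `answer`, `CORPUS`) are hypothetical, not from any particular library.

```python
import math
import re
from collections import Counter

def embed(text):
    # Toy "embedding": term-frequency bag of words.
    # A real retriever would use a learned dense encoder.
    return Counter(re.findall(r"\w+", text.lower()))

def cosine(a, b):
    # Cosine similarity between two sparse term-count vectors.
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

# Stand-in for an external knowledge corpus.
CORPUS = [
    "The Eiffel Tower is located in Paris, France.",
    "The Great Wall of China is over 13,000 miles long.",
    "Python was created by Guido van Rossum.",
]

def retrieve(query, corpus, k=1):
    # Rank corpus passages by similarity to the query; return top k.
    q = embed(query)
    ranked = sorted(corpus, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]

def answer(query):
    # In a real retrieval-augmented system, the retrieved passages and
    # the query would be fed to a language model to generate a response;
    # here we just return the best-matching passage.
    return retrieve(query, CORPUS, k=1)[0]

print(answer("Where is the Eiffel Tower?"))
```

In this framing, a purely parametric approach would skip `retrieve` entirely and rely on the model's weights, while a semi-parametric approach would lean primarily on the corpus, using the model mainly to produce the embeddings used for ranking.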
