Poster

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Tianzhe Chu ⋅ Yuexiang Zhai ⋅ Jihan Yang ⋅ Shengbang Tong ⋅ Saining Xie ⋅ Dale Schuurmans ⋅ Quoc Le ⋅ Sergey Levine ⋅ Yi Ma
2025 Poster

