Skip to yearly menu bar Skip to main content


Knowledge Distillation for Efficient Sequences of Training Runs

Xingyu Liu ⋅ Xingyu Liu ⋅ Alexander Leonardi ⋅ Alexander Leonardi ⋅ Lu Yu ⋅ Lu Yu ⋅ Christopher Gilmer-Hill ⋅ Christopher Gilmer-Hill ⋅ Matthew Leavitt ⋅ Matthew Leavitt ⋅ Jonathan Frankle ⋅ Jonathan Frankle

Abstract

Chat is not available.