Skip to yearly menu bar Skip to main content


Poster

MergeMix: Optimizing Mid-Training Data Mixtures via Learnable Model Merging

jiapeng wang ⋅ Changxin Tian ⋅ Kunlong Chen ⋅ ziqi liu ⋅ Jiaxin Mao ⋅ Xin Zhao ⋅ Zhiqiang Zhang ⋅ JUN ZHOU

Abstract

Log in and register to view live content