Skip to yearly menu bar Skip to main content


Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks

Taishi Nakamura · Satoki Ishikawa · Masaki Kawamura · Takumi Okamoto · Daisuke Nohara · Jun Suzuki · Rio Yokota

Abstract

Chat is not available.