## How to cite

If you find our work helpful, please cite the paper:

```
@article{nakamura2025optimalsparsitymixtureofexpertslanguage,
      title={Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks},
      author={Taishi Nakamura and Satoki Ishikawa and Masaki Kawamura and Takumi Okamoto and Daisuke Nohara and Jun Suzuki and Rio Yokota},
      year={2025},
      eprint={2508.18672},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2508.18672},
}
```