llm-jp 's Collections

Optimal Sparsity Math

Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks