Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

moonshotai
/
Moonlight-16B-A3B

Text Generation
Transformers
Safetensors
deepseek_v3
conversational
custom_code
text-generation-inference
Model card Files Files and versions Community
6
Moonlight-16B-A3B / figures
Ctrl+K
Ctrl+K
  • 4 contributors
History: 1 commit
liushaowei
add figures
b13722c 3 months ago
  • banner.png
    48.8 kB
    add figures 3 months ago
  • banner_short.png
    26.9 kB
    add figures 3 months ago
  • chinlaw_8k_flops_ratio.png
    145 kB
    add figures 3 months ago
  • fig_MMLU_performance.png
    225 kB
    add figures 3 months ago
  • fig_weight_decay.png
    416 kB
    add figures 3 months ago
  • logo.png
    13.1 kB
    add figures 3 months ago
  • megatron.png
    1.99 kB
    add figures 3 months ago
  • scaling.png
    224 kB
    add figures 3 months ago