Raw, untrained sparse upcycle (8 experts, 1 shared) of Qwen/QwQ-32B. Note that until it is trained, it can't do anything beyond what baseline QwQ already does, and it needs much more compute to do it.

Unless you want to finetune this, it is not what you're looking for.

Downloads last month: 8

Safetensors

Model size: 250B params

Tensor type: BF16
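As a rough sanity check on the 250B figure, the sketch below estimates the parameter count of an upcycle that copies the dense MLP into every expert. The dimensions (hidden size 5120, intermediate size 27648, 64 layers), the approximate 32.8B dense total, the bias-free linear router, and the reading of "8 experts, 1 shared" as 8 routed experts plus 1 shared expert are all assumptions, not values stated in this card.

```python
# Back-of-the-envelope parameter count for a sparse upcycle.
# All constants below are ASSUMPTIONS (Qwen2.5-32B-style dimensions),
# not values taken from this model card.
HIDDEN = 5120                 # assumed hidden size
INTERMEDIATE = 27648          # assumed MLP intermediate size
LAYERS = 64                   # assumed number of transformer layers
DENSE_TOTAL = 32_800_000_000  # assumed approximate dense parameter count


def mlp_params(hidden: int, intermediate: int, layers: int) -> int:
    """Parameters in the gate, up and down projections across all layers (no biases)."""
    return 3 * hidden * intermediate * layers


def upcycled_params(dense_total: int, routed: int, shared: int) -> int:
    """Estimate the total size when every expert is a full copy of the dense MLP."""
    mlp = mlp_params(HIDDEN, INTERMEDIATE, LAYERS)
    non_mlp = dense_total - mlp          # attention, embeddings, norms
    router = HIDDEN * routed * LAYERS    # hypothetical bias-free linear router per layer
    return non_mlp + (routed + shared) * mlp + router


if __name__ == "__main__":
    total = upcycled_params(DENSE_TOTAL, routed=8, shared=1)
    print(f"estimated total: {total / 1e9:.1f}B parameters")
```

Under these assumptions the MLP blocks make up most of the dense model, so nine copies of them land close to the reported size, while the attention layers and embeddings are counted only once.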

Model tree for rAIfle/QwQonsortium-8x32B-RAW

Base model: Qwen/Qwen2.5-32B

Finetuned: Qwen/QwQ-32B

Finetuned (61): this model