Rawdog sparse upcycle (8 experts, 1 shared) of Qwen/QwQ-32B. Note that it'll need training to do anything beyond what baseline QwQ can do at much higher compute requirements.

Unless you want to finetune this, it is not what you're looking for.

Unless you want to finetune this, it is not what you're looking for.

Unless you want to finetune this, it is not what you're looking for.

Downloads last month
8
Safetensors
Model size
250B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for rAIfle/QwQonsortium-8x32B-RAW

Base model

Qwen/Qwen2.5-32B
Finetuned
Qwen/QwQ-32B
Finetuned
(61)
this model