AMD Instinct MI210 + vLLM fails to run this model, any solutions please? Are there any other DeepSeek-R1-671B models that can run successfully on AMD Instinct MI210 + vLLM? Thanks!
5
#33 opened 16 days ago by luciagan
A more stable startup command that is less prone to OOM.
1
#31 opened about 1 month ago by Piekey
The AWQ-quantized model may produce garbled characters when performing inference on long texts.
9
#24 opened about 2 months ago by wx111
Add instructions to run R1-AWQ on SGLang
2
#22 opened about 2 months ago by ganler

Requests get stuck when sending long prompts (already solved, but the cause is still unknown)
1
#18 opened 2 months ago by uv0xab
Are there any accuracy results compared to the original DeepSeek-R1?
2
#15 opened 2 months ago by traphix
Can anyone run this model with the SGLang framework?
5
#13 opened 2 months ago by muziyongshixin
Regarding the issue of inconsistent token counts
#12 opened 2 months ago by liguoyu3564
Max-Batch-Size, max-num-sequence, and fp_cache fp8_e4m3
#11 opened 2 months ago by BenFogerty