Can you provide specific quantification methods and scripts?
1
#2 opened 3 days ago
by
yangbohust
Does vllm 0.8.4 support this quantized model?
1
#1 opened 9 days ago
by
traphix