VLLM support

#8
by potanin-marat - opened

Why not adapt the model to vllm, where gguf is not well supported, but AWQ works

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment