Why not adapt the model to vllm, where gguf is not well supported, but AWQ works
· Sign up or log in to comment