This repo includes .gguf files built for HuggingFace/Candle. They will not work with llama.cpp.

Refer to the original repo for more details.
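Before loading one of the .gguf files, it can be useful to confirm that the download is a well-formed GGUF container. The following is a minimal sketch, not part of this repo: the function name `read_gguf_header` is hypothetical, and it assumes a little-endian file using GGUF version 2 or later, where the header is the 4-byte magic `GGUF`, a 32-bit version, and two 64-bit counts. It only inspects the header; it does not tell you whether a file is compatible with Candle or llama.cpp.

```python
import struct

GGUF_MAGIC = b"GGUF"


def read_gguf_header(path):
    """Return (version, tensor_count, metadata_kv_count) from a GGUF header.

    Hypothetical helper for illustration. Assumes a little-endian file
    with GGUF version >= 2 (64-bit counts).
    """
    with open(path, "rb") as f:
        magic = f.read(4)
        if magic != GGUF_MAGIC:
            raise ValueError(f"not a GGUF file: magic={magic!r}")

        # The version is an unsigned 32-bit integer right after the magic.
        (version,) = struct.unpack("<I", f.read(4))
        if version < 2:
            raise ValueError(f"unsupported GGUF version: {version}")

        # Tensor count and metadata key/value count are unsigned 64-bit.
        tensor_count, kv_count = struct.unpack("<QQ", f.read(16))

    return version, tensor_count, kv_count
```

Calling `read_gguf_header("model.gguf")` (the filename is a placeholder) returns the three header fields, or raises `ValueError` if the magic bytes or version do not match.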

Safetensors

Model size: 2.78B params

Tensor type: F16
