This repo includes .gguf built for HuggingFace/Candle. They will not work with llama.cpp.

Refer to the original repo for more details.

Downloads last month
24
Safetensors
Model size
2.78B params
Tensor type
F16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support