Transformers · GGUF · vllm · conversational

marco-giachin committed · Commit edfbbe0 · verified · 1 parent: 5407e25

Update README.md

Added quantization details

Files changed (1): README.md (+39, -1)
README.md CHANGED

extra_gated_description: >-
  our <a href="https://www.almawave.com/privacy-policy/">Privacy Policy</a>.
tags:
- vllm
base_model:
- Almawave/Velvet-14B
---

# Our quantization process

## DESCRIPTION

**This is a test quantization of Velvet-14B**, converted to GGUF format by modifying the llama.cpp conversion script so that it can use the model's tokenizer.json.

**Note:** As of today, llama.cpp does not support this model or its chat template (see https://github.com/ggerganov/llama.cpp/pull/11716).

**This model does not represent the intended quality of the original product.**

Tool used: <a href="https://github.com/ggerganov/llama.cpp/">llama.cpp</a>, commit b4677.

To perform this quantization, we started from llama.cpp and modified the file convert_hf_to_gguf_update.py to support this model.
Our modifications are based on the approach in PR https://github.com/ggerganov/llama.cpp/pull/11716.

Original model: https://huggingface.co/Almawave/Velvet-14B
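
For reference, a minimal sketch of the usual llama.cpp convert-then-quantize flow is shown below. It assumes the stock convert_hf_to_gguf.py entry point (with the modifications described above applied); the local checkpoint path and output filenames are illustrative, not the exact commands we ran:

```
# Convert a locally downloaded HF checkpoint to a BF16 GGUF
# (assumes the patched conversion script discussed above)
python convert_hf_to_gguf.py ./Velvet-14B --outtype bf16 --outfile Almawave-Velvet-14B-BF16.gguf

# Quantize the BF16 GGUF down to Q4_K_M
./llama-quantize Almawave-Velvet-14B-BF16.gguf Almawave-Velvet-14B-Q4_K_M.gguf Q4_K_M
```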

## PROMPT FORMAT

Basic prompt format:

```
<s><instruction>{prompt}</instruction>
```

Prompt format with system message:

```
<s><instruction>{system_prompt}
{prompt}</instruction>
```
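
As an illustration, the basic format can be passed directly to llama.cpp's llama-cli; the model path and prompt text below are placeholders:

```
# Run a one-shot completion with the quantized model
./llama-cli -m Almawave-Velvet-14B-Q4_K_M.gguf \
  -p "<s><instruction>What is the capital of Italy?</instruction>" -n 128
```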

## DOWNLOAD

| Quant | Link |
| ------ | ---- |
| BF16 | [Almawave-Velvet-14B-BF16.gguf](https://huggingface.co/SistInf/Velvet-14B-GGUF/blob/main/Almawave-Velvet-14B-BF16.gguf) |
| F16 | [Almawave-Velvet-14B-F16.gguf](https://huggingface.co/SistInf/Velvet-14B-GGUF/blob/main/Almawave-Velvet-14B-F16.gguf) |
| Q4_K_M | [Almawave-Velvet-14B-Q4_K_M.gguf](https://huggingface.co/SistInf/Velvet-14B-GGUF/blob/main/Almawave-Velvet-14B-Q4_K_M.gguf) |
| Q5_K_M | [Almawave-Velvet-14B-Q5_K_M.gguf](https://huggingface.co/SistInf/Velvet-14B-GGUF/blob/main/Almawave-Velvet-14B-Q5_K_M.gguf) |
| Q6_K | [Almawave-Velvet-14B-Q6_K.gguf](https://huggingface.co/SistInf/Velvet-14B-GGUF/blob/main/Almawave-Velvet-14B-Q6_K.gguf) |
| Q8_0 | [Almawave-Velvet-14B-Q8_0.gguf](https://huggingface.co/SistInf/Velvet-14B-GGUF/blob/main/Almawave-Velvet-14B-Q8_0.gguf) |
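
A single file can be fetched with the Hugging Face CLI, for example (the filename matches the table above):

```
# Download one quant from this repository into the current directory
huggingface-cli download SistInf/Velvet-14B-GGUF Almawave-Velvet-14B-Q4_K_M.gguf --local-dir .
```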
 
# Model Card for Velvet-14B