Committed by ubergarm
Commit a518c83 · 1 parent: 3d187fa

initial commit

Files changed (2)
  1. .gitattributes +3 -0
  2. README.md +18 -0
.gitattributes CHANGED
@@ -33,3 +33,6 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
  *.zip filter=lfs diff=lfs merge=lfs -text
  *.zst filter=lfs diff=lfs merge=lfs -text
  *tfevents* filter=lfs diff=lfs merge=lfs -text
+ *.gguf filter=lfs diff=lfs merge=lfs -text
+ *.png filter=lfs diff=lfs merge=lfs -text
+ imatrix-*.dat filter=lfs diff=lfs merge=lfs -text
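To see which files the three added patterns would route through Git LFS, here is a minimal Python sketch. It assumes that slash-free gitattributes patterns are matched against a file's basename at any directory depth, and it uses `fnmatchcase` as an approximation of that glob behaviour. The helper name `tracked_by_lfs` and the example file names are hypothetical and do not come from the repository.

```python
from fnmatch import fnmatchcase
from pathlib import PurePosixPath

# The three patterns added to .gitattributes in this commit.
LFS_PATTERNS = ["*.gguf", "*.png", "imatrix-*.dat"]


def tracked_by_lfs(path: str) -> bool:
    """Return True if the basename of `path` matches one of the added patterns.

    Assumption: for these simple, slash-free globs, case-sensitive fnmatch on
    the basename approximates how Git applies the patterns.
    """
    name = PurePosixPath(path).name
    return any(fnmatchcase(name, pattern) for pattern in LFS_PATTERNS)


if __name__ == "__main__":
    # Hypothetical file names, chosen only to exercise each pattern.
    for candidate in [
        "imatrix.gguf",
        "images/plot.png",
        "imatrix-example.dat",
        "imatrix.dat",
        "README.md",
    ]:
        print(candidate, tracked_by_lfs(candidate))
```

Under these assumptions, an `imatrix.gguf` file is covered by `*.gguf`, while a plain `imatrix.dat` is not covered by `imatrix-*.dat` because that pattern requires the hyphen.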
README.md CHANGED
@@ -1,5 +1,23 @@
  ---
+ quantized_by: ubergarm
+ pipeline_tag: text-generation
+ base_model: moonshotai/Kimi-K2-Instruct
  license: other
  license_name: modified-mit
  license_link: https://huggingface.co/moonshotai/Kimi-K2-Instruct/raw/main/LICENSE
+ base_model_relation: quantized
+ tags:
+ - mla
+ - imatrix
+ - conversational
+ - ik_llama.cpp
  ---
+
+ ## Work In Progress
+ Hoping to first release at least a new `imatrix.gguf` so other folks can quantize their own mainline quants, then follow up with some of ik_llama.cpp's SOTA quants targeting "smaller" rigs.
+
+
+ Follow along at home:
+ * https://github.com/ggml-org/llama.cpp/pull/14654
+ * https://huggingface.co/gabriellarson/Kimi-K2-Instruct-GGUF/discussions/1
+ * https://github.com/ggml-org/llama.cpp/pull/9400