add perplexity chart for v0.2 quants
- README.md +3 -1
- images/perplexity.png +3 -0

README.md CHANGED
@@ -32,13 +32,15 @@ Also thanks to all the folks in the quanting and inferencing community on [Beave
 ## *UPDATED RECIPES*
 Updated recipes with better (lower) perplexity, plus the world's smallest Kimi-K2-Instruct-smol-IQ1_KT at 219.375 GiB (1.835 BPW). Please ask any questions in [this discussion here](https://huggingface.co/ubergarm/Kimi-K2-Instruct-GGUF/discussions/4), thanks!
 
-
+Old versions are still available as described in the discussion at tag/revision v0.1.
 
 ## Quant Collection
 Compare with Perplexity of full size `Q8_0` 1016.623 GiB (8.504 BPW):
 
 Final estimate: PPL = 2.9507 +/- 0.01468
 
+![perplexity](images/perplexity.png)
+
 ### * v0.2 `IQ4_KS` 554.421 GiB (4.638 BPW)
 Final estimate: PPL = 2.9584 +/- 0.01473
 
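As a quick cross-check of the GiB/BPW figures quoted in the hunk above (this check is mine, not part of the repository): for a GGUF file, size in bytes is roughly n_params × BPW / 8, ignoring metadata and any tensors kept at higher precision. Back-solving from each quoted size/BPW pair gives roughly the same implied parameter count, consistent with Kimi-K2's roughly one-trillion-parameter scale:

```python
# Hypothetical sanity check, not from the repo: back-solve the implied parameter
# count from the (size, bits-per-weight) pairs quoted in the README diff above.
# size_bytes ~= n_params * bpw / 8, so n_params ~= size_gib * 2**30 * 8 / bpw.

def implied_params(size_gib: float, bpw: float) -> float:
    return size_gib * 2**30 * 8 / bpw

for name, size_gib, bpw in [
    ("Q8_0",        1016.623, 8.504),
    ("v0.2 IQ4_KS",  554.421, 4.638),
    ("smol-IQ1_KT",  219.375, 1.835),
]:
    print(f"{name:14s} -> ~{implied_params(size_gib, bpw):.3e} params")
# All three land near 1.03e12, i.e. roughly Kimi-K2's ~1T parameters.
```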
images/perplexity.png ADDED (binary image, stored with Git LFS)
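The commit adds the rendered chart but not the script that produced it. Below is a minimal, hypothetical matplotlib sketch of how a perplexity-vs-size chart could be drawn from the two PPL values quoted in this hunk; the output filename, labels, and axis choices are assumptions, not the author's actual tooling.

```python
# Hypothetical sketch of a perplexity-vs-BPW chart like images/perplexity.png,
# using only the two data points quoted in the README diff above. This is NOT
# the script the author used; names and styling are assumptions.
import matplotlib.pyplot as plt

quants = [  # (label, bits per weight, perplexity, +/- error)
    ("Q8_0",        8.504, 2.9507, 0.01468),
    ("v0.2 IQ4_KS", 4.638, 2.9584, 0.01473),
]

labels = [q[0] for q in quants]
bpw = [q[1] for q in quants]
ppl = [q[2] for q in quants]
err = [q[3] for q in quants]

fig, ax = plt.subplots()
ax.errorbar(bpw, ppl, yerr=err, fmt="o", capsize=4)
for x, y, label in zip(bpw, ppl, labels):
    ax.annotate(label, (x, y), textcoords="offset points", xytext=(6, 6))

ax.set_xlabel("Bits per weight (BPW)")
ax.set_ylabel("Perplexity")  # evaluation corpus is not stated in this hunk
ax.set_title("Kimi-K2-Instruct-GGUF: perplexity vs. quant size")
fig.savefig("perplexity.png", dpi=150, bbox_inches="tight")
```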