Model Card for Model ID

The 13B model of "SDSAT: Accelerating LLM Inference through Speculative Decoding with Semantic Adaptive Tokens"

Model Details

Model Description

  • Developed by: ainergy
  • Language(s) (NLP): Code
  • Finetuned from model: CodeLlama-13B

Model Sources

Evaluation

Results

image/png

image/png

Walltime improvement

image/png

Downloads last month
4
Safetensors
Model size
13B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for ainergy/CodeLlama-SDSAT_L7_13B

Quantizations
1 model