ainergy
/

CodeLlama-SDSAT_L7_13B

Text Generation

text-generation-inference

Model card Files Files and versions Community

Model Card for Model ID

The 13B model of "SDSAT: Accelerating LLM Inference through Speculative Decoding with Semantic Adaptive Tokens"

Model Details

Model Description

Developed by: ainergy
Language(s) (NLP): Code
Finetuned from model: CodeLlama-13B

Model Sources

Repository: https://github.com/ainergy-ml/SDSAT
Paper: https://arxiv.org/abs/2403.18647

Evaluation

Results

Walltime improvement

Downloads last month: 4

Safetensors

Model size

13B params

Tensor type

BF16

·

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ainergy/CodeLlama-SDSAT_L7_13B

Quantizations

1 model