JPishikawa
/

Llama-3.3-Swallow-70B-Instruct-v0.4-FP8-Dynamic

Text Generation

text-generation-inference

compressed-tensors

Model card Files Files and versions Community

This model is the quantized version of tokyotech-llm/Llama-3.3-Swallow-70B-Instruct-v0.4 by LLM Compressor.
This model adheres to the same licensing terms as the original model.

Downloads last month: 24

Safetensors

Model size

70.6B params

Tensor type

BF16

·

F8_E4M3

·

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for JPishikawa/Llama-3.3-Swallow-70B-Instruct-v0.4-FP8-Dynamic

Base model

tokyotech-llm/Llama-3.3-Swallow-70B-Instruct-v0.4

Quantized

(9)

this model