JPishikawa
/

Llama-3.3-Swallow-70B-Instruct-v0.4-W4A16

Text Generation

text-generation-inference

compressed-tensors

Model card Files Files and versions Community

This model is the quantized version of tokyotech-llm/Llama-3.3-Swallow-70B-Instruct-v0.4 by LLM Compressor.
This model adheres to the same licensing terms as the original model.

Downloads last month: 3

Safetensors

Model size

11.2B params

Tensor type

I64

·

I32

·

BF16

·

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for JPishikawa/Llama-3.3-Swallow-70B-Instruct-v0.4-W4A16

Base model

tokyotech-llm/Llama-3.3-Swallow-70B-Instruct-v0.4

Quantized

(9)

this model