This model is a quantized version of tokyotech-llm/Llama-3.3-Swallow-70B-Instruct-v0.4, produced with LLM Compressor.
This model adheres to the same licensing terms as the original model.

Format: Safetensors
Model size: 11.2B params
Tensor types: I64, I32, BF16
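
The I32 tensor type listed above is consistent with 4-bit weights being stored packed inside 32-bit integers, eight values per word. The following is a minimal sketch of that idea only. The function names `pack_int4` and `unpack_int4` are hypothetical, and the low-nibble-first ordering is an assumption; the actual layout written by LLM Compressor may differ.

```python
def pack_int4(values):
    """Pack unsigned 4-bit integers (0..15) into 32-bit words, eight per word.

    Assumption for illustration: the first value occupies the lowest 4 bits.
    """
    if len(values) % 8 != 0:
        raise ValueError("length must be a multiple of 8")
    words = []
    for i in range(0, len(values), 8):
        word = 0
        for j, v in enumerate(values[i:i + 8]):
            if not 0 <= v <= 15:
                raise ValueError("value out of 4-bit range")
            # Shift each value into its own 4-bit slot of the word.
            word |= v << (4 * j)
        words.append(word)
    return words


def unpack_int4(words):
    """Inverse of pack_int4: recover eight 4-bit values from each word."""
    return [(w >> (4 * j)) & 0xF for w in words for j in range(8)]
```

Under this scheme, sixteen 4-bit weights occupy two 32-bit words, which is why a packed checkpoint reports far fewer stored elements than the original parameter count.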

Model tree for JPishikawa/Llama-3.3-Swallow-70B-Instruct-v0.4-W4A16

Quantized (9): this model