Safetensors
llama
text-generation-inference