CroissantLLM weights in Machine Learning Compilation (MLC) format, with q0f32 quantization.
Learn more about Machine Learning Compilation and how to use these weights here.
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support