Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
Transformers supports several quantization schemes to help you run inference with large language models (LLMs) and finetune adapters on quantized models.