Spaces:

Ahmadzei
/

RAG

Runtime error

RAG

File size: 114 Bytes

5fa1a76

4-bit quantization compresses a model even further, and it is commonly used with QLoRA to finetune quantized LLMs.