🌎 ⚡️ Inference - A notebook on how to quantize the Llama 2 model using GPTQ from the AutoGPTQ library.