Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
raw
history blame contribute delete
115 Bytes
If you're doing inference on a CPU with AutoGPTQ (version > 0.4.2), then you'll need to disable the ExLlama kernel.