quantization? fine-tune?
#68
opened by tcporco
Has anyone made a quantized version of this, or fine-tuned it? Has anyone ever run it on anything other than 8 A100s?
Exactly what I was wondering. If not, I don't understand how this differs from just using the chatbot. It's just an extension of the chatbot, offered as an API so they can charge for it.