Spaces:

Ahmadzei
/

RAG

Runtime error

added 3 more tables for large emb model

5fa1a76 over 1 year ago

359 Bytes

	If you are using throughput (you want to run your model on a bunch of static data), on GPU, then:

	If you have no clue about the size of the sequence_length ("natural" data), by default don't batch, measure and
	try tentatively to add it, add OOM checks to recover when it will fail (and it will at some point if you don't
	control the sequence_length.)