Spaces:

Ahmadzei
/

RAG

Runtime error

added 3 more tables for large emb model

5fa1a76 over 1 year ago

321 Bytes

	Using Longformer self attention, the memory and time complexity of the query-key matmul operation, which usually
	represents the memory and time bottleneck, can be reduced from \(\mathcal{O}(n_s \times n_s)\) to
	\(\mathcal{O}(n_s \times w)\), with \(n_s\) being the sequence length and \(w\) being the average window
	size.