Spaces:

Ahmadzei
/

RAG

Runtime error

added 3 more tables for large emb model

5fa1a76 over 1 year ago

138 Bytes

Some preselected input tokens are still given global attention, but the attention matrix has way less parameters, resulting in a speed-up.