	In this work, we present our techniques for training very large transformer models and implement a simple,
	efficient intra-layer model parallel approach that enables training transformer models with billions of parameters.
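
As a rough illustration of what intra-layer model parallelism can mean, the sketch below splits the weight matrix of a single linear layer by columns across two simulated devices, computes each shard's output independently, and gathers the results. This is a toy under stated assumptions, not the implementation described in this work: the helper names (`matmul`, `split_columns`, `concat_columns`), the two-way split, and the tiny matrices are all hypothetical, and a real system would run the shards on separate GPUs with a communication step in place of the list concatenation.

```python
# Hypothetical sketch: column-wise split of one linear layer across two
# simulated "devices". Pure Python lists stand in for device tensors.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def split_columns(w, parts):
    """Split matrix w into `parts` equal column blocks, one per device."""
    width = len(w[0]) // parts
    return [[row[i * width:(i + 1) * width] for row in w] for i in range(parts)]

def concat_columns(blocks):
    """Gather the per-device outputs back into a single matrix."""
    return [sum((block[r] for block in blocks), []) for r in range(len(blocks[0]))]

# Input activations (batch x hidden) and the full weight (hidden x out).
x = [[1.0, 2.0], [3.0, 4.0]]
w = [[1.0, 0.0, 2.0, 1.0], [0.0, 1.0, 1.0, 3.0]]

shards = split_columns(w, parts=2)                 # each device holds half the columns
partial = [matmul(x, shard) for shard in shards]   # computed independently per device
y_parallel = concat_columns(partial)               # stands in for an all-gather
y_full = matmul(x, w)                              # unsplit reference result

print(y_parallel == y_full)
```

Because each device only stores and multiplies its own column block, the per-device memory for this layer's weights drops roughly in proportion to the number of shards, which is the property that makes the approach attractive for very large layers.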