BLOOM's architecture is essentially the same as GPT-3's (an autoregressive model for next-token prediction), but it was trained on 46 natural languages and 13 programming languages.
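To make "autoregressive next-token prediction" concrete, here is a minimal sketch of a greedy decoding loop. It does not use BLOOM itself: the bigram score table, `toy_next_token_scores`, and `greedy_generate` are hypothetical stand-ins invented for illustration, and a real model would score the next token from the full context with a neural network rather than a lookup on the last token.

```python
from typing import Callable, Dict, List

# Hypothetical toy "model" (an assumption for illustration, not BLOOM):
# maps the most recent token to scores for each candidate next token.
TOY_BIGRAM_SCORES: Dict[str, Dict[str, float]] = {
    "<s>": {"hello": 0.9, "world": 0.1},
    "hello": {"world": 0.8, "</s>": 0.2},
    "world": {"</s>": 1.0},
}


def toy_next_token_scores(context: List[str]) -> Dict[str, float]:
    """Return next-token scores given the context (only the last token is used here)."""
    return TOY_BIGRAM_SCORES[context[-1]]


def greedy_generate(
    score_fn: Callable[[List[str]], Dict[str, float]],
    prompt: List[str],
    max_new_tokens: int = 10,
    eos: str = "</s>",
) -> List[str]:
    """Autoregressive loop: predict one token, append it, and feed the result back in."""
    tokens = list(prompt)
    for _ in range(max_new_tokens):
        scores = score_fn(tokens)
        # Greedy choice: take the highest-scoring candidate.
        next_token = max(scores, key=scores.get)
        tokens.append(next_token)
        if next_token == eos:
            break
    return tokens


if __name__ == "__main__":
    print(greedy_generate(toy_next_token_scores, ["<s>"]))
```

The key property is that each generated token becomes part of the input for the next step, so the output is built strictly left to right.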