Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
raw
history blame contribute delete
116 Bytes
GPT-J was followed by OPT, a family of decoder-only models, the largest of which is 175B and trained on 180B tokens.