GPT2, as well as the pretrained decoder part of sequence-to-sequence models, e.g.