Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
MBart-50 is created using the original mbart-large-cc25 checkpoint by extendeding
its embedding layers with randomly initialized vectors for an extra set of 25 language tokens and then pretrained on 50
languages.