For MBart-50 the language id token is used as a prefix | |
for both source and target text i.e the text format is [lang_code] X [eos], where lang_code is source | |
language id for source text and target language id for target text, with X being the source or target text | |
respectively. |