Supervised training

```python
from transformers import MBartForConditionalGeneration, MBart50TokenizerFast

model = MBartForConditionalGeneration.from_pretrained("facebook/mbart-large-50")
tokenizer = MBart50TokenizerFast.from_pretrained("facebook/mbart-large-50", src_lang="en_XX", tgt_lang="ro_RO")

src_text = " UN Chief Says There Is No Military Solution in Syria"
tgt_text = "Şeful ONU declară că nu există o soluţie militară în Siria"

# text_target tokenizes the translation as the labels, using the target-language settings
model_inputs = tokenizer(src_text, text_target=tgt_text, return_tensors="pt")

model(**model_inputs)  # forward pass; the output includes the loss because labels are passed
```
Generation
To generate using the mBART-50 multilingual translation models, `eos_token_id` is used as the
`decoder_start_token_id`, and the target language id is forced as the first generated token.
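
The start-up convention described above can be illustrated without loading a model. The following is a minimal sketch in plain Python, not the library's implementation: `greedy_decode`, `toy_scores`, and the six-token vocabulary are hypothetical names and values introduced only for illustration. It shows the decoder being primed with the end-of-sequence id and the first generated token being overridden with the target language id.

```python
from typing import Callable, List


def greedy_decode(
    next_token_scores: Callable[[List[int]], List[float]],
    eos_token_id: int,
    forced_bos_token_id: int,
    max_new_tokens: int = 10,
) -> List[int]:
    """Greedy decoding that mimics the mBART-50 start-up convention (illustrative only)."""
    # The decoder is primed with eos_token_id, playing the role of decoder_start_token_id.
    decoder_ids = [eos_token_id]
    for step in range(max_new_tokens):
        if step == 0:
            # The first generated token is forced to the target language id,
            # whatever the model itself would have scored highest.
            next_id = forced_bos_token_id
        else:
            scores = next_token_scores(decoder_ids)
            next_id = max(range(len(scores)), key=scores.__getitem__)
        decoder_ids.append(next_id)
        # Stop once the model emits <eos> after the forced language token.
        if step > 0 and next_id == eos_token_id:
            break
    return decoder_ids


# Hypothetical vocabulary: 0=<eos>, 1=en_XX, 2=ro_RO, 3..5=ordinary words.
EOS, EN_XX, RO_RO = 0, 1, 2


def toy_scores(prefix: List[int]) -> List[float]:
    """Stand-in for a real decoder: emits tokens 3, 4, 5 and then <eos>."""
    order = [3, 4, 5, EOS]
    n_generated = len(prefix) - 2  # tokens produced after <eos> and the language id
    target = order[min(n_generated, len(order) - 1)]
    return [1.0 if i == target else 0.0 for i in range(6)]


result = greedy_decode(toy_scores, eos_token_id=EOS, forced_bos_token_id=RO_RO)
print(result)  # [0, 2, 3, 4, 5, 0]
```

The printed sequence begins with `<eos>` followed by the `ro_RO` id, which is the shape of decoder input the paragraph describes. With the real models, the equivalent step is passing the target language id as `forced_bos_token_id` to `generate`.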