XLM-ProphetNet is an encoder-decoder model that can predict the next n tokens at each step ("ngram" language modeling) instead of
just the next token.
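
To make the idea of predicting n future tokens concrete, here is a minimal sketch in plain Python of what the training targets could look like. It is an illustration only, not the model's actual implementation: the helper `future_ngram_targets` and its `pad` parameter are hypothetical names introduced for this example.

```python
def future_ngram_targets(tokens, n, pad=None):
    """For each position t, collect the n tokens that follow it.

    Hypothetical helper for illustration; positions near the end of the
    sequence are filled with `pad` when fewer than n tokens remain.
    """
    targets = []
    for t in range(len(tokens)):
        # Tokens at positions t+1 .. t+n (may be shorter near the end).
        window = tokens[t + 1 : t + 1 + n]
        # Pad so every position has exactly n targets.
        window = window + [pad] * (n - len(window))
        targets.append(window)
    return targets


tokens = ["the", "cat", "sat", "down"]

# n=1 corresponds to ordinary next-token prediction.
next_token_targets = future_ngram_targets(tokens, n=1)

# n=2 asks for the next two tokens at every position.
bigram_targets = future_ngram_targets(tokens, n=2)

for token, target in zip(tokens, bigram_targets):
    print(token, "->", target)
```

With `n=1` each position has a single target, which is the usual next-token objective; larger `n` adds further-ahead targets at the same position.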