XLM-ProphetNet is an encoder-decoder model and can predict n-future tokens for "ngram" language modeling instead of | |
just the next token. |
XLM-ProphetNet is an encoder-decoder model and can predict n-future tokens for "ngram" language modeling instead of | |
just the next token. |