5fa1a76
1
2
It is efficient at predicting masked tokens and at NLU in general, but is not optimal for text generation.