Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
XLNET combines the best of both BERT and GPT-2's pretraining objectives by using a permutation language modeling objective (PLM) that allows it to learn bidirectionally.