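XLNet is not a traditional autoregressive model, but its training strategy builds on autoregressive language modeling: each token is still predicted from the tokens that come before it in a factorization order, only that order is a sampled permutation of the positions rather than fixed left to right.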
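
To make the idea concrete, here is a minimal sketch of how a permuted factorization order can be turned into an attention mask. It is an illustration only, not XLNet's actual implementation: the function name `permutation_mask` and its parameters are hypothetical, and details such as two-stream attention and partial prediction are left out.

```python
import random


def permutation_mask(seq_len, seed=0):
    """Sample a factorization order and build the matching visibility mask.

    Hypothetical helper for illustration; not part of any library.
    mask[i][j] is True when position i may use position j as context,
    i.e. when j comes strictly earlier than i in the sampled order.
    """
    rng = random.Random(seed)
    order = list(range(seq_len))
    rng.shuffle(order)  # the sampled prediction order over positions

    # rank[pos] = step at which position `pos` is predicted
    rank = [0] * seq_len
    for step, pos in enumerate(order):
        rank[pos] = step

    mask = [[rank[j] < rank[i] for j in range(seq_len)]
            for i in range(seq_len)]
    return order, mask


order, mask = permutation_mask(5, seed=0)
for pos in order:
    visible = [j for j in range(5) if mask[pos][j]]
    print(f"predict position {pos} from positions {visible}")
```

With the identity order (0, 1, 2, ...) this mask reduces to the usual left-to-right causal mask, which is the sense in which the strategy builds on autoregressive modeling. Because the tokens themselves stay in place and only the prediction order changes, a position can end up conditioned on context from both its left and its right.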