are pretrained transformer models initially trained to predict the 
next token given some input text.
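
As a rough illustration of what "predict the next token" means, the sketch below uses a toy bigram counter, not a transformer. The corpus and the function names `train_bigram` and `predict_next` are hypothetical and chosen only for this example. A real pretrained model learns a far richer probability distribution over a large vocabulary, but the interface is the same: given the preceding text, return a likely next token.

```python
from collections import Counter, defaultdict


def train_bigram(tokens):
    """Count how often each token follows each other token."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts


def predict_next(counts, token):
    """Return the most frequent follower of `token`, or None if it was never seen."""
    followers = counts.get(token)
    if not followers:
        return None
    return followers.most_common(1)[0][0]


# Hypothetical toy corpus, split on whitespace as a stand-in for a real tokenizer.
corpus = "the cat sat on the mat and the cat slept".split()
model = train_bigram(corpus)

print(predict_next(model, "the"))  # prints "cat" ("cat" follows "the" twice, "mat" once)
```

Picking the single most frequent follower corresponds to greedy decoding; sampling from the counts instead would give more varied continuations.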