Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
Assigning the label -100 to the special tokens [CLS] and [SEP] so they're ignored by the PyTorch loss function (see CrossEntropyLoss).