Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
By default, it
will only label the first wordpiece of a word, and label the remaining wordpieces with -100, which is the
ignore_index of PyTorch's CrossEntropyLoss.