The sequence classification head is a linear layer that accepts the final hidden states and performs a linear transformation to convert them into logits. |
The sequence classification head is a linear layer that accepts the final hidden states and performs a linear transformation to convert them into logits. |