5fa1a76
1
The sequence classification head is a linear layer that accepts the final hidden states and performs a linear transformation to convert them into logits.