The decoder output is passed to a language modeling head, a linear layer that projects each hidden state to a vector of logits over the vocabulary.
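As a rough illustration of that projection, here is a minimal pure-Python sketch. The function name `lm_head`, the toy sizes (hidden size 2, vocabulary size 3), the weight values, and the absence of a bias term are all assumptions made for the example, not details taken from any particular model.

```python
def lm_head(hidden_states, weight):
    """Project hidden states to logits with a bias-free linear map.

    hidden_states: list of vectors, one per position, each of length d_model.
    weight: list of vocab_size rows, each of length d_model (hypothetical values).
    Returns one logit vector of length vocab_size per position.
    """
    return [
        [sum(h * w for h, w in zip(hidden, row)) for row in weight]
        for hidden in hidden_states
    ]


# Toy decoder output: 2 positions, hidden size 2 (assumed for illustration).
hidden_states = [[1.0, 2.0], [0.5, -1.0]]

# Toy head weights: vocabulary size 3, hidden size 2 (assumed for illustration).
weight = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

logits = lm_head(hidden_states, weight)
print(logits)  # [[1.0, 2.0, 3.0], [0.5, -1.0, -0.5]]
```

Each position yields one logit per vocabulary entry; a softmax over that vector would typically turn the logits into next-token probabilities.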