Usage tips | |
[LEDForConditionalGeneration] is an extension of | |
[BartForConditionalGeneration] exchanging the traditional self-attention layer with | |
Longformer's chunked self-attention layer. |
Usage tips | |
[LEDForConditionalGeneration] is an extension of | |
[BartForConditionalGeneration] exchanging the traditional self-attention layer with | |
Longformer's chunked self-attention layer. |