File size: 196 Bytes
5fa1a76
 
 
1
2
3
To address this limitation, we introduce the Longformer with an attention
mechanism that scales linearly with sequence length, making it easy to process documents of thousands of tokens or
longer.