Meanwhile, it integrates a spatial-aware self-attention mechanism into the Transformer architecture, so that the model can fully understand the relative positional relationships among different text blocks.
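
As a rough illustration of the idea, the PyTorch sketch below adds learnable relative 1D and 2D position biases to the attention scores before the softmax. The class name, bucketing scheme, input shapes, and default sizes are illustrative assumptions for this sketch, not the model's actual implementation.

```python
# Hypothetical sketch of spatial-aware self-attention: standard multi-head
# attention whose scores are shifted by learnable biases indexed by the
# relative 1D token distance and the relative 2D (x, y) distance between
# text blocks. Shapes and hyperparameters are assumptions, not the original.
import torch
import torch.nn as nn
import torch.nn.functional as F


class SpatialAwareSelfAttention(nn.Module):
    def __init__(self, hidden_size=768, num_heads=12,
                 max_rel_1d=128, max_rel_2d=64):
        super().__init__()
        self.num_heads = num_heads
        self.head_dim = hidden_size // num_heads
        self.qkv = nn.Linear(hidden_size, hidden_size * 3)
        # Bias tables indexed by clipped relative positions, one bias per head.
        self.rel_1d_bias = nn.Embedding(2 * max_rel_1d + 1, num_heads)
        self.rel_x_bias = nn.Embedding(2 * max_rel_2d + 1, num_heads)
        self.rel_y_bias = nn.Embedding(2 * max_rel_2d + 1, num_heads)
        self.max_rel_1d = max_rel_1d
        self.max_rel_2d = max_rel_2d

    def _rel_bias(self, coords, table, max_rel):
        # coords: (batch, seq) integer positions -> (batch, seq, seq, heads)
        rel = coords[:, :, None] - coords[:, None, :]
        rel = rel.clamp(-max_rel, max_rel) + max_rel
        return table(rel)

    def forward(self, hidden_states, positions_1d, boxes_xy):
        # hidden_states: (batch, seq, hidden)
        # positions_1d:  (batch, seq) token indices (long)
        # boxes_xy:      (batch, seq, 2) quantized (x, y) centers of text blocks (long)
        b, n, _ = hidden_states.shape
        q, k, v = self.qkv(hidden_states).chunk(3, dim=-1)
        q = q.view(b, n, self.num_heads, self.head_dim).transpose(1, 2)
        k = k.view(b, n, self.num_heads, self.head_dim).transpose(1, 2)
        v = v.view(b, n, self.num_heads, self.head_dim).transpose(1, 2)

        scores = q @ k.transpose(-2, -1) / self.head_dim ** 0.5  # (b, h, n, n)

        # Sum of 1D and 2D relative position biases, moved to (b, h, n, n).
        bias = (self._rel_bias(positions_1d, self.rel_1d_bias, self.max_rel_1d)
                + self._rel_bias(boxes_xy[..., 0], self.rel_x_bias, self.max_rel_2d)
                + self._rel_bias(boxes_xy[..., 1], self.rel_y_bias, self.max_rel_2d))
        attn = F.softmax(scores + bias.permute(0, 3, 1, 2), dim=-1)

        out = (attn @ v).transpose(1, 2).reshape(b, n, -1)
        return out
```

Because the biases depend only on relative offsets, attention between two tokens reflects how far apart their text blocks are on the page, regardless of their absolute positions.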