Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
NA is a pixel-wise operation, localizing self attention (SA) to the nearest neighboring pixels, and therefore enjoys a
linear time and space complexity compared to the quadratic complexity of SA.