This paper introduces a new attention layer, based on convolution operations, that captures both local and distant relationships.