Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
Pre-trained models with a differentiable access mechanism to explicit nonparametric
memory can overcome this issue, but have so far been only investigated for extractive downstream tasks.