Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
Perceiver aims to solve this issue by, instead of performing self-attention on the inputs, perform it on a set
of latent variables, and only use the inputs for cross-attention.