Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
After cross-attention, one still has a tensor of shape (batch_size,
2048, 768).