Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
This is a tensor of shape
(batch_size, num_queries, d_model), with num_queries typically set to 100 and initialized with zeros.