Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
DETR adds position embeddings to the hidden states at each self-attention and cross-attention layer before projecting
to queries and keys.