DETR consists of a convolutional backbone followed by an encoder-decoder Transformer, which can be trained end-to-end for object detection.
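
To make the data flow concrete, here is a minimal sketch that traces tensor shapes through the two stages described above: the convolutional backbone, then the encoder-decoder Transformer with its prediction heads. It is not the actual implementation. The function name `trace_detr_shapes` and the class `DetrShapes` are hypothetical, and the default values (stride 32, 2048 backbone channels, hidden size 256, 100 object queries, 91 classes) are assumptions taken from a commonly used ResNet-50 configuration, not from the text above.

```python
import math
from dataclasses import dataclass


@dataclass
class DetrShapes:
    feature_map: tuple     # (channels, height, width) after the CNN backbone
    encoder_tokens: tuple  # (sequence_length, d_model) fed to the Transformer encoder
    decoder_output: tuple  # (num_queries, d_model) produced by the Transformer decoder
    class_logits: tuple    # (num_queries, num_classes + 1); extra slot means "no object"
    boxes: tuple           # (num_queries, 4); one box per query


def trace_detr_shapes(height, width, *, stride=32, backbone_channels=2048,
                      d_model=256, num_queries=100, num_classes=91):
    """Trace per-image shapes through a DETR-style model.

    All defaults are illustrative assumptions, not values fixed by the architecture.
    """
    # 1. Convolutional backbone: downsamples the image by `stride`
    #    (assumed to round up, as padded strided convolutions do).
    feat_h = math.ceil(height / stride)
    feat_w = math.ceil(width / stride)
    feature_map = (backbone_channels, feat_h, feat_w)

    # 2. Project channels to d_model and flatten the spatial grid into a
    #    sequence, so each feature-map location becomes one encoder token.
    encoder_tokens = (feat_h * feat_w, d_model)

    # 3. The decoder attends to the encoder output from a fixed set of
    #    object queries, yielding one embedding per query.
    decoder_output = (num_queries, d_model)

    # 4. Prediction heads: a class distribution and a box for every query.
    class_logits = (num_queries, num_classes + 1)
    boxes = (num_queries, 4)

    return DetrShapes(feature_map, encoder_tokens, decoder_output,
                      class_logits, boxes)


shapes = trace_detr_shapes(480, 640)
print(shapes.feature_map)     # (2048, 15, 20)
print(shapes.encoder_tokens)  # (300, 256)
```

Because every stage is differentiable and the number of predictions is fixed by the number of queries, a single loss on `class_logits` and `boxes` can in principle drive gradients back through both the Transformer and the backbone, which is what end-to-end training refers to here.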