Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
The Convolutional vision Transformer (CvT) improves the Vision Transformer (ViT) in performance and efficiency by introducing convolutions into ViT to yield the best of both designs.