Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
Pre-trained on
ImageNet-22k, our CvT-W24 obtains a top-1 accuracy of 87.7\% on the ImageNet-1k val set.