Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
We show that this reliance on CNNs is not necessary and a pure transformer applied directly to
sequences of image patches can perform very well on image classification tasks.