Following BERT developed in the natural language processing area, we propose a masked image
modeling task to pretrain vision Transformers.