Our analysis of the scaling properties of this setup shows that increasing both the amount of image-level pre-training and the model size yields consistent improvements on the downstream detection task.