We are also competitive with self-supervised benchmarks on ImageNet when substituting pixels for a VQVAE encoding, achieving 69.0%
top-1 accuracy on a linear probe of our features.
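
For readers unfamiliar with the term, a linear probe trains only a linear classifier on top of frozen features and reports its accuracy. The following is a minimal sketch of that idea, not the evaluation setup used for the 69.0% figure: the features are synthetic two-dimensional stand-ins rather than model activations, and the names `make_toy_features`, `train_linear_probe`, and `top1_accuracy` are hypothetical helpers introduced only for illustration.

```python
import math
import random


def make_toy_features(seed=0, per_class=20):
    """Synthetic stand-in for frozen features (assumption: two well-separated classes)."""
    rng = random.Random(seed)
    features, labels = [], []
    for label, center in enumerate([(-2.0, -2.0), (2.0, 2.0)]):
        for _ in range(per_class):
            features.append([rng.gauss(c, 0.5) for c in center])
            labels.append(label)
    return features, labels


def _logits(weights, biases, x):
    # One linear score per class: w . x + b
    return [sum(wj * xj for wj, xj in zip(w, x)) + b for w, b in zip(weights, biases)]


def train_linear_probe(features, labels, num_classes, lr=0.5, epochs=200):
    """Fit a softmax classifier by full-batch gradient descent; features stay fixed."""
    dim = len(features[0])
    n = len(features)
    weights = [[0.0] * dim for _ in range(num_classes)]
    biases = [0.0] * num_classes
    for _ in range(epochs):
        grad_w = [[0.0] * dim for _ in range(num_classes)]
        grad_b = [0.0] * num_classes
        for x, y in zip(features, labels):
            logits = _logits(weights, biases, x)
            peak = max(logits)  # subtract the max for numerical stability
            exps = [math.exp(v - peak) for v in logits]
            total = sum(exps)
            for c in range(num_classes):
                # Gradient of cross-entropy w.r.t. logit c: softmax prob minus one-hot target
                err = exps[c] / total - (1.0 if c == y else 0.0)
                grad_b[c] += err
                for j in range(dim):
                    grad_w[c][j] += err * x[j]
        for c in range(num_classes):
            biases[c] -= lr * grad_b[c] / n
            for j in range(dim):
                weights[c][j] -= lr * grad_w[c][j] / n
    return weights, biases


def top1_accuracy(weights, biases, features, labels):
    """Fraction of examples whose highest-scoring class matches the label."""
    correct = 0
    for x, y in zip(features, labels):
        logits = _logits(weights, biases, x)
        predicted = max(range(len(logits)), key=logits.__getitem__)
        correct += int(predicted == y)
    return correct / len(labels)
```

In a real evaluation the feature vectors would be extracted once from the frozen model and the probe would be scored on a held-out split; here the same toy data is used for both purely to keep the sketch short.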