Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
We study
the performance of this approach by benchmarking on over 30 different existing computer vision datasets, spanning tasks
such as OCR, action recognition in videos, geo-localization, and many types of fine-grained object classification.