While representation learning in NLP has transitioned to training on raw text without human annotations, visual and vision-language representations still rely heavily on curated training datasets that are expensive or require expert knowledge.