CLIP (Contrastive Language-Image Pre-Training) is a neural network trained on a variety of (image, text) pairs.
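As a rough illustration of what "contrastive" training on (image, text) pairs can look like, the sketch below scores every image embedding against every text embedding in a batch and penalizes the model when a matched pair is not the highest-scoring one. It is a minimal sketch under stated assumptions, not the reference implementation: the function names, the toy 2-dimensional embeddings, and the `temperature` value of 0.07 are illustrative choices that do not come from the text above.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def similarity_matrix(image_embs, text_embs):
    """Cosine similarity between every image (rows) and every text (columns)."""
    imgs = [l2_normalize(v) for v in image_embs]
    txts = [l2_normalize(v) for v in text_embs]
    return [[sum(a * b for a, b in zip(i, t)) for t in txts] for i in imgs]


def cross_entropy_diagonal(logits):
    """Mean cross-entropy where the correct class for row i is column i."""
    total = 0.0
    for i, row in enumerate(logits):
        m = max(row)  # subtract the max for numerical stability
        log_sum = m + math.log(sum(math.exp(x - m) for x in row))
        total += log_sum - row[i]
    return total / len(logits)


def contrastive_loss(image_embs, text_embs, temperature=0.07):
    """Symmetric loss: image-to-text and text-to-image, averaged.

    temperature is an assumed, illustrative value.
    """
    sims = similarity_matrix(image_embs, text_embs)
    logits = [[s / temperature for s in row] for row in sims]
    transposed = [list(col) for col in zip(*logits)]
    return 0.5 * (cross_entropy_diagonal(logits) + cross_entropy_diagonal(transposed))


# Hypothetical toy batch: pair i of images matches pair i of texts.
aligned = contrastive_loss([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]])
shuffled = contrastive_loss([[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]])
print(aligned < shuffled)
```

In this toy batch the loss is small when each image embedding points the same way as its paired text embedding and large when the pairs are swapped, which is the behavior a contrastive objective is meant to reward.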