[OwlViTProcessor] wraps [OwlViTImageProcessor] and [CLIPTokenizer] into a single instance to both encode the text and prepare the images.
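The wrapping described above can be illustrated with a small, dependency-free sketch. It is not the transformers implementation: `ToyTokenizer`, `ToyImageProcessor`, and `ToyProcessor` are hypothetical stand-ins, and the word-to-id mapping and pixel scaling are assumptions chosen only to show how one object can delegate to two components and merge their outputs.

```python
# Toy sketch of the wrapper pattern. All classes here are hypothetical
# stand-ins for illustration, not the transformers implementation.


class ToyTokenizer:
    """Stand-in for a text tokenizer: maps words to integer ids."""

    def __init__(self):
        self.vocab = {}

    def __call__(self, texts):
        input_ids = []
        for text in texts:
            ids = []
            for word in text.lower().split():
                # Assign the next free id the first time a word is seen.
                ids.append(self.vocab.setdefault(word, len(self.vocab)))
            input_ids.append(ids)
        return {"input_ids": input_ids}


class ToyImageProcessor:
    """Stand-in for an image processor: scales 8-bit pixels to [0, 1]."""

    def __call__(self, images):
        pixel_values = [
            [[pixel / 255.0 for pixel in row] for row in image]
            for image in images
        ]
        return {"pixel_values": pixel_values}


class ToyProcessor:
    """Holds both components and exposes one call for text and images."""

    def __init__(self, image_processor, tokenizer):
        self.image_processor = image_processor
        self.tokenizer = tokenizer

    def __call__(self, text=None, images=None):
        if text is None and images is None:
            raise ValueError("Provide text, images, or both.")
        encoding = {}
        if text is not None:
            # Text goes to the tokenizer.
            encoding.update(self.tokenizer(text))
        if images is not None:
            # Images go to the image processor.
            encoding.update(self.image_processor(images))
        return encoding


processor = ToyProcessor(ToyImageProcessor(), ToyTokenizer())
batch = processor(
    text=["a photo of a cat", "a photo of a dog"],
    images=[[[0, 255], [51, 102]]],  # one 2x2 grayscale image
)
print(sorted(batch))
```

With the real library, the analogous call is typically made on an instance loaded through `OwlViTProcessor.from_pretrained(...)`, for example `processor(text=texts, images=image, return_tensors="pt")`, which returns the tokenized text and the prepared image tensors in a single encoding.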