File size: 76 Bytes
5fa1a76
1
A linear encoder is added to create multimodal embeddings from image inputs.