Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
Usage example
ALIGN uses EfficientNet to get visual features and BERT to get the text features.