File size: 54 Bytes
5fa1a76
1
VisualBERT is a multi-modal vision and language model.