Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
Importantly, predicted targets for pre-training are contextualized latent representations of the inputs, rather than modality-specific, context-independent targets.