Lastly, we demonstrate detailed ablation studies to prove that both our novel
model components and pretraining strategies significantly contribute to our strong results; and also present several
attention visualizations for the different encoders.
This model was contributed by eltoto1219.