Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
We find that BERT was significantly undertrained, and can match or exceed the performance of every
model published after it.