Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
MEGA's compute efficiency allows it to scale to very long sequences, making it an
attractive option for long-document NLP tasks.