Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
It has 40% less parameters than
google-bert/bert-base-uncased, runs 60% faster while preserving over 95% of BERT's performances as measured on the GLUE language
understanding benchmark.