Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
Training is computationally expensive, often done on private datasets of different sizes,
and, as we will show, hyperparameter choices have significant impact on the final results.