Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
Meaning you don't have to care
about how many forward passes you inputs are actually going to trigger, you can optimize the batch_size
independently of the inputs.