It's more efficient to dynamically pad the sentences to the longest length in a batch during collation, instead of padding the whole dataset to the maximum length.
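The difference can be sketched with a small collation function in plain Python. This is only an illustration under stated assumptions: `collate_dynamic`, `PAD_ID`, and the toy token ids are hypothetical names and values, not part of any library.

```python
from typing import Dict, List

PAD_ID = 0  # assumed padding token id; real tokenizers define their own


def collate_dynamic(batch: List[List[int]], pad_id: int = PAD_ID) -> Dict[str, List[List[int]]]:
    """Pad every sequence in `batch` to the longest sequence in that batch."""
    max_len = max(len(ids) for ids in batch)
    input_ids = [ids + [pad_id] * (max_len - len(ids)) for ids in batch]
    # 1 marks a real token, 0 marks padding.
    attention_mask = [[1] * len(ids) + [0] * (max_len - len(ids)) for ids in batch]
    return {"input_ids": input_ids, "attention_mask": attention_mask}


# Toy token-id sequences of varying length (made-up values).
dataset = [[5, 6], [7, 8, 9], [1], [2, 3, 4, 5, 6, 7, 8, 9]]

# Dynamic padding: split into batches of 2 and pad each batch separately.
batches = [dataset[i:i + 2] for i in range(0, len(dataset), 2)]
padded = [collate_dynamic(b) for b in batches]
dynamic_tokens = sum(len(row) for p in padded for row in p["input_ids"])

# Static padding: every sequence is padded to the longest one in the dataset.
global_max = max(len(ids) for ids in dataset)
static_tokens = global_max * len(dataset)

print(dynamic_tokens, static_tokens)
```

On this toy data, per-batch padding processes fewer token positions than padding everything to the global maximum, because only the batch containing the longest sequence pays for its length. In the Transformers library, `DataCollatorWithPadding` fills this role when passed as the data collator.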