Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
Model predictions are intended to be identical to the original implementation when
forced_bos_token_id=0.