Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
Here we have the loss since we passed along labels, but we don't have
hidden_states and attentions because we didn't pass output_hidden_states=True or
output_attentions=True.