Typically, learning rates of 1e-4 and 3e-4 work well for most problems (classification, summarization,
translation, question answering, question generation).
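
As a minimal sketch, assuming the two values above are learning rates, one way to try both is to build one training configuration per candidate. The helper `build_training_configs` and the base settings (`num_train_epochs`, `per_device_train_batch_size`) are hypothetical and only illustrative; only the values 1e-4 and 3e-4 come from the text.

```python
# Candidate values taken from the text above (assumed to be learning rates).
CANDIDATE_LEARNING_RATES = [1e-4, 3e-4]


def build_training_configs(base_config, learning_rates=CANDIDATE_LEARNING_RATES):
    """Return one config dict per candidate learning rate (hypothetical helper).

    The base config is copied, so the caller's dict is left unchanged.
    """
    return [{**base_config, "learning_rate": lr} for lr in learning_rates]


# Hypothetical base settings; they are placeholders, not recommendations.
base = {"num_train_epochs": 3, "per_device_train_batch_size": 8}
configs = build_training_configs(base)

for cfg in configs:
    print(cfg)
```

Each resulting dictionary can then be passed to whatever training routine is in use, and the runs compared on a held-out validation set to pick between the two values.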