Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
There's also this blog post which explains how
generation works in general in encoder-decoder models.