Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
In this way, the time and memory requirements don't
depend on the length of the inputs anymore, as one uses a fixed amount of latent variables, like 256 or 512.