Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
Many of the [Trainer]'s method can be subclassed and overridden to support the functionality you want, without having to rewrite the entire training loop from scratch to accommodate it.