Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
There is also the [~trl.SFTTrainer] class from the TRL library which wraps the [Trainer] class and is optimized for training language models like Llama-2 and Mistral with autoregressive techniques.