Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
from transformers import TrainingArguments, Trainer
training_args = TrainingArguments(, neftune_noise_alpha=0.1)
trainer = Trainer(, args=training_args)
NEFTune is disabled after training to restore the original embedding layer to avoid any unexpected behavior.