Ahmadzei's picture
added 3 more tables for large emb model
5fa1a76
Load DistilGPT2 with [AutoModelForCausalLM]:
from transformers import AutoModelForCausalLM, TrainingArguments, Trainer
model = AutoModelForCausalLM.from_pretrained("distilbert/distilgpt2")
At this point, only three steps remain:
Define your training hyperparameters in [TrainingArguments].