Load DistilGPT2 with [AutoModelForCausalLM]: | |
from transformers import AutoModelForCausalLM, TrainingArguments, Trainer | |
model = AutoModelForCausalLM.from_pretrained("distilbert/distilgpt2") | |
At this point, only three steps remain: | |
Define your training hyperparameters in [TrainingArguments]. |