In this case, you should remove overwrite_output_dir: | |
python examples/pytorch/summarization/run_summarization.py | |
--model_name_or_path google-t5/t5-small \ | |
--do_train \ | |
--do_eval \ | |
--dataset_name cnn_dailymail \ | |
--dataset_config "3.0.0" \ | |
--source_prefix "summarize: " \ | |
--output_dir /tmp/tst-summarization \ | |
--per_device_train_batch_size=4 \ | |
--per_device_eval_batch_size=4 \ | |
--output_dir previous_output_dir \ | |
--predict_with_generate | |
The second method uses the resume_from_checkpoint path_to_specific_checkpoint argument to resume training from a specific checkpoint folder. |