fangwu97
/

Qwen2.5-0.5B-Instruct-GRPO-test

Generated from Trainer

Model card Files Files and versions Community

Qwen2.5-0.5B-Instruct-GRPO-test

Ctrl+K

Ctrl+K

1 contributor

History: 2 commits

fangwu97's picture

Training in progress, step 10

db341c8 verified 19 days ago

.gitattributes

1.57 kB

Training in progress, step 10 19 days ago
README.md

2.38 kB

Training in progress, step 10 19 days ago
adapter_config.json

721 Bytes

Training in progress, step 10 19 days ago
adapter_model.safetensors

1.09 MB
LFS

Training in progress, step 10 19 days ago
added_tokens.json

605 Bytes

Training in progress, step 10 19 days ago
merges.txt

1.67 MB

Training in progress, step 10 19 days ago
special_tokens_map.json

613 Bytes

Training in progress, step 10 19 days ago
tokenizer.json

11.4 MB
LFS

Training in progress, step 10 19 days ago
tokenizer_config.json

7.36 kB

Training in progress, step 10 19 days ago
training_args.bin
Detected Pickle imports (14)
- "torch.device",
- "transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig",
- "transformers.trainer_pt_utils.AcceleratorConfig",
- "transformers.integrations.deepspeed.HfDeepSpeedConfig",
- "accelerate.utils.dataclasses.DeepSpeedPlugin",
- "transformers.trainer_utils.SchedulerType",
- "accelerate.state.PartialState",
- "transformers.trainer_utils.HubStrategy",
- "torch.bfloat16",
- "accelerate.utils.dataclasses.DistributedType",
- "grpo_config.GRPOConfig",
- "transformers.training_args.OptimizerNames",
- "transformers.trainer_utils.SaveStrategy",
- "transformers.trainer_utils.IntervalStrategy"
How to fix it?
7.67 kB
LFS

Training in progress, step 10 19 days ago
vocab.json

2.78 MB

Training in progress, step 10 19 days ago