DeepSeek-R1-Distill-Qwen-7B-GRPO / model-00002-of-00004.safetensors

Commit History

Training in progress, step 500
4b1ddcc
verified

yolay commited on

Training in progress, step 300
6a82344
verified

yolay commited on

Training in progress, step 200
00becc3
verified

yolay commited on

Training in progress, step 100
4c4cfec
verified

yolay commited on