yolay
/

DeepSeek-R1-Distill-Qwen-7B-GRPO

Model card Files Files and versions Community

DeepSeek-R1-Distill-Qwen-7B-GRPO / model-00002-of-00004.safetensors

Commit History

Training in progress, step 500

4b1ddcc
verified

yolay commited on Feb 13

Training in progress, step 300

6a82344
verified

yolay commited on Feb 12

Training in progress, step 200

00becc3
verified

yolay commited on Feb 12

Training in progress, step 100

4c4cfec
verified

yolay commited on Feb 11