Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
yolay
/
DeepSeek-R1-Distill-Qwen-7B-GRPO
like
0
Safetensors
qwen2
Model card
Files
Files and versions
Community
main
DeepSeek-R1-Distill-Qwen-7B-GRPO
/
model-00002-of-00004.safetensors
Commit History
Training in progress, step 500
4b1ddcc
verified
yolay
commited on
Feb 13
Training in progress, step 300
6a82344
verified
yolay
commited on
Feb 12
Training in progress, step 200
00becc3
verified
yolay
commited on
Feb 12
Training in progress, step 100
4c4cfec
verified
yolay
commited on
Feb 11