Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
erayalp
/
qwen2.5-0.5b-instruct-GRPO-v3-tr-math-gsm8k
like
0
Text Generation
Transformers
Safetensors
ytu-ce-cosmos/gsm8k_tr
Turkish
English
group-relative-policy-optimization
reinforcement-learning
curriculum-learning
math
supervised-fine-tuning
reasoning
turkish
conversational
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
qwen2.5-0.5b-instruct-GRPO-v3-tr-math-gsm8k
/
.gitattributes
Commit History
Upload folder using huggingface_hub
6702d07
verified
erayalp
commited on
May 20
initial commit
df807f7
verified
erayalp
commited on
May 20