Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
jbrinkma
/
Qwen2-0.5B-GRPO-test
like
0
PEFT
Safetensors
trl
grpo
Generated from Trainer
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Use this model
main
Qwen2-0.5B-GRPO-test
Commit History
Model save
1129d29
verified
jbrinkma
commited on
Jun 26
Training in progress, step 110
c173bbb
verified
jbrinkma
commited on
Jun 26
Training in progress, step 100
79d5086
verified
jbrinkma
commited on
Jun 26
Training in progress, step 90
b217fbe
verified
jbrinkma
commited on
Jun 26
Training in progress, step 80
1356447
verified
jbrinkma
commited on
Jun 26
Training in progress, step 70
6acc44f
verified
jbrinkma
commited on
Jun 26
Training in progress, step 60
c62437a
verified
jbrinkma
commited on
Jun 26
Training in progress, step 50
ebf9a96
verified
jbrinkma
commited on
Jun 26
Training in progress, step 40
b725023
verified
jbrinkma
commited on
Jun 26
Training in progress, step 30
0c39f35
verified
jbrinkma
commited on
Jun 26
Training in progress, step 20
4dfefd1
verified
jbrinkma
commited on
Jun 26
Training in progress, step 10
a181ab7
verified
jbrinkma
commited on
Jun 26
initial commit
023fa8f
verified
jbrinkma
commited on
Jun 26