Shahradmz/Qwen2-0.5B-Reward_debug_mas
Text Classification
Transformers
Safetensors
continual_data_debug_MAS_1
qwen2
Generated from Trainer
trl
reward-trainer
text-generation-inference
main
Qwen2-0.5B-Reward_debug_mas/model.safetensors
Commit History
Training in progress, step 12
82d3dea
verified
Shahradmz
committed on
Mar 19
Training in progress, step 12
7d2d6cf
verified
Shahradmz
committed on
Mar 19
Training in progress, step 12
a2568f0
verified
Shahradmz
committed on
Mar 19
Training in progress, step 12
a6771d4
verified
Shahradmz
committed on
Mar 19
Training in progress, step 12
af48a5d
verified
Shahradmz
committed on
Mar 19
Training in progress, step 12
f34fdd6
verified
Shahradmz
committed on
Mar 19
Training in progress, step 12
15a2699
verified
Shahradmz
committed on
Mar 19
Training in progress, step 12
ce00c24
verified
Shahradmz
committed on
Mar 19
Training in progress, step 12
4e68b85
verified
Shahradmz
committed on
Mar 19
Training in progress, step 12
f56e4b3
verified
Shahradmz
committed on
Mar 19
Training in progress, step 12
fdbedbd
verified
Shahradmz
committed on
Mar 19
Training in progress, step 12
cdb30ea
verified
Shahradmz
committed on
Mar 19