Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
chloeli
/
qwen-2.5-0.5B-instruct-sft-lora-countdown-search-1k-old
like
0
Text Generation
Transformers
Safetensors
chloeli/stream-of-search-countdown-10k
qwen2
Generated from Trainer
alignment-handbook
trl
sft
conversational
text-generation-inference
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
main
qwen-2.5-0.5B-instruct-sft-lora-countdown-search-1k-old
Commit History
End of training
aa13ba8
verified
chloeli
commited on
Mar 24
Model save
fb5dc30
verified
chloeli
commited on
Mar 24
Training in progress, step 125
6f8da50
verified
chloeli
commited on
Mar 24
Training in progress, step 100
c0f55eb
verified
chloeli
commited on
Mar 24
initial commit
ad61c14
verified
chloeli
commited on
Mar 24