Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
HanningZhang
/
Qwen-7B-grpo-plusplus-nocliphigher-sample1n8-sample8-filter1.0-insufficient0.0-a0.001-b2.0-iter8
like
0
Text Generation
Transformers
Safetensors
qwen2
conversational
text-generation-inference
arxiv:
1910.09700
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Qwen-7B-grpo-plusplus-nocliphigher-sample1n8-sample8-filter1.0-insufficient0.0-a0.001-b2.0-iter8
Commit History
Upload tokenizer
fb17e54
verified
HanningZhang
commited on
Apr 20
Upload Qwen2ForCausalLM
9222bf2
verified
HanningZhang
commited on
Apr 20
initial commit
31eddc8
verified
HanningZhang
commited on
Apr 20