TIGER-Lab
/

AceCoder-Qwen2.5-7B-Ins-Rule

Model card Files Files and versions Community

DongfuJiang commited on Feb 4

Commit

c0525a0

·

verified ·

1 Parent(s): b4bd9f3

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -28,7 +28,7 @@ We introduce AceCoder, the first work to propose a fully automated pipeline for
 ## Note
-- **This model official is trained on the hard version of [TIGER-Lab/AceCode-89K](https://huggingface.co/datasets/TIGER-Lab/AceCode-89K) with about 22k examples, using the binary pass rate (rule based reward) as the reward.**
 - You can reproduce the hard version of [TIGER-Lab/AceCode-89K](https://huggingface.co/datasets/TIGER-Lab/AceCode-89K) using [script in our Github](#)
 - The training takes 6 hours to finish on 8 x H100 GPUs in around 80 optimization steps.
 - To reproduce the training, please refer to our [training script in the Github](#)

 ## Note
+- **This model is trained on the hard version of [TIGER-Lab/AceCode-89K](https://huggingface.co/datasets/TIGER-Lab/AceCode-89K) with about 22k examples, using the binary pass rate (rule based reward) as the reward.**
 - You can reproduce the hard version of [TIGER-Lab/AceCode-89K](https://huggingface.co/datasets/TIGER-Lab/AceCode-89K) using [script in our Github](#)
 - The training takes 6 hours to finish on 8 x H100 GPUs in around 80 optimization steps.
 - To reproduce the training, please refer to our [training script in the Github](#)