DongfuJiang commited on
Commit
c0525a0
·
verified ·
1 Parent(s): b4bd9f3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -28,7 +28,7 @@ We introduce AceCoder, the first work to propose a fully automated pipeline for
28
 
29
 
30
  ## Note
31
- - **This model official is trained on the hard version of [TIGER-Lab/AceCode-89K](https://huggingface.co/datasets/TIGER-Lab/AceCode-89K) with about 22k examples, using the binary pass rate (rule based reward) as the reward.**
32
  - You can reproduce the hard version of [TIGER-Lab/AceCode-89K](https://huggingface.co/datasets/TIGER-Lab/AceCode-89K) using [script in our Github](#)
33
  - The training takes 6 hours to finish on 8 x H100 GPUs in around 80 optimization steps.
34
  - To reproduce the training, please refer to our [training script in the Github](#)
 
28
 
29
 
30
  ## Note
31
+ - **This model is trained on the hard version of [TIGER-Lab/AceCode-89K](https://huggingface.co/datasets/TIGER-Lab/AceCode-89K) with about 22k examples, using the binary pass rate (rule based reward) as the reward.**
32
  - You can reproduce the hard version of [TIGER-Lab/AceCode-89K](https://huggingface.co/datasets/TIGER-Lab/AceCode-89K) using [script in our Github](#)
33
  - The training takes 6 hours to finish on 8 x H100 GPUs in around 80 optimization steps.
34
  - To reproduce the training, please refer to our [training script in the Github](#)