stoshniwal commited on
Commit
7aa2c09
·
verified ·
1 Parent(s): 53d9c84

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +26 -13
README.md CHANGED
@@ -114,11 +114,12 @@ outputs = pipeline(
114
  print(outputs[0]["generated_text"][-1]['content'])
115
  ````
116
 
 
 
 
117
  To use the model with GenSelect inference, we recommend following our
118
- [reference implementation in NeMo-Skills](https://github.com/NVIDIA/NeMo-Skills/blob/main/nemo_skills/pipeline/genselect.py).
119
- Alternatively, you can manually extract the summary from all solutions and use this
120
- [prompt](https://github.com/NVIDIA/NeMo-Skills/blob/main/nemo_skills/prompt/config/openmath/genselect.yaml) for the math problems.
121
- We will add the prompt we used for the coding problems and a reference implementation soon!
122
 
123
  You can learn more about GenSelect in these papers:
124
  * [AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset](https://arxiv.org/abs/2504.16891)
@@ -128,21 +129,20 @@ You can learn more about GenSelect in these papers:
128
  ## Citation
129
 
130
  If you find the data useful, please cite:
 
131
  ```
132
- @article{ahmad2025opencodereasoning,
133
- title={OpenCodeReasoning: Advancing Data Distillation for Competitive Coding},
134
- author={Wasi Uddin Ahmad, Sean Narenthiran, Somshubra Majumdar, Aleksander Ficek, Siddhartha Jain, Jocelyn Huang, Vahid Noroozi, Boris Ginsburg},
 
135
  year={2025},
136
- eprint={2504.01943},
137
- archivePrefix={arXiv},
138
- primaryClass={cs.CL},
139
- url={https://arxiv.org/abs/2504.01943},
140
  }
141
  ```
142
 
143
  ```
144
  @misc{ahmad2025opencodereasoningiisimpletesttime,
145
- title={OpenCodeReasoning-II: A Simple Test Time Scaling Approach via Self-Critique},
146
  author={Wasi Uddin Ahmad and Somshubra Majumdar and Aleksander Ficek and Sean Narenthiran and Mehrzad Samadi and Jocelyn Huang and Siddhartha Jain and Vahid Noroozi and Boris Ginsburg},
147
  year={2025},
148
  eprint={2507.09075},
@@ -154,7 +154,7 @@ If you find the data useful, please cite:
154
 
155
  ```
156
  @misc{moshkov2025aimo2winningsolutionbuilding,
157
- title={AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset},
158
  author={Ivan Moshkov and Darragh Hanley and Ivan Sorokin and Shubham Toshniwal and Christof Henkel and Benedikt Schifferer and Wei Du and Igor Gitman},
159
  year={2025},
160
  eprint={2504.16891},
@@ -164,6 +164,19 @@ If you find the data useful, please cite:
164
  }
165
  ```
166
 
 
 
 
 
 
 
 
 
 
 
 
 
 
167
  ## Additional Information:
168
 
169
  ### Deployment Geography:
 
114
  print(outputs[0]["generated_text"][-1]['content'])
115
  ````
116
 
117
+ We have added [a simple transformer-based script](https://huggingface.co/nvidia/OpenReasoning-Nemotron-32B/blob/main/genselect_hf.py) in this repo to illustrate GenSelect.
118
+ To learn how to use the models in GenSelect mode with NeMo-Skills, see our [documentation](https://nvidia.github.io/NeMo-Skills/releases/openreasoning/evaluation/).
119
+
120
  To use the model with GenSelect inference, we recommend following our
121
+ [reference implementation in NeMo-Skills](https://github.com/NVIDIA/NeMo-Skills/blob/main/nemo_skills/pipeline/genselect.py). Alternatively, you can manually extract the summary from all solutions and use this
122
+ [prompt](https://github.com/NVIDIA/NeMo-Skills/blob/main/nemo_skills/prompt/config/openmath/genselect.yaml) for the math problems. We will add the prompt we used for the coding problems and a reference implementation soon!
 
 
123
 
124
  You can learn more about GenSelect in these papers:
125
  * [AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset](https://arxiv.org/abs/2504.16891)
 
129
  ## Citation
130
 
131
  If you find the data useful, please cite:
132
+
133
  ```
134
+ @inproceedings{toshniwal2025genselect,
135
+ title={{GenSelect: A Generative Approach to Best-of-N}},
136
+ author={Shubham Toshniwal and Ivan Sorokin and Aleksander Ficek and Ivan Moshkov and Igor Gitman},
137
+ booktitle={2nd AI for Math Workshop @ ICML 2025},
138
  year={2025},
139
+ url={https://openreview.net/forum?id=8LhnmNmUDb}
 
 
 
140
  }
141
  ```
142
 
143
  ```
144
  @misc{ahmad2025opencodereasoningiisimpletesttime,
145
+ title={{OpenCodeReasoning-II: A Simple Test Time Scaling Approach via Self-Critique}},
146
  author={Wasi Uddin Ahmad and Somshubra Majumdar and Aleksander Ficek and Sean Narenthiran and Mehrzad Samadi and Jocelyn Huang and Siddhartha Jain and Vahid Noroozi and Boris Ginsburg},
147
  year={2025},
148
  eprint={2507.09075},
 
154
 
155
  ```
156
  @misc{moshkov2025aimo2winningsolutionbuilding,
157
+ title={{AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset}},
158
  author={Ivan Moshkov and Darragh Hanley and Ivan Sorokin and Shubham Toshniwal and Christof Henkel and Benedikt Schifferer and Wei Du and Igor Gitman},
159
  year={2025},
160
  eprint={2504.16891},
 
164
  }
165
  ```
166
 
167
+ ```
168
+ @article{ahmad2025opencodereasoning,
169
+ title={{OpenCodeReasoning: Advancing Data Distillation for Competitive Coding}},
170
+ author={Wasi Uddin Ahmad, Sean Narenthiran, Somshubra Majumdar, Aleksander Ficek, Siddhartha Jain, Jocelyn Huang, Vahid Noroozi, Boris Ginsburg},
171
+ year={2025},
172
+ eprint={2504.01943},
173
+ archivePrefix={arXiv},
174
+ primaryClass={cs.CL},
175
+ url={https://arxiv.org/abs/2504.01943},
176
+ }
177
+ ```
178
+
179
+
180
  ## Additional Information:
181
 
182
  ### Deployment Geography: