add link to technical report
README.md
CHANGED
@@ -50,6 +50,7 @@ Developers designing AI Agent systems, chatbots, RAG systems, and other AI-power
 
 ## References
 
+- [\[2505.00949\] Llama-Nemotron: Efficient Reasoning Models](https://arxiv.org/abs/2505.00949)
 - [\[2502.00203\] Reward-aware Preference Optimization: A Unified Mathematical Framework for Model Alignment](https://arxiv.org/abs/2502.00203)
 
 