sunblaze-ucb/Qwen2.5-1.5B-Intuitor-MATH-1EPOCH · Improve model card: Add transformers library, expand description, links, and usage

nielsr

14 days ago

This PR significantly enhances the model card for Qwen2.5-1.5B-Intuitor-MATH-1EPOCH by:

Adding library_name: transformers to the metadata, enabling direct usage via the Transformers library and ensuring the "how to use" button appears correctly.
Integrating a detailed overview of the "Intuitor" method and the "Reinforcement Learning from Internal Feedback (RLIF)" framework, including key highlights and relevant figures from the paper's GitHub repository.
Including a direct link to the official GitHub repository (https://github.com/sunblaze-ucb/rlif), providing easy access to the source code and further resources.
Adding a comprehensive sample usage snippet for text generation with the transformers library, demonstrating how to interact with the model using its chat template.
Incorporating benchmarks and results from the paper, giving users insight into the model's performance on various reasoning and generation tasks.
Adding a direct link to the paper on the Hugging Face Papers platform (https://huggingface.co/papers/2505.19590).

These updates aim to provide a more complete, accurate, and user-friendly model card for the community.

Xuandong changed pull request status to merged 14 days ago