Improve model card: Add transformers library, expand description, links, and usage
#1
by
nielsr
HF Staff
- opened
This PR significantly enhances the model card for Qwen2.5-1.5B-Intuitor-MATH-1EPOCH
by:
- Adding
library_name: transformers
to the metadata, enabling direct usage via the Transformers library and ensuring the "how to use" button appears correctly. - Integrating a detailed overview of the "Intuitor" method and the "Reinforcement Learning from Internal Feedback (RLIF)" framework, including key highlights and relevant figures from the paper's GitHub repository.
- Including a direct link to the official GitHub repository (
https://github.com/sunblaze-ucb/rlif
), providing easy access to the source code and further resources. - Adding a comprehensive sample usage snippet for text generation with the
transformers
library, demonstrating how to interact with the model using its chat template. - Incorporating benchmarks and results from the paper, giving users insight into the model's performance on various reasoning and generation tasks.
- Adding a direct link to the paper on the Hugging Face Papers platform (https://huggingface.co/papers/2505.19590).
These updates aim to provide a more complete, accurate, and user-friendly model card for the community.
Xuandong
changed pull request status to
merged