Improve model card: Add transformers library, expand description, links, and usage

#1
by nielsr HF Staff - opened

This PR significantly enhances the model card for Qwen2.5-1.5B-Intuitor-MATH-1EPOCH by:

  • Adding library_name: transformers to the metadata, enabling direct usage via the Transformers library and ensuring the "how to use" button appears correctly.
  • Integrating a detailed overview of the "Intuitor" method and the "Reinforcement Learning from Internal Feedback (RLIF)" framework, including key highlights and relevant figures from the paper's GitHub repository.
  • Including a direct link to the official GitHub repository (https://github.com/sunblaze-ucb/rlif), providing easy access to the source code and further resources.
  • Adding a comprehensive sample usage snippet for text generation with the transformers library, demonstrating how to interact with the model using its chat template.
  • Incorporating benchmarks and results from the paper, giving users insight into the model's performance on various reasoning and generation tasks.
  • Adding a direct link to the paper on the Hugging Face Papers platform (https://huggingface.co/papers/2505.19590).

These updates aim to provide a more complete, accurate, and user-friendly model card for the community.

Xuandong changed pull request status to merged

Sign up or log in to comment