Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
wangbing1416 's Collections
RLHF
Reasoning Papers

Reasoning Papers

updated 2 days ago
Upvote
-

  • Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization

    Paper • 2508.07629 • Published 11 days ago • 39

  • Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning

    Paper • 2508.07101 • Published 12 days ago • 13

  • Compressing Chain-of-Thought in LLMs via Step Entropy

    Paper • 2508.03346 • Published 17 days ago • 7

  • Train Long, Think Short: Curriculum Learning for Efficient Reasoning

    Paper • 2508.08940 • Published 10 days ago • 22

  • Sample More to Think Less: Group Filtered Policy Optimization for Concise Reasoning

    Paper • 2508.09726 • Published 9 days ago • 11

  • Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models

    Paper • 2508.10751 • Published 8 days ago • 24

  • Beyond Solving Math Quiz: Evaluating the Ability of Large Reasoning Models to Ask for Information

    Paper • 2508.11252 • Published 7 days ago • 3
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs