🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 21 items • Updated 6 days ago • 128
Scaling Pre-training to One Hundred Billion Data for Vision Language Models Paper • 2502.07617 • Published Feb 11 • 29
MAGA: MAssive Genre-Audience Reformulation to Pretraining Corpus Expansion Paper • 2502.04235 • Published Feb 6 • 22
Open Chinese LLM Leaderboard 🏆 Space • Display and filter LLM benchmark results • 115
Open LLM Leaderboard 🏆 Space • Track, rank and evaluate open LLMs and chatbots • 13k