LLM-Compe-2025-Camino/DeepSeek-R1-Distill-Llama-70B_exam_GRPO_step80_merged 71B • Updated 2 days ago • 8
LLM-Compe-2025-Camino/DeepSeek-R1-Distill-Llama-70B_Omni-MATH-5plus_N-CThink-QA_GRPO_step30_merged 71B • Updated 4 days ago • 23
LLM-Compe-2025-Camino/DeepSeek-R1-Distill-Llama-70B_Omni-MATH-5plus_N-CThink-QA_GRPO_step30 71B • Updated 4 days ago • 3
LLM-Compe-2025-Camino/Phi-4-reasoning-plus_Omni-MATH-5plus_N-CThink-QA_GRPO_short_answer_penalty_step420 15B • Updated 4 days ago • 112
LLM-Compe-2025-Camino/DeepSeek-Qwen14B_Omni-MATH-5plus_N-CThink-QA_short_answer_penalty_step210 15B • Updated 5 days ago • 105
LLM-Compe-2025-Camino/DeepSeek-Qwen14B_SuperGPQA-incorrect_GRPO_step140 15B • Updated 5 days ago • 27
LLM-Compe-2025-Camino/Phi-4-reasoning-plus_Omni-MATH-5plus_N-CThink-QA_GRPO_short_answer_penalty_step200 15B • Updated 6 days ago • 9
LLM-Compe-2025-Camino/Nemotron-CrossThink-QA_reasoning_Phi-4-reasoning-plus_0803_n20480_test_5k Viewer • Updated 9 days ago • 5.14k • 78