xl-zhao
/

PromptCoT-QwQ-32B

Model card Files Files and versions Community

xl-zhao commited on 16 days ago

Commit

fdabdf1

·

verified ·

1 Parent(s): 0d67319

Update README.md

Files changed (1) hide show

README.md +11 -1

README.md CHANGED Viewed

@@ -21,6 +21,16 @@ For more details, refer to our **paper on ArXiv**: [🔗 PromptCoT: Synthesizing
 ---
 ## 🔥 **Quick Start: Using the Model**
@@ -85,7 +95,7 @@ print(outputs[0].outputs[0].text)
 ---
 ## 🔗 **Full Usage & Advanced Options**
-For advanced usage, including **batch inference and evaluation on mathematical benchmarks, refer to the **full repository on GitHub**:
 🔹 [GitHub: PromptCoT](https://github.com/zhaoxlpku/PromptCoT)
 ---

 ---
+## 🏆 State-of-the-Art Performance
+**PromptCoT-QwQ-32B** has achieved remarkable results, outperforming all competitors across key benchmarks focused on mathematical reasoning:
+| **Model** | **GSM8K** | **MATH-500** | **AIME2024** | **AIME2025** |
+| --- | --- | --- | --- | --- |
+| **S1-32B** | - | 93.0% | 56.7% | 26.6% |
+| **LIMO-32B** | - | 94.8% | 57.1% | 46.6% |
+| **QwQ-32B** | - | - | 82.1% | 70.8% |
+| **PromptCoT-QwQ-32B** (**ours**) | 🔥 **96.4% ± 0.2%** | 🔥 **96.7% ± 0.5%** | 🔥 **83.8% ± 2.8%** | 🔥 **75.4% ± 4.7%** |
 ## 🔥 **Quick Start: Using the Model**
 ---
 ## 🔗 **Full Usage & Advanced Options**
+For advanced usage, including batch inference and evaluation on mathematical benchmarks, refer to the **full repository on GitHub**:
 🔹 [GitHub: PromptCoT](https://github.com/zhaoxlpku/PromptCoT)
 ---