xl-zhao commited on
Commit
fdabdf1
Β·
verified Β·
1 Parent(s): 0d67319

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -1
README.md CHANGED
@@ -21,6 +21,16 @@ For more details, refer to our **paper on ArXiv**: [πŸ”— PromptCoT: Synthesizing
21
 
22
  ---
23
 
 
 
 
 
 
 
 
 
 
 
24
 
25
  ## πŸ”₯ **Quick Start: Using the Model**
26
 
@@ -85,7 +95,7 @@ print(outputs[0].outputs[0].text)
85
  ---
86
 
87
  ## πŸ”— **Full Usage & Advanced Options**
88
- For advanced usage, including **batch inference and evaluation on mathematical benchmarks, refer to the **full repository on GitHub**:
89
  πŸ”Ή [GitHub: PromptCoT](https://github.com/zhaoxlpku/PromptCoT)
90
 
91
  ---
 
21
 
22
  ---
23
 
24
+ ## πŸ† State-of-the-Art Performance
25
+ **PromptCoT-QwQ-32B** has achieved remarkable results, outperforming all competitors across key benchmarks focused on mathematical reasoning:
26
+
27
+ | **Model** | **GSM8K** | **MATH-500** | **AIME2024** | **AIME2025** |
28
+ | --- | --- | --- | --- | --- |
29
+ | **S1-32B** | - | 93.0% | 56.7% | 26.6% |
30
+ | **LIMO-32B** | - | 94.8% | 57.1% | 46.6% |
31
+ | **QwQ-32B** | - | - | 82.1% | 70.8% |
32
+ | **PromptCoT-QwQ-32B** (**ours**) | πŸ”₯ **96.4% Β± 0.2%** | πŸ”₯ **96.7% Β± 0.5%** | πŸ”₯ **83.8% Β± 2.8%** | πŸ”₯ **75.4% Β± 4.7%** |
33
+
34
 
35
  ## πŸ”₯ **Quick Start: Using the Model**
36
 
 
95
  ---
96
 
97
  ## πŸ”— **Full Usage & Advanced Options**
98
+ For advanced usage, including batch inference and evaluation on mathematical benchmarks, refer to the **full repository on GitHub**:
99
  πŸ”Ή [GitHub: PromptCoT](https://github.com/zhaoxlpku/PromptCoT)
100
 
101
  ---