Update README.md
README.md
@@ -136,11 +136,12 @@ It also closes the gap to proprietary Claude models.
 | Qwen2.5-VL-32B | 46.5 | 87.0 | 39.4 |
 | UI-TARS-72B | 57.1 | 90.3 | 38.1 |
 | **OpenCUA-A3B** | 48.6 | 91.4 | 28.5 |
-| **OpenCUA-7B** | 45.7 | 88.5 | 23.7 |
-| **OpenCUA-
-| **OpenCUA-
+| **OpenCUA-Qwen2-7B** | 45.7 | 88.5 | 23.7 |
+| **OpenCUA-7B** | 55.3 | 92.3 | 50.0 |
+| **OpenCUA-32B** | **59.6** | **93.4** | **55.3** |
 </div>

+
 ### AgentNetBench (Offline Evaluation)
 <div align="center">

@@ -150,8 +151,8 @@ It also closes the gap to proprietary Claude models.
 | Qwen2.5-VL-32B | 66.6 | 47.2 | 41.5 | 64.8 |
 | Qwen2.5-VL-72B | 67.2 | 52.6 | 50.5 | 67.0 |
 | OpenAI CUA | 71.7 | 57.3 | **80.0** | 73.1 |
-| **OpenCUA-
-| **OpenCUA-
+| **OpenCUA-7B** | 79.0 | 62.0 | 44.3 | 75.2 |
+| **OpenCUA-32B** | **81.9** | 66.1 | 55.7 | **79.1** |
 </div>

 # 🚀 Quick Start