Update README.md
Browse files
README.md
CHANGED
|
@@ -40,6 +40,15 @@ The benchmark results demonstrate a level of performance that significantly surp
|
|
| 40 |
| Gemini 2.5 Pro | `~95.00%` | Projected (Late 2025) |
|
| 41 |
| Claude 4 | `~94.00%` | Projected (Late 2025) |
|
| 42 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 43 |
---
|
| 44 |
|
| 45 |
## Support the Project
|
|
|
|
| 40 |
| Gemini 2.5 Pro | `~95.00%` | Projected (Late 2025) |
|
| 41 |
| Claude 4 | `~94.00%` | Projected (Late 2025) |
|
| 42 |
|
| 43 |
+
A more reliable benchmark is one that's made by u/Chromix_
|
| 44 |
+
|
| 45 |
+
|Test|This LLM|Phi3-Mini-Instruct|
|
| 46 |
+
|:-|:-|:-|
|
| 47 |
+
|junior-v2 Python|83|90 / 83|
|
| 48 |
+
|junior-v2 JavaScript|72|85 / 79|
|
| 49 |
+
|senior Python|25|59 / 30|
|
| 50 |
+
|senior JavaScript|39|37 / 23|
|
| 51 |
+
|
| 52 |
---
|
| 53 |
|
| 54 |
## Support the Project
|