Update README.md
Browse files
README.md
CHANGED
@@ -34,7 +34,7 @@ The benchmark results demonstrate a level of performance that significantly surp
|
|
34 |
|
35 |
| Model | HumanEval Pass@1 Score | Note |
|
36 |
| :---------------------------------- | :--------------------: | :--------------------- |
|
37 |
-
| **moelanoby/phi3-M3-V2 (This Model)** | **95.12%/98.17%/98.56%** | **Commercial License** and they are ordered with 0,1,2 self corrections |
|
38 |
| GPT-4.5 / "Orion" | `~96.00%` | Projected (Late 2025) |
|
39 |
| Gemini 2.5 Pro | `~95.00%` | Projected (Late 2025) |
|
40 |
| Claude 4 | `~94.00%` | Projected (Late 2025) |
|
|
|
34 |
|
35 |
| Model | HumanEval Pass@1 Score | Note |
|
36 |
| :---------------------------------- | :--------------------: | :--------------------- |
|
37 |
+
| **moelanoby/phi3-M3-V2 (This Model)** | **95.12%/98.17%/98.56%** | **Commercial License** and they are ordered with 0,1,2 self corrections with 1 being the default |
|
38 |
| GPT-4.5 / "Orion" | `~96.00%` | Projected (Late 2025) |
|
39 |
| Gemini 2.5 Pro | `~95.00%` | Projected (Late 2025) |
|
40 |
| Claude 4 | `~94.00%` | Projected (Late 2025) |
|