MiniMaxAI
/

MiniMax-M1-40k

Text Generation

Model card Files Files and versions

realolipop commited on 9 days ago

Commit

e283307

·

verified ·

1 Parent(s): 5eeb47c

Update README.md

Files changed (1) hide show

README.md +1 -0

README.md CHANGED Viewed

@@ -120,6 +120,7 @@ foundation for next-generation language model agents to reason and tackle real-w
 | ***General Assistant***| MultiChallenge | 44.7 | 44.7 | 40.0 | 45.0 | 40.7 | 43.0 | 45.8 | 51.8 | 56.5 |
 \* conducted on the text-only HLE subset.
 ### SWE-bench methodology
 We report results derived from the Agentless scaffold. Departing from the original pipeline, our methodology employs a two-stage localization process (without any embedding-based retrieval mechanisms): initial coarse-grained file localization followed by fine-grained localization to specific files and code elements. The values for our models are calculated on the subset of n=486 verified tasks which work on our infrastructure. The excluded 14 test cases that were incompatible with our internal infrastructure are:

 | ***General Assistant***| MultiChallenge | 44.7 | 44.7 | 40.0 | 45.0 | 40.7 | 43.0 | 45.8 | 51.8 | 56.5 |
 \* conducted on the text-only HLE subset.
+Our models are evaluated with temperature=1.0, top_p=0.95.
 ### SWE-bench methodology
 We report results derived from the Agentless scaffold. Departing from the original pipeline, our methodology employs a two-stage localization process (without any embedding-based retrieval mechanisms): initial coarse-grained file localization followed by fine-grained localization to specific files and code elements. The values for our models are calculated on the subset of n=486 verified tasks which work on our infrastructure. The excluded 14 test cases that were incompatible with our internal infrastructure are: