Eval request
I would like to kindly ask to eval the following models:
Finetunes:
https://huggingface.co/zerofata/GLM-4.5-Iceblink-v2-106B-A12B
https://huggingface.co/kldzj/gpt-oss-120b-heretic-v2 (reasoning)
https://huggingface.co/cerebras/GLM-4.6-REAP-218B-A32B (hybrid reasoning)
̶ ̶h̶t̶t̶p̶s̶:̶/̶/̶h̶u̶g̶g̶i̶n̶g̶f̶a̶c̶e̶.̶c̶o̶/̶T̶h̶e̶D̶r̶u̶m̶m̶e̶r̶/̶S̶k̶y̶f̶a̶l̶l̶-̶3̶6̶B̶-̶v̶2̶
̶h̶t̶t̶p̶s̶:̶/̶/̶h̶u̶g̶g̶i̶n̶g̶f̶a̶c̶e̶.̶c̶o̶/̶T̶h̶e̶D̶r̶u̶m̶m̶e̶r̶/̶R̶i̶v̶e̶r̶m̶i̶n̶d̶-̶2̶4̶B̶-̶v̶1̶
̶h̶t̶t̶p̶s̶:̶/̶/̶h̶u̶g̶g̶i̶n̶g̶f̶a̶c̶e̶.̶c̶o̶/̶R̶e̶a̶d̶y̶A̶r̶t̶/̶D̶a̶r̶k̶-̶N̶e̶x̶u̶s̶-̶3̶2̶B̶-̶v̶2̶.̶0̶ ̶(hybrid-̶r̶e̶a̶s̶o̶n̶i̶n̶g̶)̶
̶h̶t̶t̶p̶s̶:̶/̶/̶h̶u̶g̶g̶i̶n̶g̶f̶a̶c̶e̶.̶c̶o̶/̶E̶w̶e̶r̶e̶/̶Q̶w̶e̶n̶3̶-̶3̶0̶B̶-̶A̶3̶B̶-̶a̶b̶l̶i̶t̶e̶r̶a̶t̶e̶d̶-̶e̶r̶o̶t̶i̶c̶ ̶(̶r̶e̶a̶s̶o̶n̶i̶n̶g̶)̶
̶h̶t̶t̶p̶s̶:̶/̶/̶h̶u̶g̶g̶i̶n̶g̶f̶a̶c̶e̶.̶c̶o̶/̶T̶h̶e̶D̶r̶u̶m̶m̶e̶r̶/̶P̶r̶e̶c̶o̶g̶-̶2̶4̶B̶-̶v̶1̶ ̶(̶t̶h̶i̶n̶k̶i̶n̶g̶)̶
̶h̶t̶t̶p̶s̶:̶/̶/̶h̶u̶g̶g̶i̶n̶g̶f̶a̶c̶e̶.̶c̶o̶/̶T̶h̶e̶D̶r̶u̶m̶m̶e̶r̶/̶S̶n̶o̶w̶p̶i̶e̶r̶c̶e̶r̶-̶1̶5̶B̶-̶v̶4̶
̶h̶t̶t̶p̶s̶:̶/̶/̶h̶u̶g̶g̶i̶n̶g̶f̶a̶c̶e̶.̶c̶o̶/̶c̶e̶r̶e̶b̶r̶a̶s̶/̶G̶L̶M̶-̶4̶.̶5̶-̶A̶i̶r̶-̶R̶E̶A̶P̶-̶8̶2̶B̶-̶A̶1̶2̶B̶ ̶(̶h̶y̶b̶r̶i̶d̶ ̶r̶e̶a̶s̶o̶n̶i̶n̶g̶)̶
Base models:
https://huggingface.co/aquif-ai/aquif-3.5-Max-42B-A3B (reasoning)
https://huggingface.co/allenai/Olmo-3-32B-Think (reasoning)
https://huggingface.co/ai-sage/GigaChat3-10B-A1.8B (no reasoning)
̶h̶t̶t̶p̶s̶:̶/̶/̶h̶u̶g̶g̶i̶n̶g̶f̶a̶c̶e̶.̶c̶o̶/̶L̶G̶A̶I̶-̶E̶X̶A̶O̶N̶E̶/̶E̶X̶A̶O̶N̶E̶-̶4̶.̶0̶-̶3̶2̶B̶ ̶(̶h̶y̶b̶r̶i̶d̶ ̶r̶e̶a̶s̶o̶n̶i̶n̶g̶)̶
h̶t̶t̶p̶s̶:̶/̶/̶h̶u̶g̶g̶i̶n̶g̶f̶a̶c̶e̶.̶c̶o̶/̶M̶i̶n̶i̶M̶a̶x̶A̶I̶/̶M̶i̶n̶i̶M̶a̶x̶-̶M̶2̶ ̶(̶r̶e̶a̶s̶o̶n̶i̶n̶g̶)
̶h̶t̶t̶p̶s̶:̶/̶/̶h̶u̶g̶g̶i̶n̶g̶f̶a̶c̶e̶.̶c̶o̶/̶m̶o̶o̶n̶s̶h̶o̶t̶a̶i̶/̶K̶i̶m̶i̶-̶K̶2̶-̶T̶h̶i̶n̶k̶i̶n̶g̶ ̶(̶r̶e̶a̶s̶o̶n̶i̶n̶g̶)̶
̶h̶t̶t̶p̶s̶:̶/̶/̶h̶u̶g̶g̶i̶n̶g̶f̶a̶c̶e̶.̶c̶o̶/̶n̶v̶i̶d̶i̶a̶/̶Q̶w̶e̶n̶3̶-̶N̶e̶m̶o̶t̶r̶o̶n̶-̶3̶2̶B̶-̶R̶L̶B̶F̶F̶ ̶ ̶(̶r̶e̶a̶s̶o̶n̶i̶n̶g̶)̶
Thank you very much for your amazing work!
Another vote for MiniMax M2)
MiniMax M2, turned out to be a huge disappointment. I used it a bit for RP and yeah, had that feeling.
Ewere/Qwen3-30B-A3B-abliterated-erotic W 9.5/10 is a hidden gem for its speed, at this time, it's the highest scoring MoE model that can run on a single consumer GPU.
Can't wait to see how the other models perform!
TheDrummer released a bunch of new, very interesting finetunes!
- https://huggingface.co/TheDrummer/Precog-123B-v1 (thinking)
- https://huggingface.co/TheDrummer/Precog-24B-v1 (thinking)
- https://huggingface.co/TheDrummer/Snowpiercer-15B-v4 (long thinking, GGUF here: https://huggingface.co/TheDrummer/Snowpiercer-15B-v4-GGUF)
- https://huggingface.co/TheDrummer/Rivermind-24B-v1
I wonder how they will perform! (edited the first comment to add them)