llama.cpp Support (See Loop-Instruct variant)

#9 by nologik


Note: If you're looking for llama.cpp/GGUF support, please check out the Loop-Instruct variant:

πŸ‘‰ IQuestLab/IQuest-Coder-V1-40B-Loop-Instruct

That model features an advanced loop-attention mechanism with dual attention and learned gating, and it is now fully supported in llama.cpp!

Pre-converted GGUF models available at: https://huggingface.co/Avarok/IQuest-Coder-V1-40B-Loop-Instruct-GGUF

Sizes: F16 (75GB), Q8_0 (40GB), Q5_K_M (27GB), Q4_K_M (23GB)

llama.cpp PR: https://github.com/ggml-org/llama.cpp/pull/18680
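For anyone who wants a quick start, here is a rough sketch of how you might fetch and run one of the quants above with a llama.cpp build that includes the linked PR. The exact GGUF filename inside the repo is an assumption; check the repo's file listing and adjust accordingly:

```shell
# Download just the Q4_K_M quant from the GGUF repo
# (the *Q4_K_M* filename pattern is assumed; verify it in the repo's file list)
huggingface-cli download Avarok/IQuest-Coder-V1-40B-Loop-Instruct-GGUF \
  --include "*Q4_K_M*" --local-dir ./models

# Run it with llama.cpp's CLI (requires a build containing the support PR);
# replace the .gguf path with the actual downloaded filename
llama-cli -m ./models/IQuest-Coder-V1-40B-Loop-Instruct-Q4_K_M.gguf \
  -p "Write a quicksort in Python" -n 256
```

Recent llama.cpp builds can also pull a quant straight from the Hub with the `-hf repo:quant` shorthand, which skips the separate download step.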
