Is the GGUF file still being uploaded?

by lingyezhixing - opened Jul 9

Discussion

lingyezhixing

Jul 9

The file list is empty, is the repository created by mistake or the file has not been uploaded?

qingy2024

Jul 9

•

edited Jul 9

UD Q3_K_XL has just been uploaded one minute ago :)

Hopefully all should be here in the next hour or so because the imatrix takes a while but the quants should be quick.

danielhanchen

Unsloth AI org Jul 9

All done sorry! I was investigating why the perplexity of the model was super high - 180 to 300. It should be OK I think - I verified after using it a bit. Please use:

./llama.cpp/llama-cli -hf unsloth/Hunyuan-A13B-Instruct-GGUF:Q4_K_XL -ngl 99 --jinja --temp 0.7 --top-k 20 --top-p 0.8 --repeat-penalty 1.05

imoc

Jul 9

haha it is that high, see https://github.com/ggml-org/llama.cpp/pull/14425, making all confused

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment