nm-testing/llama2.c-stories110M-gsm8k-recipe_w4a16_actorder_weight-compressed 0.1B • Updated Mar 12 • 1.83k
nm-testing/Meta-Llama-3-8B-Instruct-FP8-channel-output-activation-kv_cache-qkv_proj 8B • Updated Mar 10 • 4