nm-testing/llama-3-instruct-fp8-dynamic-shared-scales
Text Generation
•
8B
•
Updated
•
3
nm-testing/opt-125m-fp8-dynamic
Text Generation
•
0.1B
•
Updated
•
4
nm-testing/opt-125m-fp8-static
Text Generation
•
0.1B
•
Updated
•
5
nm-testing/mistral-fp8-dynamic
Text Generation
•
7B
•
Updated
•
6
nm-testing/mistral-fp8-static
Text Generation
•
7B
•
Updated
•
4
nm-testing/tinyllama-one-shot-static-quant-test
Text Generation
•
1B
•
Updated
•
2
nm-testing/llama2.c-stories110M-pruned50-compressed-tensors
Text Generation
•
0.1B
•
Updated
•
22
nm-testing/Nous-Hermes-Llama2-13b-smoothquant
Text Generation
•
13B
•
Updated
•
2
nm-testing/llama2-7B-sparse70-retrained-ultrachat200k-pruned70-smoothquant-ds
Text Generation
•
Updated
•
2
nm-testing/tiny_starcoder_py-quant
Text Generation
•
Updated
•
2
nm-testing/opt-125m-pruned2.4
Text Generation
•
Updated
•
2
nm-testing/llama2.c-stories42M-pruned2.4
Text Generation
•
Updated
•
915
nm-testing/llama2.c-stories15M-pruned2.4
Text Generation
•
Updated
•
2
nm-testing/Llama-2-7b-pruned50-retrained
Text Generation
•
7B
•
Updated
•
6
nm-testing/zephyr-beta-7b-gptq-g128
Text Generation
•
1B
•
Updated
•
10
nm-testing/Llama-2-7b-pruned40-retrained
Text Generation
•
Updated
•
4
nm-testing/llama2-7b-gsm8k-pt-pruned50-quant-ds
Text Generation
•
Updated
•
13
nm-testing/zephyr-50sparse-24
Text Generation
•
Updated
•
4
nm-testing/TinyLlama-1.1B-intermediate-step-1431k-3T-gsms8k-pruned50-quant-ds
Text Generation
•
Updated
•
3
nm-testing/TinyLlama-1.1B-Chat-v1.0-gsm8k-pruned50-quant-ds
Text Generation
•
Updated
•
3
nm-testing/TinyLlama-1.1B-Chat-v1.0-open_platypus-pruned50-quant-ds
Updated
nm-testing/openchat-3.5-0106-pruned50-quant-ds
Text Generation
•
Updated
•
4
nm-testing/mistral-7b-dpo-merge-v1.1-pruned50-quant-ds
Text Generation
•
Updated
•
4
nm-testing/Llama-2-7B-Chat-marlin
Text Generation
•
1B
•
Updated
•
3
nm-testing/starcoderbase-1b-pruned50-quant
Text Generation
•
Updated
•
3
nm-testing/TinyLlama-1.1B-Chat-v1.0-pruned50-quant-ds-v2
Text Generation
•
Updated
•
3
nm-testing/TinyLlama-1.1B-orca-v1.0-pruned50-quant-ds
Text Generation
•
Updated
•
3
nm-testing/zyte-1B-pruned50-quant-ds
Text Generation
•
Updated
•
3
nm-testing/MiniChat-2-3B-pruned50-ds
Text Generation
•
Updated
•
3
nm-testing/Nous-Hermes-2-SOLAR-10.7B-pruned50-ds
Text Generation
•
Updated
•
4