NM Testing

company

AI & ML interests

None defined yet.

Recent Activity

nm-autobot updated a model about 19 hours ago

nm-testing/w8a8_static_asym-e2e

nm-autobot updated a model about 19 hours ago

nm-testing/w8a8_dynamic_asym-e2e

nm-autobot updated a model about 19 hours ago

nm-testing/w8a16_grouped_quant-e2e

View all activity

nm-testing 's models 523

nm-testing/paligemma-3b-mix-224-FP8-dynamic

3B • Updated Oct 29, 2024 • 2

nm-testing/TinyLlama-1.1B-Chat-v1.0-pruned_50.2of4-compressed

0.7B • Updated Oct 25, 2024 • 1

nm-testing/Sparse-Llama-3.1-8B-gsm8k-2of4-quantized.w4a16

2B • Updated Oct 24, 2024 • 1

nm-testing/SparseLlama-3.1-8B-gsm8k-pruned.2of4-FP8

8B • Updated Oct 24, 2024 • 1

nm-testing/SparseLlama-3.1-8B-gsm8k-pruned.2of4-quantized.w8a8

8B • Updated Oct 24, 2024 • 3

nm-testing/TinyLlama-1.1B-Chat-v1.0-pruned_50.2of4-uncompressed

1B • Updated Oct 23, 2024 • 2

nm-testing/SparseLlama-3-8B-pruned_50.2of4_Fp8-compressed

5B • Updated Oct 22, 2024 • 2

nm-testing/Phi-3.5-vision-instruct-W8A8-Dynamic-Per-Token

4B • Updated Oct 17, 2024 • 5

nm-testing/Phi-3.5-vision-instruct-FP8-dynamic

4B • Updated Oct 17, 2024 • 483

nm-testing/MiniCPM-V-2_6-FP8-dynamic

8B • Updated Oct 17, 2024 • 1

nm-testing/tinyllama-one-shot-w4a16-group-packed

Text Generation • 1B • Updated Oct 10, 2024 • 2

nm-testing/llama1.1b_0.5_sparse_bitmask

Text Generation • 0.8B • Updated Oct 9, 2024 • 2

nm-testing/llama7b-one-shot-2_4-w4a16-packed

Text Generation • 1B • Updated Oct 9, 2024 • 5

nm-testing/tinyllama-one-shot-w4a16-group128-packed

Text Generation • 0.3B • Updated Oct 9, 2024 • 2

nm-testing/tinyllama-one-shot-w4a16-channel-packed

Text Generation • 0.3B • Updated Oct 9, 2024 • 2

nm-testing/tinyllama-one-shot-w4a16-channel-compressed

Text Generation • 1B • Updated Oct 9, 2024 • 2

nm-testing/tinyllama-one-shot-dynamic-test

Text Generation • 1B • Updated Oct 9, 2024 • 21

nm-testing/tinyllama-one-shot-static-quant-test-compressed

Text Generation • 1B • Updated Oct 9, 2024 • 21

nm-testing/asym-w8w8-int8-static-per-tensor-tiny-llama

1B • Updated Oct 9, 2024 • 7.45k

nm-testing/tinyllama-oneshot-w8a8-channel-dynamic-token-v2-asym

1B • Updated Oct 9, 2024 • 60

nm-testing/OLMoE-1B-7B-0924-Instruct-FP8

7B • Updated Oct 9, 2024 • 4

nm-testing/DeepSeek-Coder-V2-Lite-Instruct-W8A8

16B • Updated Oct 9, 2024 • 3

nm-testing/tinyllama-w8a8-compressed

1B • Updated Oct 9, 2024 • 578

nm-testing/tinyllama-w4a16-compressed

1B • Updated Oct 9, 2024 • 1.01k

nm-testing/tinyllama-fp8-dynamic-compressed

1B • Updated Oct 9, 2024 • 313

nm-testing/SmolLM-1.7B-Instruct-quantized.w4a16

Text Generation • 2B • Updated Oct 9, 2024 • 731k

nm-testing/SmolLM-360M-Instruct-quantized.w4a16

0.4B • Updated Oct 9, 2024 • 2

nm-testing/SmolLM-135M-Instruct-quantized.w4a16

Text Generation • 0.2B • Updated Oct 9, 2024 • 2

nm-testing/Mixtral-8x7B-Instruct-v0.1-W4A16-channel-quantized

47B • Updated Oct 9, 2024 • 2

nm-testing/Meta-Llama-3-8B-Instruct-fp8-compressed

8B • Updated Oct 9, 2024 • 1