RedHatAI/Mixtral-8x7B-Instruct-v0.1-AutoFP8 • Text Generation • 47B params • 23 downloads • 3 likes
RedHatAI/Meta-Llama-3-70B-Instruct-FP8 • Text Generation • 71B params • 58.6k downloads • 13 likes
RedHatAI/Meta-Llama-3-8B-Instruct-FP8 • Text Generation • 8B params • 2.94k downloads • 23 likes
RedHatAI/DeepSeek-Coder-V2-Lite-Base-FP8 • Text Generation • 16B params • 52 downloads
RedHatAI/DeepSeek-Coder-V2-Lite-Instruct-FP8 • Text Generation • 16B params • 23.6k downloads • 7 likes
RedHatAI/Qwen2-7B-Instruct-quantized.w4a16 • Text Generation • 2B params • 46 downloads
RedHatAI/Qwen2-72B-Instruct-quantized.w4a16 • Text Generation • 12B params • 12 downloads • 4 likes
RedHatAI/Qwen2-1.5B-Instruct-quantized.w4a16 • Text Generation • 0.6B params • 24 downloads
RedHatAI/Qwen2-0.5B-Instruct-quantized.w4a16 • Text Generation • 0.3B params • 16 downloads
RedHatAI/Qwen2-72B-Instruct-quantized.w8a16 • Text Generation • 20B params • 680 downloads • 1 like
RedHatAI/Qwen2-7B-Instruct-quantized.w8a16 • Text Generation • 3B params • 31 downloads
RedHatAI/Qwen2-1.5B-Instruct-quantized.w8a16 • Text Generation • 0.6B params • 12 downloads
RedHatAI/Qwen2-0.5B-Instruct-quantized.w8a16 • Text Generation • 0.2B params • 12 downloads
RedHatAI/Llama-2-7b-chat-quantized.w4a16 • Text Generation • 1B params • 16 downloads
RedHatAI/Meta-Llama-3-8B-Instruct-quantized.w4a16 • Text Generation • 2B params • 125 downloads • 2 likes
RedHatAI/Llama-2-7b-chat-quantized.w8a16 • Text Generation • 2B params • 15 downloads
RedHatAI/Mistral-7B-Instruct-v0.3-quantized.w8a16 • Text Generation • 2B params • 273 downloads
RedHatAI/Meta-Llama-3-70B-Instruct-quantized.w8a16 • Text Generation • 19B params • 1.12k downloads • 5 likes
RedHatAI/Meta-Llama-3-8B-Instruct-quantized.w8a16 • Text Generation • 3B params • 1.34k downloads • 3 likes
RedHatAI/SparseLLama-2-7b-ultrachat_200k-pruned_50.2of4 • Text Generation • 7B params • 13 downloads
RedHatAI/SparseLlama-2-7b-evolcodealpaca-pruned_50.2of4 • Text Generation • 7B params • 11 downloads
RedHatAI/Meta-Llama-3-70B-Instruct-FP8-KV • Text Generation • 71B params • 52 downloads • 2 likes
RedHatAI/Llama-2-7b-gsm8k-pruned_70 • Text Generation • 7B params • 16 downloads
RedHatAI/Llama-2-7b-gsm8k-pruned_50 • Text Generation • 7B params • 16 downloads • 1 like
RedHatAI/Llama-2-7b-gsm8k • Text Generation • 1.71k downloads • 3 likes
RedHatAI/Meta-Llama-3-8B-Instruct-FP8-KV • Text Generation • 8B params • 4.56k downloads • 8 likes
RedHatAI/Mistral-7B-Instruct-v0.3-GPTQ-4bit • Text Generation • 1B params • 8.72k downloads • 19 likes
RedHatAI/SparseLlama-2-7b-cnn-daily-mail-pruned_50.2of4 • Text Generation • 7B params • 11 downloads
RedHatAI/SparseLlama-2-7b-cnn-daily-mail-pruned_70
RedHatAI/Llama-2-7b-cnn-daily-mail-pruned_70-quantized-deepsparse • Text Generation • 12 downloads
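The checkpoints above are published in compressed formats (FP8, INT4/INT8 weight-only, GPTQ). As a minimal sketch, assuming a vLLM installation and a GPU with FP8 support, loading one of the listed FP8 models could look like the following; the model ID is taken from the listing, all other parameters are illustrative.

# Minimal sketch: running one of the FP8 checkpoints listed above with vLLM.
# Assumes vLLM is installed and the GPU supports FP8 weights;
# sampling settings and the prompt are illustrative only.
from vllm import LLM, SamplingParams

llm = LLM(model="RedHatAI/Meta-Llama-3-8B-Instruct-FP8")

params = SamplingParams(temperature=0.7, max_tokens=128)
outputs = llm.generate(["Summarize what weight quantization does."], params)

for out in outputs:
    print(out.outputs[0].text)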