metadata

title: README
emoji: 🏃
colorFrom: indigo
colorTo: red
sdk: streamlit
pinned: false

Red Hat AI: Build AI for your world

Red Hat AI is built on open-source innovation, driven through close collaboration with IBM and Red Hat AI research, engineering, and business units.

We strongly believe the future of AI is open and community-driven. As such, we are hosting our latest optimized models on Hugging Face, fully open for the world to use. We hope that the AI community will find our efforts useful and that our models help fuel their research and efficient AI deployments.

With Red Hat AI, you can:

Use quantized variants of leading open-source models such as Llama, Mistral, Granite, DeepSeek, Qwen, Gemma, Phi, and many more.
Tune smaller, purpose-built models with your own data.
Quantize your models with LLM Compressor or use our pre-optimized models on Hugging Face.
Optimize inference with vLLM across any hardware and deployment scenario.

We provide accurate model checkpoints that are ready to run in vLLM, compressed with state-of-the-art methods into quantization schemes such as W4A16, W8A16, and W8A8 (int8 and fp8), among many others. If you would like help quantizing a model, or want to request a new checkpoint, please open an issue at https://github.com/vllm-project/llm-compressor.
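As a rough guide to reading the scheme labels above: in the usual convention, the number after W is the bit width of the weights and the number after A is the bit width of the activations. The sketch below parses a label and estimates weight storage for a hypothetical 8-billion-parameter model. The function names and the parameter count are illustrative assumptions, not part of LLM Compressor or vLLM, and the estimate ignores scales, zero points, and any layers left unquantized.

```python
import re


def parse_scheme(name):
    """Split a label such as 'W4A16' into (weight_bits, activation_bits)."""
    match = re.fullmatch(r"W(\d+)A(\d+)", name)
    if match is None:
        raise ValueError(f"unrecognized scheme label: {name!r}")
    return int(match.group(1)), int(match.group(2))


def weight_gib(num_params, weight_bits):
    """Approximate weight storage in GiB.

    Assumption: every parameter is stored at weight_bits; quantization
    metadata (scales, zero points) and unquantized layers are ignored.
    """
    return num_params * weight_bits / 8 / 2**30


if __name__ == "__main__":
    params = 8_000_000_000  # hypothetical model size, for illustration only
    for scheme in ("W4A16", "W8A16", "W8A8"):
        w_bits, a_bits = parse_scheme(scheme)
        size = weight_gib(params, w_bits)
        print(f"{scheme}: {w_bits}-bit weights, {a_bits}-bit activations, "
              f"about {size:.1f} GiB of weights")
```

Real checkpoints will be somewhat larger than this estimate, so treat it as a lower bound when sizing hardware.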

Learn more at https://www.redhat.com/en/products/ai