Community Blog & Articles

Community Articles

Nemotron 3 Nano 4B: A Compact Hybrid Model for Efficient Local AI

Introducing SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding

The First Healthcare Robotics Dataset and Foundational Physical AI Models for Healthcare Robotics

Beyond Semantic Similarity: Introducing NVIDIA NeMo Retriever’s Generalizable Agentic Retrieval Pipeline

Expanding the Alpamayo Open Platform for Developing Reasoning AVs Across Models, Data, and Simulation

LoRA Fine-Tuning BitNet b1.58 LLMs on Heterogeneous Edge GPUs via QVAC Fabric

NEO-unify: Building Native Multimodal Unified Models End to End

Uncensor any LLM with abliteration

ATE-2: State-of-the-Art Armenian Text Embeddings and the ArmBench-TextEmbed Benchmark

KV Caching Explained: Optimizing Transformer Inference Efficiency

Granite 4.0 1B Speech: Compact, Multilingual, and Built for the Edge

SILMA TTS: A Lightweight Open Bilingual Text to Speech Model

Tokenization is Killing our Multilingual LLM Dream

Nemotron-Personas: Improve AI Training With the First Synthetic Personas Dataset Aligned to Real-World Distributions

We're open-sourcing "The Amazing Hand", a fully 3D printed robotic hand for less than $200 ✌️✌️✌️

From GRPO to DAPO and GSPO: What, Why, and How

How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

NanoVDR: A 70M Text-Only Model That Retrieves Visual Documents as Well as a 2B VLM

Arabic TTS Arena: Ranking Voice Models the Way Chess Ranks Grandmasters

Build a Domain-Specific Embedding Model in Under a Day

What's New in Mellea 0.4.0 + Granite Libraries Release

State of Open Source on Hugging Face: Spring 2026

Holotron-12B - High Throughput Computer Use Agent

hub, storage, announcement

Introducing Storage Buckets on the Hugging Face Hub

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

guide, distributed-training, accelerate

Ulysses Sequence Parallelism: Training with Million-Token Contexts

lerobot, robotics

LeRobot v0.5.0: Scaling Every Dimension

Bringing Robotics AI to Embedded Platforms: Dataset Recording, VLA Fine‑Tuning, and On‑Device Optimizations

open-source, diffusers, generative

Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines

PRX Part 3 — Training a Text-to-Image Model in 24h!

mixture-of-experts, optimization, transformers

Mixture of Experts (MoEs) in Transformers

February 26, 2026

community, open-source, llm

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

February 20, 2026

llm, fine-tuning, training

Train AI models with Unsloth and Hugging Face Jobs for FREE

February 20, 2026

Community Articles

New: articles from Team or Enterprise organizations are promoted to the main section.
