Blog, Articles, and Discussions

Introducing RTEB: A New Standard for Retrieval Evaluation

October 1, 2025 • guest • 105

Smol2Operator: Post-Training GUI Agents for Computer Use

September 23, 2025 • 115

mmBERT: ModernBERT goes Multilingual

September 9, 2025 • 110

Welcome EmbeddingGemma, Google's new efficient embedding model

September 4, 2025 • 235

Introducing AI Sheets: a tool to work with datasets using open AI models!

August 8, 2025 • 100

🇵🇭 FilBench - Can LLMs Understand and Generate Filipino?

August 12, 2025 • 16

Welcome GPT OSS, the new open-source model family from OpenAI!

August 5, 2025 • 498

Build an AI Shopping Assistant with Gradio MCP Servers

July 31, 2025 • 58

Five Big Improvements to Gradio MCP Servers

July 17, 2025 • 24

Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders

July 16, 2025 • 71

ScreenEnv: Deploy your full stack Desktop Agent

July 10, 2025 • 71

Building the Hugging Face MCP Server

July 10, 2025 • 66

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

July 9, 2025 • 687

SmolLM3: smol, multilingual, long-context reasoner

July 8, 2025 • 695

Efficient MultiModal Data Pipeline

July 8, 2025 • 56

Community Articles

Visualizing How VLMs Work

and 1 other •

6 days ago

• 29

mem-agent: Equipping LLM Agents with Memory Using RL

and 1 other •

3 days ago

• 17

How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons

•

13 days ago

• 28

BigCodeArena: Judging code generations end to end with code executions

•

6 days ago

• 16

ModernVBERT: Towards Smaller Visual Document Retrievers

and 4 others •

10 days ago

• 39

Ethics + Sustainability = Responsible AI

and 1 other •

3 days ago

• 8

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 694

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

Jan 30

• 143

Small Language Models (SLM): A Comprehensive Overview

•

Feb 22

• 86

Reactive Transformer (RxT): Fixing the Memory Problem in Conversational AI

•

4 days ago

• 5

🛠 ML-Agents Tips & Lessons Learned (AutoMind + MLE-Bench)

•

3 days ago

• 5

Code a simple RAG from scratch

•

Oct 29, 2024

• 217

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 232

From GRPO to DAPO and GSPO: What, Why, and How

•

Aug 9

• 40

CU-1 for Autonomous UI Agent Systems: An Open Alternative to Proprietary Solutions

•

11 days ago

• 16

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

•

Jul 29, 2024

• 364

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

•

Feb 11

• 75

AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models

and 4 others •

26 days ago

• 17

The Past and Present of Sparse Retrieval

•

9 days ago

• 4

Vocabulary is the most important element of Sparse Retrieval

•

8 days ago

• 6
