Blog, Articles, and discussions

Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders

By July 16, 2025 • 58

Community Articles

view all

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

and 4 others •

5 days ago

• 49

<p style="text-align:center;"> Bridging the Gap: Making Robotics Feel Like Machine Learning </p>

•

4 days ago

• 10

OpenAI just dropped two massive open-weight models — but how do we separate the reality from the hype?

and 2 others •

6 days ago

• 10

Announcing the Synthetic Online Conversations Dataset (SOC)

•

4 days ago

• 10

From GRPO to DAPO and GSPO: What, Why, and How

•

7 days ago

• 9

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 650

Code a simple RAG from scratch

•

Oct 29, 2024

• 154

AWorld Multi-Agent System Hits #1 on GAIA Leaderboard

•

9 days ago

• 23

Luth: Efficient French Specialization for Small Language Models

and 1 other •

5 days ago

• 8

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 206

The Missing Semester of AI for Organizations #1: LLM Security

•

10 days ago

• 8

The GPT-OSS models are here… and they’re energy-efficient!

•

9 days ago

• 16

What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware

•

8 days ago

• 7

Kimina-Prover-RL

and 18 others •

1 day ago

• 6

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

Jan 30

• 113

Introducing : 🤏🏻🏭SmolFactory

•

5 days ago

• 5

CPU Optimized Embeddings with 🤗 Optimum Intel and fastRAG

By March 15, 2024 guest • 10

Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset

By March 15, 2024 • 11

Text-Generation Pipeline on Intel® Gaudi® 2 AI Accelerator

By February 29, 2024 guest • 1

StarCoder2 and The Stack v2

By February 28, 2024 • 9

AI Watermarking 101: Tools and Techniques

By February 26, 2024 • 21

Fine-Tuning Gemma Models in Hugging Face

By February 23, 2024 guest • 37

🪆 Introduction to Matryoshka Embedding Models

By February 23, 2024 • 154

Welcome Gemma - Google's new open LLM

By February 21, 2024 • 25

🤗 PEFT welcomes new merging methods

By February 19, 2024 • 22

Synthetic data: save money, time and carbon with open source

By February 16, 2024 • 78

From OpenAI to Open LLMs with Messages API

By February 8, 2024 • 20

Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding

By January 30, 2024 guest • 9

Open-source LLMs as LangChain Agents

By January 24, 2024 • 69

Preference Tuning LLMs with Direct Preference Optimization Methods

By January 18, 2024 • 70

Community Articles

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

and 4 others •

5 days ago

• 49

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

and 9 others •

5 days ago

• 20

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

•

7 days ago

• 13

How To Build a News Agent with GPT-OSS, Hugging Face Inference & Gradio

•

1 day ago

• 12

Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning

•

7 days ago

• 12

<p style="text-align:center;"> Bridging the Gap: Making Robotics Feel Like Machine Learning </p>

•

4 days ago

• 10

OpenAI just dropped two massive open-weight models — but how do we separate the reality from the hype?

and 2 others •

6 days ago

• 10

Announcing the Synthetic Online Conversations Dataset (SOC)

•

4 days ago

• 10

From GRPO to DAPO and GSPO: What, Why, and How

•

7 days ago

• 9

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 650

Code a simple RAG from scratch

•

Oct 29, 2024

• 154

AWorld Multi-Agent System Hits #1 on GAIA Leaderboard

•

9 days ago

• 23

Luth: Efficient French Specialization for Small Language Models

and 1 other •

5 days ago

• 8

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 206

The Missing Semester of AI for Organizations #1: LLM Security

•

10 days ago

• 8

The GPT-OSS models are here… and they’re energy-efficient!

•

9 days ago

• 16

What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware

•

8 days ago

• 7

Kimina-Prover-RL

and 18 others •

1 day ago

• 6

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

Jan 30

• 113

Introducing : 🤏🏻🏭SmolFactory

•

5 days ago

• 5

View all

Blog, Articles, and discussions

Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

How To Build a News Agent with GPT-OSS, Hugging Face Inference & Gradio

Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning

<p style="text-align:center;"> Bridging the Gap: Making Robotics Feel Like Machine Learning </p>

OpenAI just dropped two massive open-weight models — *but how do we separate the reality from the hype?*

Announcing the Synthetic Online Conversations Dataset (SOC)

From GRPO to DAPO and GSPO: What, Why, and How

Uncensor any LLM with abliteration

Code a simple RAG from scratch

AWorld Multi-Agent System Hits #1 on GAIA Leaderboard

Luth: Efficient French Specialization for Small Language Models

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

The Missing Semester of AI for Organizations #1: LLM Security

The GPT-OSS models are here… and they’re energy-efficient!

What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware

Kimina-Prover-RL

KV Caching Explained: Optimizing Transformer Inference Efficiency

Introducing : 🤏🏻🏭SmolFactory

CPU Optimized Embeddings with 🤗 Optimum Intel and fastRAG

Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset

Text-Generation Pipeline on Intel® Gaudi® 2 AI Accelerator

StarCoder2 and The Stack v2

AI Watermarking 101: Tools and Techniques

Fine-Tuning Gemma Models in Hugging Face

🪆 Introduction to Matryoshka Embedding Models

Welcome Gemma - Google's new open LLM

🤗 PEFT welcomes new merging methods

Synthetic data: save money, time and carbon with open source

From OpenAI to Open LLMs with Messages API

Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding

Open-source LLMs as LangChain Agents

Preference Tuning LLMs with Direct Preference Optimization Methods

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

How To Build a News Agent with GPT-OSS, Hugging Face Inference & Gradio

Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning

<p style="text-align:center;"> Bridging the Gap: Making Robotics Feel Like Machine Learning </p>

OpenAI just dropped two massive open-weight models — *but how do we separate the reality from the hype?*

Announcing the Synthetic Online Conversations Dataset (SOC)

From GRPO to DAPO and GSPO: What, Why, and How

Uncensor any LLM with abliteration

Code a simple RAG from scratch

AWorld Multi-Agent System Hits #1 on GAIA Leaderboard

Luth: Efficient French Specialization for Small Language Models

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

The Missing Semester of AI for Organizations #1: LLM Security

The GPT-OSS models are here… and they’re energy-efficient!

What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware

Kimina-Prover-RL

KV Caching Explained: Optimizing Transformer Inference Efficiency

Introducing : 🤏🏻🏭SmolFactory

OpenAI just dropped two massive open-weight models — but how do we separate the reality from the hype?

OpenAI just dropped two massive open-weight models — but how do we separate the reality from the hype?