Blog, Articles, and discussions

TextQuests: How Good are LLMs at Text-Based Video Games?

August 12, 2025 • guest • 15

Community Articles

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

and 4 others • 3 days ago • 41

Bridging the Gap: Making Robotics Feel Like Machine Learning

2 days ago • 10

Announcing the Synthetic Online Conversations Dataset (SOC)

2 days ago • 10

Uncensor any LLM with abliteration

Jun 13, 2024 • 649

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7 • 205

OpenAI just dropped two massive open-weight models — but how do we separate the reality from the hype?

and 2 others • 5 days ago • 9

From GRPO to DAPO and GSPO: What, Why, and How

5 days ago • 8

Luth: Efficient French Specialization for Small Language Models

and 1 other • 3 days ago • 8

What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware

6 days ago • 7

The Missing Semester of AI for Organizations #1: LLM Security

8 days ago • 8

Introducing : 🤏🏻🏭SmolFactory

4 days ago • 5

What I Learned Upscaling a Long-distance Midjourney Photo w/ Stable Diffusion PLUS unboxing Qwen Image & Wan 2.2

6 days ago • 5

How I Built 7 Custom Gradio Components in Just 12 Days!

2 days ago • 5

How to Run a Hugging Face Model in JAX (Part 3)

2 days ago • 5

SmolVLM2: Bringing Video Understanding to Every Device

February 20, 2025 • guest • 293

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

February 19, 2025 • 70

Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥

February 18, 2025 • 100

Welcome Fireworks.ai on the Hub 🎆

February 14, 2025 • 59

Fixing Open LLM Leaderboard with Math-Verify

February 14, 2025 • 30

1 Billion Classifications

February 13, 2025 • guest • 44

From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub

February 12, 2025 • 72

Build awesome datasets for video generation

February 12, 2025 • 34

The Open Arabic LLM Leaderboard 2

February 10, 2025 • guest • 35

Open-source DeepResearch – Freeing our search agents

February 4, 2025 • 1.28k

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

February 4, 2025 • 169

DABStep: Data Agent Benchmark for Multi-step Reasoning

February 4, 2025 • guest • 100

The AI tools for Art Newsletter - Issue 1

January 31, 2025 • 83

How to deploy and fine-tune DeepSeek models on AWS

January 30, 2025 • 52

Community Articles

AWorld Multi-Agent System Hits #1 on GAIA Leaderboard

7 days ago • 22

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

and 9 others • 3 days ago • 19

The GPT-OSS models are here… and they’re energy-efficient!

7 days ago • 16

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

5 days ago • 12

Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning

6 days ago • 11

Code a simple RAG from scratch

Oct 29, 2024 • 152
