Blog, Articles, and discussions

TextQuests: How Good are LLMs at Text-Based Video Games?

By August 12, 2025 • 4

Community Articles

view all

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

and 4 others •

1 day ago

• 24

<p style="text-align:center;"> Bridging the Gap: Making Robotics Feel Like Machine Learning </p>

•

about 23 hours ago

• 9

OpenAI just dropped two massive open-weight models — but how do we separate the reality from the hype?

and 2 others •

4 days ago

• 9

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 647

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 203

Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning

•

4 days ago

• 8

From GRPO to DAPO and GSPO: What, Why, and How

•

4 days ago

• 8

What Open-Source Developers Need to Know about the EU AI Act's Rules for GPAI Models

and 5 others •

9 days ago

• 23

The Missing Semester of AI for Organizations #1: LLM Security

•

7 days ago

• 7

Luth: Efficient French Specialization for Small Language Models

and 1 other •

2 days ago

• 6

LLM agent experiment with a purpose-built RPG and tool calls. (Work in progress)

•

8 days ago

• 7

Introducing : 🤏🏻🏭SmolFactory

•

3 days ago

• 5

Towards Open Evolutionary Agents

and 1 other •

9 days ago

• 13

What I Learned Upscaling a Long-distance Midjourney Photo w/ Stable Diffusion PLUS unboxing Qwen Image & Wan 2.2

•

4 days ago

• 5

How I Built 7 Custom Gradio Components in Just 12 Days!

•

about 22 hours ago

• 5

Introducing AI Sheets: a tool to work with datasets using open AI models!

By August 8, 2025 • 44

Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training

By August 8, 2025 • 38

🇵🇭 FilBench - Can LLMs Understand and Generate Filipino?

By August 12, 2025 • 4

Vision Language Model Alignment in TRL ⚡️

By August 7, 2025 • 40

Welcome GPT OSS, the new open-source model family from OpenAI!

By August 5, 2025 • 447

Build an AI Shopping Assistant with Gradio MCP Servers

By July 31, 2025 • 36

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

By July 29, 2025 • 149

Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨

By July 25, 2025 • 73

Parquet Content-Defined Chunking

By July 25, 2025 • 54

TimeScope: How Long Can Your Video Large Multimodal Model Go?

By July 23, 2025 • 35

Fast LoRA inference for Flux with Diffusers and PEFT

By July 23, 2025 • 42

Arc Virtual Cell Challenge: A Primer

By July 18, 2025 • 50

Consilium: When Multiple LLMs Collaborate

By July 17, 2025 guest • 21

Back to The Future: Evaluating AI Agents on Predicting Future Events

By July 17, 2025 guest • 34

Community Articles

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

and 4 others •

1 day ago

• 24

AWorld Multi-Agent System Hits #1 on GAIA Leaderboard

•

6 days ago

• 22

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

and 9 others •

2 days ago

• 19

The GPT-OSS models are here… and they’re energy-efficient!

•

6 days ago

• 15

Code a simple RAG from scratch

•

Oct 29, 2024

• 150

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

•

4 days ago

• 11

<p style="text-align:center;"> Bridging the Gap: Making Robotics Feel Like Machine Learning </p>

•

about 23 hours ago

• 9

OpenAI just dropped two massive open-weight models — but how do we separate the reality from the hype?

and 2 others •

4 days ago

• 9

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 647

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 203

Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning

•

4 days ago

• 8

From GRPO to DAPO and GSPO: What, Why, and How

•

4 days ago

• 8

What Open-Source Developers Need to Know about the EU AI Act's Rules for GPAI Models

and 5 others •

9 days ago

• 23

The Missing Semester of AI for Organizations #1: LLM Security

•

7 days ago

• 7

Luth: Efficient French Specialization for Small Language Models

and 1 other •

2 days ago

• 6

LLM agent experiment with a purpose-built RPG and tool calls. (Work in progress)

•

8 days ago

• 7

Introducing : 🤏🏻🏭SmolFactory

•

3 days ago

• 5

Towards Open Evolutionary Agents

and 1 other •

9 days ago

• 13

What I Learned Upscaling a Long-distance Midjourney Photo w/ Stable Diffusion PLUS unboxing Qwen Image & Wan 2.2

•

4 days ago

• 5

How I Built 7 Custom Gradio Components in Just 12 Days!

•

about 22 hours ago

• 5

View all

Blog, Articles, and discussions

TextQuests: How Good are LLMs at Text-Based Video Games?

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

AWorld Multi-Agent System Hits #1 on GAIA Leaderboard

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

The GPT-OSS models are here… and they’re energy-efficient!

Code a simple RAG from scratch

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

<p style="text-align:center;"> Bridging the Gap: Making Robotics Feel Like Machine Learning </p>

OpenAI just dropped two massive open-weight models — *but how do we separate the reality from the hype?*

Uncensor any LLM with abliteration

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning

From GRPO to DAPO and GSPO: What, Why, and How

What Open-Source Developers Need to Know about the EU AI Act's Rules for GPAI Models

The Missing Semester of AI for Organizations #1: LLM Security

Luth: Efficient French Specialization for Small Language Models

LLM agent experiment with a purpose-built RPG and tool calls. (Work in progress)

Introducing : 🤏🏻🏭SmolFactory

Towards Open Evolutionary Agents

What I Learned Upscaling a Long-distance Midjourney Photo w/ Stable Diffusion PLUS unboxing Qwen Image & Wan 2.2

How I Built 7 Custom Gradio Components in Just 12 Days!

Introducing AI Sheets: a tool to work with datasets using open AI models!

Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training

🇵🇭 FilBench - Can LLMs Understand and Generate Filipino?

Vision Language Model Alignment in TRL ⚡️

Welcome GPT OSS, the new open-source model family from OpenAI!

Build an AI Shopping Assistant with Gradio MCP Servers

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

Say hello to `hf`: a faster, friendlier Hugging Face CLI ✨

Parquet Content-Defined Chunking

TimeScope: How Long Can Your Video Large Multimodal Model Go?

Fast LoRA inference for Flux with Diffusers and PEFT

Arc Virtual Cell Challenge: A Primer

Consilium: When Multiple LLMs Collaborate

Back to The Future: Evaluating AI Agents on Predicting Future Events

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

AWorld Multi-Agent System Hits #1 on GAIA Leaderboard

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

The GPT-OSS models are here… and they’re energy-efficient!

Code a simple RAG from scratch

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

<p style="text-align:center;"> Bridging the Gap: Making Robotics Feel Like Machine Learning </p>

OpenAI just dropped two massive open-weight models — *but how do we separate the reality from the hype?*

Uncensor any LLM with abliteration

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning

From GRPO to DAPO and GSPO: What, Why, and How

What Open-Source Developers Need to Know about the EU AI Act's Rules for GPAI Models

The Missing Semester of AI for Organizations #1: LLM Security

Luth: Efficient French Specialization for Small Language Models

LLM agent experiment with a purpose-built RPG and tool calls. (Work in progress)

Introducing : 🤏🏻🏭SmolFactory

Towards Open Evolutionary Agents

What I Learned Upscaling a Long-distance Midjourney Photo w/ Stable Diffusion PLUS unboxing Qwen Image & Wan 2.2

How I Built 7 Custom Gradio Components in Just 12 Days!

OpenAI just dropped two massive open-weight models — but how do we separate the reality from the hype?

OpenAI just dropped two massive open-weight models — but how do we separate the reality from the hype?