Blog, Articles, and discussions

Training and Finetuning Sparse Embedding Models with Sentence Transformers v5

By July 1, 2025 • 74

Community Articles

Bringing Fusion Down to Earth: ML for Stellarator Optimization

•

4 days ago

• 56

Teaching Data Literacy with Hugging Face's AI Sheets

•

6 days ago

• 23

Common Pitfalls in Sharing Open Source Models on Hugging Face (and How to Dodge Them)

and 2 others •

4 days ago

• 21

LLMs recognise bias but also reproduce harmful stereotypes: an analysis of bias in leading LLMs

and 3 others •

4 days ago

• 7

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

•

Mar 17

• 303

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 175

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 621

ColPali: Efficient Document Retrieval with Vision Language Models 👀

•

Jul 5, 2024

• 270

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

•

Jul 29, 2024

• 347

Code a simple RAG from scratch

•

Oct 29, 2024

• 116

Mastering Tensor Dimensions in Transformers

•

Jan 12

• 77

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

and 1 other •

14 days ago

• 55

⚡ nano-vLLM: Lightweight, Low-Latency LLM Inference from Scratch

•

7 days ago

• 6

Training a language model with 🤗 Transformers using TensorFlow and TPUs

By April 27, 2023

How to host a Unity game in a Space

By April 21, 2023 • 5

Graph Classification with Transformers

By April 14, 2023 • 4

Accelerating Stable Diffusion Inference on Intel CPUs

By March 28, 2023 • 2

Federated Learning using Hugging Face and Flower

By March 27, 2023 guest

Train your ControlNet with diffusers

By March 24, 2023 • 34

Multivariate Probabilistic Time Series Forecasting with Informer

By March 10, 2023 • 21

New ViT and ALIGN Models From Kakao Brain

By March 6, 2023 • 3

Zero-shot image-to-text generation with BLIP-2

By February 15, 2023 • 21

🤗 PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware

By February 10, 2023 • 86

Speech Synthesis, Recognition, and More With SpeechT5

By February 8, 2023 • 12

Generating Stories: AI for Game Development #5

By February 7, 2023

Accelerating PyTorch Transformers with Intel Sapphire Rapids, part 2

By February 6, 2023

A Dive into Pretraining Strategies for Vision-Language Models

By February 3, 2023 • 69

Community Articles

Understanding Gemma 3n: How MatFormer Gives You Many Models in One

•

9 days ago

• 21

Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub

and 10 others •

8 days ago

• 23

Why We Built the OpenMDW License: A Comprehensive License for ML Models

•

3 days ago

• 10

IFAD AI Benchmark (Garden V1)

and 8 others •

5 days ago

• 9

Automated Discovery of High-Performance GPU Kernels with OpenEvolve

•

8 days ago

• 16

Should We Still Pretrain Encoders with Masked Language Modeling?

and 3 others •

3 days ago

• 9

How Much Power does a SOTA Open Video Model Use? ⚡🎥

and 2 others •

3 days ago

• 8
