Blog, Articles, and discussions

Community Articles

view all

Introducing MTEB v2: Evaluation of embedding and retrieval systems for more than just text

and 2 others •

3 days ago

• 28

AI for Food Allergies

and 3 others •

7 days ago

• 27

LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR

and 2 others •

about 8 hours ago

• 22

Introducing the Massive Legal Embedding Benchmark (MLEB)

and 2 others •

7 days ago

• 17

How I Built Lightning-Fast Vector Search for Legal Documents

•

4 days ago

• 14

Art of Focus: Page-Aware Sparse Attention and Ling 2.0’s Quest for Efficient Context Length Scaling

and 19 others •

3 days ago

• 14

Australian-made LLM beats OpenAI and Google at legal retrieval

and 2 others •

about 23 hours ago

• 13

GSMA Open-Telco LLM Benchmarks 2.0: The first dedicated LLM Evaluation for Telecoms

and 15 others •

4 days ago

• 12

Announcing Hugging Face Fundamentals: A New Learning Track on DataCamp

•

7 days ago

• 21

Scaling Test-Time Compute to Achieve Gold Medal at IOI 2025 with Open-Weight Models

and 3 others •

3 days ago

• 11

Promoter-GPT: Writing DNA Instructions with Language Models

•

1 day ago

• 9

Llama‑Embed‑Nemotron‑8B Text Embedding Model Ranks First on Multilingual MTEB Leaderboard

and 4 others •

2 days ago

• 8

There is no such thing as a tokenizer-free lunch

•

29 days ago

• 83

Nemotron’s Open Secret: Accelerating AI Development with Open Models, Data, and Recipes

and 1 other •

1 day ago

• 6

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 238

How Financial News Can Be Used to Train Good Financial Models

•

15 days ago

• 7

Understanding Vector Quantization in VQ-VAE

•

Aug 28, 2024

• 48

partnershipsgraphcoreguide

Getting Started with Hugging Face Transformers for IPUs with Optimum

November 30, 2021

research

Introducing the Data Measurements Tool: an Interactive Tool for Looking at Datasets

November 29, 2021

guide

Accelerating PyTorch distributed fine-tuning with Intel technologies

November 19, 2021

guideaudio

Fine-tuning XLS-R for Multi-Lingual ASR with 🤗 Transformers

November 15, 2021

partnershipsintelguide

Scaling up BERT-like model Inference on modern CPU - Part 2

November 4, 2021

analysisnlp

Large Language Models: A New Moore's Law?

October 26, 2021

communitynlp

Course Launch Community Event

October 26, 2021

communitynlp

Train a Sentence Embedding Model with 1B Training Pairs

October 25, 2021

analysis

The Age of Machine Learning As Code Has Arrived

October 20, 2021

communitycvnlp

Fine tuning CLIP with Remote Sensing (Satellite) images and captions

October 13, 2021

guide

Hosting your Models and Datasets on Hugging Face Spaces using Streamlit

October 5, 2021

guide

Showcase Your Projects in Spaces using Gradio

October 5, 2021

community

Summer at Hugging Face ☀️

September 24, 2021

guide

Introducing Optimum: The Optimization Toolkit for Transformers at Scale

September 14, 2021

Community Articles

Introducing MTEB v2: Evaluation of embedding and retrieval systems for more than just text

and 2 others •

3 days ago

• 28

AI for Food Allergies

and 3 others •

7 days ago

• 27

LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR

and 2 others •

about 8 hours ago

• 22

Introducing the Massive Legal Embedding Benchmark (MLEB)

and 2 others •

7 days ago

• 17

How I Built Lightning-Fast Vector Search for Legal Documents

•

4 days ago

• 14

Art of Focus: Page-Aware Sparse Attention and Ling 2.0’s Quest for Efficient Context Length Scaling

and 19 others •

3 days ago

• 14

Australian-made LLM beats OpenAI and Google at legal retrieval

and 2 others •

about 23 hours ago

• 13

GSMA Open-Telco LLM Benchmarks 2.0: The first dedicated LLM Evaluation for Telecoms

and 15 others •

4 days ago

• 12

Announcing Hugging Face Fundamentals: A New Learning Track on DataCamp

•

7 days ago

• 21

Scaling Test-Time Compute to Achieve Gold Medal at IOI 2025 with Open-Weight Models

and 3 others •

3 days ago

• 11

Promoter-GPT: Writing DNA Instructions with Language Models

•

1 day ago

• 9

Llama‑Embed‑Nemotron‑8B Text Embedding Model Ranks First on Multilingual MTEB Leaderboard

and 4 others •

2 days ago

• 8

There is no such thing as a tokenizer-free lunch

•

29 days ago

• 83

Nemotron’s Open Secret: Accelerating AI Development with Open Models, Data, and Recipes

and 1 other •

1 day ago

• 6

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 700

Code a simple RAG from scratch

•

Oct 29, 2024

• 223

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

Jan 30

• 148

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 238

How Financial News Can Be Used to Train Good Financial Models

•

15 days ago

• 7

Understanding Vector Quantization in VQ-VAE

•

Aug 28, 2024

• 48

View all