Posts, articles, and discussions

A Guide for Open-Source Developers: Understanding the EU AI Act

Dec 2, 2024 • 47

Share Your Open-Source Datasets on the Hugging Face Hub

Nov 12, 2024 • 28

Welcome Stable Diffusion 3.5 Large to 🧨 Diffusers

Oct 22, 2024 • 54

Accelerate 1.0.0

Sep 13, 2024 • 53

Introduction to ggml

Aug 13, 2024 • 229

Memory-Efficient Diffusion Transformers with Quanto and Diffusers

Jul 30, 2024 • 67

TGI Multi-LoRA: Deploy Once, Serve 30 Models

Jul 18, 2024 • 59

From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate

Jun 13, 2024 • 55

Welcome Stable Diffusion 3 to 🧨 Diffusers

Jun 12, 2024 • 96

Benchmarking TGI

May 29, 2024 • 32

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28, 2024 • 240

Rapid AI App Development with Gradio's "Hot Reload" Mode

Apr 16, 2024 • 27

Vision Language Models Explained

Apr 11, 2024 • 430

A Complete Beginner's Guide to Hugging Face Transformers

Mar 22, 2024 • 86

Binary and Scalar Embedding Quantization for Significantly Faster and Cheaper Retrieval

Mar 22, 2024 guest • 100

Community Articles

What Open-Source Developers Need to Know about the EU AI Act's Rules for GPAI Models

and 5 others • 7 days ago • 23

AWorld Multi-Agent System Hits #1 on GAIA Leaderboard

3 days ago • 16

Towards Open Evolutionary Agents

and 1 other • 6 days ago • 12

The GPT-OSS models are here… and they’re energy-efficient!

3 days ago • 12

Code a simple RAG from scratch

Oct 29, 2024 • 147

Uncensor any LLM with abliteration

Jun 13, 2024 • 645

LLM agent experiment with a purpose-built RPG and tool calls. (Work in progress)

5 days ago • 7

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7 • 201

Introducing ColQwen-Omni: Retrieve in every modality

and 4 others • 25 days ago • 64

Introducing Command A Vision: Multimodal AI built for Business

and 3 others • 10 days ago • 61

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

May 7, 2024 • 93

Unsupervised Model Improvement via Internal Coherence Maximization: Outperforming Human-Supervised Methods Through Self-Elicitation

8 days ago • 5

Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning

1 day ago • 4

Measuring Open-Source Llama Nemotron Models on DeepResearch Bench

6 days ago • 4

From GRPO to DAPO and GSPO: What, Why, and How

1 day ago • 4

OpenAI just dropped two massive open-weight models — but how do we separate the reality from the hype?

and 2 others • 1 day ago • 4

Introduction to State Space Models (SSM)

Jul 19, 2024 • 160

ColPali: Efficient Document Retrieval with Vision Language Models 👀

Jul 5, 2024 • 286

G2P Shrinks Speech Models

Feb 5 • 65

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Feb 11 • 58
