What Open-Source Developers Need to Know about the EU AI Act's Rules for GPAI Models By yjernite and 5 others • 5 days ago • 21
Introducing Command A Vision: Multimodal AI built for Business By CohereLabs and 3 others • 9 days ago • 61
LLM agent experiment with a purpose-built RPG and tool calls. (Work in progress) By neph1 • 3 days ago • 6
Why We Built the OpenMDW License: A Comprehensive License for ML Models By linuxfoundation • Jul 2 • 23
makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch By AviSoori1x • May 7, 2024 • 93
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 199
Unsupervised Model Improvement via Internal Coherence Maximization: Outperforming Human-Supervised Methods Through Self-Elicitation By codelion • 6 days ago • 4
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment By NormalUhr • Feb 11 • 58