What Open-Source Developers Need to Know about the EU AI Act's Rules for GPAI Models By yjernite and 5 others • 7 days ago • 23
LLM agent experiment with a purpose-built RPG and tool calls. (Work in progress) By neph1 • 5 days ago • 7
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 201
Introducing Command A Vision: Multimodal AI built for Business By CohereLabs and 3 others • 10 days ago • 61
makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch By AviSoori1x • May 7, 2024 • 93
Unsupervised Model Improvement via Internal Coherence Maximization: Outperforming Human-Supervised Methods Through Self-Elicitation By codelion • 8 days ago • 5
Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning By codelion • 1 day ago • 4
OpenAI just dropped two massive open-weight models — *but how do we separate the reality from the hype?* By stefanwebb and 2 others • 1 day ago • 4
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment By NormalUhr • Feb 11 • 58
What Open-Source Developers Need to Know about the EU AI Act's Rules for GPAI Models By yjernite and 5 others • 7 days ago • 23
LLM agent experiment with a purpose-built RPG and tool calls. (Work in progress) By neph1 • 5 days ago • 7
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 201
Introducing Command A Vision: Multimodal AI built for Business By CohereLabs and 3 others • 10 days ago • 61
makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch By AviSoori1x • May 7, 2024 • 93
Unsupervised Model Improvement via Internal Coherence Maximization: Outperforming Human-Supervised Methods Through Self-Elicitation By codelion • 8 days ago • 5
Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning By codelion • 1 day ago • 4
OpenAI just dropped two massive open-weight models — *but how do we separate the reality from the hype?* By stefanwebb and 2 others • 1 day ago • 4
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment By NormalUhr • Feb 11 • 58