NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks
By
and 4 others
•
•
49RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation
By
and 9 others
•
•
20ChatML vs Harmony: Understanding the new Format from OpenAI 🔍
By
•
•
13How To Build a News Agent with GPT-OSS, Hugging Face Inference & Gradio
By
•
•
12Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning
By
•
•
12<p style="text-align:center;"> Bridging the Gap: Making Robotics Feel Like Machine Learning </p>
By
•
•
10OpenAI just dropped two massive open-weight models — *but how do we separate the reality from the hype?*
By
and 2 others
•
•
10Announcing the Synthetic Online Conversations Dataset (SOC)
By
•
•
10From GRPO to DAPO and GSPO: What, Why, and How
By
•
•
9Uncensor any LLM with abliteration
By
•
•
650Code a simple RAG from scratch
By
•
•
154AWorld Multi-Agent System Hits #1 on GAIA Leaderboard
By
•
•
23Luth: Efficient French Specialization for Small Language Models
By
and 1 other
•
•
8DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge
By
•
•
206The Missing Semester of AI for Organizations #1: LLM Security
By
•
•
8The GPT-OSS models are here… and they’re energy-efficient!
By
•
•
16What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware
By
•
•
7Kimina-Prover-RL
By
and 18 others
•
•
6KV Caching Explained: Optimizing Transformer Inference Efficiency
By
•
•
113Introducing : 🤏🏻🏭SmolFactory
By
•
•
5