NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks
AWorld Multi-Agent System Hits #1 on GAIA Leaderboard
RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation
The GPT-OSS models are here… and they’re energy-efficient!
ChatML vs Harmony: Understanding the new Format from OpenAI 🔍
Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning
Code a simple RAG from scratch
Bridging the Gap: Making Robotics Feel Like Machine Learning
Announcing the Synthetic Online Conversations Dataset (SOC)
Uncensor any LLM with abliteration
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge
OpenAI just dropped two massive open-weight models — *but how do we separate the reality from the hype?*
From GRPO to DAPO and GSPO: What, Why, and How
Luth: Efficient French Specialization for Small Language Models
What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware
The Missing Semester of AI for Organizations #1: LLM Security
Introducing : 🤏🏻🏭SmolFactory
What I Learned Upscaling a Long-distance Midjourney Photo w/ Stable Diffusion PLUS unboxing Qwen Image & Wan 2.2
How I Built 7 Custom Gradio Components in Just 12 Days!
How to Run a Hugging Face Model in JAX (Part 3)