Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models By AI-MO and 17 others • 6 days ago • 37
Bourbaki (7b): SOTA 7B Algorithms for Putnam Bench (Part I: Reasoning MDPs) By hba123 and 2 others • 3 days ago • 9
We're open-sourcing "The Amazing Hand", a fully 3D printed robotic hand for less than $200 ✌️✌️✌️ By pollen-robotics and 2 others • 8 days ago • 21
5 Things You Need to Know About Moonshot AI and Kimi K2, the New #1 model on the Hub By fdaudens and 1 other • about 18 hours ago • 8
FineWeb-C: A Community-Driven Dataset for Educational Quality Annotations in 122 Languages By davanstrien and 5 others • 8 days ago • 26
Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face By dvgodoy • Feb 11 • 53
Seeing Isn’t Understanding: The Spatial Reasoning Gap in Vision-Language Models By KBayoud • 3 days ago • 6
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 184