DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 170
Accelerating AI for Drug Discovery: Ginkgo’s GDPx Functional Genomics and GDPa Antibody Developability Dataset Series By cgeorgiaw and 1 other • 4 days ago • 11
The Anthropic Ruling: Why AI Training Just Got Legal (But Piracy Didn't) By fdaudens • 4 days ago • 9
Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub By nvidia and 10 others • about 13 hours ago • 8
Adaptive Classifier: Dynamic Text Classification with Continuous Learning By codelion • 8 days ago • 12
Automated Discovery of High-Performance GPU Kernels with OpenEvolve By codelion • about 11 hours ago • 5