Common Pitfalls in Sharing Open Source Models on Hugging Face (and How to Dodge Them) By FriendliAI and 2 others • 4 days ago • 21
Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub By nvidia and 10 others • 8 days ago • 23
Why We Built the OpenMDW License: A Comprehensive License for ML Models By linuxfoundation • 3 days ago • 10
Should We Still Pretrain Encoders with Masked Language Modeling? By Nicolas-BZRD and 3 others • 3 days ago • 9
LLMs recognise bias but also reproduce harmful stereotypes: an analysis of bias in leading LLMs By davidberenstein1957 and 3 others • 4 days ago • 7
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 175