Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models. By tiiuae and 9 others • 6 days ago • 32
NVIDIA Cosmos Now Available On Hugging Face For Physical AI Reasoning By PranjaliJoshi and 1 other • 2 days ago • 21
Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance By tiiuae and 5 others • about 15 hours ago • 20
NH Prediction: Advanced AI System for Korean Agricultural Price Forecasting Research By openfree • about 13 hours ago • 16
Falcon-Arabic: A Breakthrough in Arabic Language Models By tiiuae and 7 others • about 15 hours ago • 12
Building an Open Ecosystem for Time Series Forecasting: Introducing TimesFM in Hugging Face By Nutanix and 1 other • 2 days ago • 9
Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs By davidberenstein1957 and 1 other • 15 days ago • 28
🥬 LettuceDetect Goes Multilingual: Fine-tuning EuroBERT on Synthetic Translations By adaamko and 1 other • 2 days ago • 6
OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve By codelion • 1 day ago • 6
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 138
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment By NormalUhr • Feb 11 • 37
Highlights from the First ICLR 2025 Watermarking Workshop By hadyelsahar and 4 others • 7 days ago • 9
Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models. By tiiuae and 9 others • 6 days ago • 32
NVIDIA Cosmos Now Available On Hugging Face For Physical AI Reasoning By PranjaliJoshi and 1 other • 2 days ago • 21
Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance By tiiuae and 5 others • about 15 hours ago • 20
NH Prediction: Advanced AI System for Korean Agricultural Price Forecasting Research By openfree • about 13 hours ago • 16
Falcon-Arabic: A Breakthrough in Arabic Language Models By tiiuae and 7 others • about 15 hours ago • 12
Building an Open Ecosystem for Time Series Forecasting: Introducing TimesFM in Hugging Face By Nutanix and 1 other • 2 days ago • 9
Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs By davidberenstein1957 and 1 other • 15 days ago • 28
🥬 LettuceDetect Goes Multilingual: Fine-tuning EuroBERT on Synthetic Translations By adaamko and 1 other • 2 days ago • 6
OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve By codelion • 1 day ago • 6
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 138
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment By NormalUhr • Feb 11 • 37
Highlights from the First ICLR 2025 Watermarking Workshop By hadyelsahar and 4 others • 7 days ago • 9