Introducing MTEB v2: Evaluation of embedding and retrieval systems for more than just text By isaacchung and 2 others • 3 days ago • 28
LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR By lightonai and 2 others • about 8 hours ago • 22
Art of Focus: Page-Aware Sparse Attention and Ling 2.0’s Quest for Efficient Context Length Scaling By RichardBian and 19 others • 3 days ago • 14
Australian-made LLM beats OpenAI and Google at legal retrieval By isaacus and 2 others • about 23 hours ago • 13
GSMA Open-Telco LLM Benchmarks 2.0: The first dedicated LLM Evaluation for Telecoms By otellm and 15 others • 4 days ago • 12
Announcing Hugging Face Fundamentals: A New Learning Track on DataCamp By huggingface • 7 days ago • 21
Scaling Test-Time Compute to Achieve Gold Medal at IOI 2025 with Open-Weight Models By nvidia and 3 others • 3 days ago • 11
Llama‑Embed‑Nemotron‑8B Text Embedding Model Ranks First on Multilingual MTEB Leaderboard By nvidia and 4 others • 2 days ago • 8
Nemotron’s Open Secret: Accelerating AI Development with Open Models, Data, and Recipes By nvidia and 1 other • 1 day ago • 6
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 238