view article Article The Transformers Library: standardizing model definitions By lysandre and 3 others β’ May 15 β’ 117
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper β’ 2502.11089 β’ Published Feb 16 β’ 165
view article Article MIEB: The Benchmark That Stress-Tests Image-Text Embeddings Like Never Before By isaacchung and 2 others β’ Apr 24 β’ 14
Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking Paper β’ 2405.07920 β’ Published May 13, 2024 β’ 2
Command Models Collection Latest Cohere Labs Command models β’ 9 items β’ Updated about 6 hours ago β’ 26
EuroBERT Collection Scaling Multilingual Encoders for European Languages β’ 4 items β’ Updated Mar 10 β’ 13
view article Article π¦Έπ»#14: What Is MCP, and Why Is Everyone β Suddenly!β Talking About It? By Kseniase β’ Mar 17 β’ 327
view article Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model By EuroBERT and 3 others β’ Mar 10 β’ 146
view article Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality By saurabhdash and 3 others β’ Mar 4 β’ 75
Multilingual LLM Evaluation Collection Multilingual Evaluation Benchmarks β’ 8 items β’ Updated 14 days ago β’ 25
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling β’ 3 items β’ Updated Dec 19, 2024 β’ 149
view article Article FastRTC: The Real-Time Communication Library for Python By freddyaboulton and 1 other β’ Feb 25 β’ 172
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 β’ 11 items β’ Updated 24 days ago β’ 527
view article Article Yay! Organizations can now publish blog Articles By huggingface and 3 others β’ Jan 20 β’ 48
view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference By mfuntowicz and 1 other β’ Jan 16 β’ 75
view article Article Announcing NVIDIA Cosmos World Foundation Models By mingyuliutw and 1 other β’ Jan 7 β’ 26
The Perfect Blend: Redefining RLHF with Mixture of Judges Paper β’ 2409.20370 β’ Published Sep 30, 2024 β’ 5