Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper • 2506.01939 • Published Jun 2 • 177
Article The Transformers Library: standardizing model definitions By lysandre and 3 others • May 15 • 116
Article Vision Language Models (Better, Faster, Stronger) By merve and 4 others • May 12 • 503
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 43 items • Updated 1 day ago • 183
Article 🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It? By Kseniase • Mar 17 • 326
Article Rearchitecting Hugging Face Uploads and Downloads By jsulz and 2 others • Nov 26, 2024 • 48
Article From Files to Chunks: Improving Hugging Face Storage Efficiency By jsulz and 1 other • Nov 20, 2024 • 63
Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 204
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models Paper • 2501.09686 • Published Jan 16 • 41
Article Train 400x faster Static Embedding Models with Sentence Transformers By tomaarsen • Jan 15 • 204
Article Efficient LLM Pretraining: Packed Sequences and Masked Attention By sirluk • Oct 7, 2024 • 46
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 149
Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy By medmekk and 5 others • Sep 18, 2024 • 264