Article Building Moon Bot: A Slack-Native Coding Agent Backed by HuggingFace Buckets • huggingface • 10 days ago • 46
🤏 Smol-Data Collection Tried and tested mixes for strong pretraining. Inspired by https://huggingface.co/blog/codelion/optimal-dataset-mixing • 14 items • Updated Mar 2 • 13
Parallel Loop Transformer for Efficient Test-Time Computation Scaling Paper • 2510.24824 • Published Oct 28, 2025 • 18
MuPT: A Generative Symbolic Music Pretrained Transformer Paper • 2404.06393 • Published Apr 9, 2024 • 16
Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries • aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 167
Article We Got Claude to Fine-Tune an Open Source LLM • burtenshaw, evalstate • Dec 4, 2025 • 629
Article Training Design for Text-to-Image Models: Lessons from Ablations • Photoroom • Feb 3 • 77
Article H Company's new Holo2 model takes the lead in UI Localization • Hcompany • Feb 3 • 7
Article We Got Claude to Build CUDA Kernels and teach open models! • burtenshaw, evalstate, merve, pcuenq • Jan 28 • 158
Article Introducing Daggr: Chain apps programmatically, inspect visually • merve, ysharma, abidlabs, hysts, pcuenq • Jan 29 • 107
Article DeepMath: A lightweight math reasoning Agent with smolagents • danf, mber, moshew • Dec 4, 2025 • 40
Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Recognition Evaluation Paper • 2510.06961 • Published Oct 8, 2025 • 13
TabICL: A Tabular Foundation Model for In-Context Learning on Large Data Paper • 2502.05564 • Published Feb 8, 2025 • 2
TabPFN-2.5: Advancing the State of the Art in Tabular Foundation Models Paper • 2511.08667 • Published Nov 11, 2025 • 6