functiongemma fine-tuned on mobile-actions Collection A collection of functiongemma-270m-it models fine-tuned on mobile actions dataset for Spanish, French and Italian • 3 items • Updated 4 days ago • 3
view article Article Aligning to What? Rethinking Agent Generalization in MiniMax M2 Oct 30, 2025 • 40
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published Nov 27, 2025 • 221
Olmo 3 Post-training Collection All artifacts for post-training Olmo 3. Datasets follow the model that resulted from training on them. • 32 items • Updated 12 days ago • 46
Black-Box On-Policy Distillation of Large Language Models Paper • 2511.10643 • Published Nov 13, 2025 • 49
view article Article How to train a new language model from scratch using Transformers and Tokenizers Feb 14, 2020 • 56
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models Paper • 2510.04618 • Published Oct 6, 2025 • 127
view article Article Luth: Efficient French Specialization for Small Language Models Aug 11, 2025 • 18
Fantastic (small) Retrievers and How to Train Them: mxbai-edge-colbert-v0 Tech Report Paper • 2510.14880 • Published Oct 16, 2025 • 18
view article Article Model statistics of the 50 most downloaded entities on Hugging Face Oct 13, 2025 • 37