Jamba 1.7 Collection The AI21 Jamba family of models are hybrid SSM-Transformer foundation models, blending speed, efficient long context processing, and accuracy. • 4 items • Updated Jul 2 • 12
Article Fast LoRA inference for Flux with Diffusers and PEFT By sayakpaul and 1 other • 20 days ago • 42
Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • Jul 8 • 618
Article Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face By mhillsmith and 2 others • May 3, 2024 • 14
Article nanoVLM: The simplest repository to train your VLM in pure PyTorch By ariG23498 and 6 others • May 21 • 202
Radial Attention: O(n log n) Sparse Attention with Energy Decay for Long Video Generation Paper • 2506.19852 • Published Jun 24 • 41
ERNIE 4.5 Collection Collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 25 items • Updated Jul 11 • 157
Align Your Flow: Scaling Continuous-Time Flow Map Distillation Paper • 2506.14603 • Published Jun 17 • 20
Discrete Diffusion LLM & MLLM Collection A collection of research/models in discrete diffusion large language and multimodal models • 57 items • Updated Jun 17 • 4
FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities Paper • 2505.20147 • Published May 26 • 1
Discrete Diffusion in Large Language and Multimodal Models: A Survey Paper • 2506.13759 • Published Jun 16 • 42
UIGEN-T3 HYBRID UI TAILWIND MODEL Collection UI BASED REASONING WITH COMPONENTS AND FULL WEBPAGES • 10 items • Updated 25 days ago • 6
SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents Paper • 2505.20411 • Published May 26 • 88
Alchemist: Turning Public Text-to-Image Data into Generative Gold Paper • 2505.19297 • Published May 25 • 83
Article Exploring Quantization Backends in Diffusers By derekl35 and 2 others • May 21 • 39
OpenCodeReasoning Collection Reasoning data for supervised finetuning of LLMs to advance data distillation for competitive coding • 10 items • Updated 21 days ago • 17