Molmo2 Data Collection Artifacts for the Molmo2 data release β’ 16 items β’ Updated 15 days ago β’ 29
CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning Paper β’ 2512.02551 β’ Published Dec 2, 2025 β’ 12
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper β’ 2511.18538 β’ Published Nov 23, 2025 β’ 283
Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch Paper β’ 2512.02395 β’ Published Dec 2, 2025 β’ 47
FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions Paper β’ 2509.17177 β’ Published Sep 21, 2025 β’ 13
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper β’ 2511.06221 β’ Published Nov 9, 2025 β’ 132
Diffusion Language Models are Super Data Learners Paper β’ 2511.03276 β’ Published Nov 5, 2025 β’ 128
Emu3.5 Collection Native Multimodal Models are World Learners π β’ 4 items β’ Updated 14 days ago β’ 72
Emu3.5: Native Multimodal Models are World Learners Paper β’ 2510.26583 β’ Published Oct 30, 2025 β’ 108
Uniform Discrete Diffusion with Metric Path for Video Generation Paper β’ 2510.24717 β’ Published Oct 28, 2025 β’ 40
Reasoning Efficiency Research Collection Ultra-efficient reasoning model! SOTA Accuracy / CoT Length trade-offs β’ 3 items β’ Updated 15 days ago β’ 11
view article Article `LeRobotDataset:v3.0`: Bringing large-scale datasets to `lerobot` +9 Sep 16, 2025 β’ 47
Glyph: Scaling Context Windows via Visual-Text Compression Paper β’ 2510.17800 β’ Published Oct 20, 2025 β’ 67
CommonForms: A Large, Diverse Dataset for Form Field Detection Paper β’ 2509.16506 β’ Published Sep 20, 2025 β’ 19
The Ultimate Collection of Code Classifiers Collection π₯ 15 classifiers, 124M parameters, one per programming languageβ for assessing the educational value of GitHub code β’ 15 items β’ Updated May 5, 2025 β’ 15
EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling Paper β’ 2509.23909 β’ Published Sep 28, 2025 β’ 32
DataDecide Collection A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale. β’ 358 items β’ Updated 15 days ago β’ 21