Article How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs • 9 days ago • 53
Article Welcome Gemma 4: Frontier multimodal intelligence on device • 14 days ago • 850
LeWM Collection Official checkpoints and datasets related to LeWM paper. • 9 items • Updated 20 days ago • 23
Article SynthVision: Building a 110K Synthetic Medical VQA Dataset with Cross-Model Validation • 24 days ago • 16
Article Raw Robot Video to VLA-Ready Training Data: Annotating LeRobot Datasets with Nomadic and HuggingFace Buckets • 26 days ago • 17
Nemotron-Cascade 2 Collection Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated 1 day ago • 48
Article A framework and leaderboard for Retrieval Pipelines evaluation on ViDoRe v3 • Feb 27 • 12