view article Article SynthVision: Building a 110K Synthetic Medical VQA Dataset with Cross-Model Validation 8 days ago • 14
Self-Improving Pretraining: using post-trained models to pretrain better models Paper • 2601.21343 • Published Jan 29 • 18
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 22 days ago • 82
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability Paper • 2601.18778 • Published Jan 26 • 41