General Optimization Problems [ RPO ] for Reasoning ! Collection Robust Policy Optimization (RPO) • 6 items • Updated 15 days ago • 1
MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining Paper • 2505.07608 • Published 1 day ago • 53
Qwen_3 GGUF Collection Upto Full precision (float32) version • 5 items • Updated about 16 hours ago • 1
Qwen3 Moderate Behavioral Flexibility Collection moderately abliterated and improved context awareness and moderate behavioral flexibility variant of Qwen3. • 6 items • Updated 13 days ago • 1
Content Filters SigLIP2/ViT Collection Moderation, Balance, Classifiers • 6 items • Updated 5 days ago • 2
Messy OCR and Document retrieval Collection DocVQA, RealWorldQA, MTVQA • 2 items • Updated 10 days ago • 1
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play Paper • 2505.02707 • Published 8 days ago • 79