CoTox: Chain-of-Thought-Based Molecular Toxicity Reasoning and Prediction Paper • 2508.03159 • Published 5 days ago • 20
Med-PRM Collection This collection hosts Med-PRM series introduced in paper, Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards • 6 items • Updated 30 days ago
Outlier-Safe Pre-Training (OSP) Collection A collection of ablation and final models trained on the Outlier-Safe Pre-Training (OSP) framework. • 11 items • Updated Jun 26 • 3
Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models Paper • 2506.19697 • Published Jun 24 • 44