Data Mining and Information Systems Lab
dmis-lab
Med-PRM
This collection hosts the Med-PRM series introduced in the paper "Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards".
dmis-lab/llama-3.1-medprm-reward-v1.0
Text Generation • 8B • Updated • 72 • 14
dmis-lab/llama-3.1-medprm-reward-raw-training-set
Viewer • Updated • 11.7k • 22
dmis-lab/llama-3.1-medprm-reward-training-set
Viewer • Updated • 11.7k • 166 • 5
dmis-lab/llama-3.1-medprm-reward-raw-test-set
Viewer • Updated • 5.47k • 16
Outlier-Safe Pre-Training (OSP)
A collection of ablation and final models trained on the Outlier-Safe Pre-Training (OSP) framework.
Models (54)
dmis-lab/OSP-1.4B-100B-Shampoo-SSNorm-EmbProj
1B • Updated • 9 • 4
dmis-lab/OSP-1.4B-100B-Shampoo-SSNorm
1B • Updated • 12 • 3
dmis-lab/OSP-1.4B-100B-Muon-SSNorm-EmbProj
1B • Updated • 12 • 4
dmis-lab/OSP-1.4B-100B-Muon-EmbProj
1B • Updated • 9 • 3
dmis-lab/OSP-1.4B-100B-Muon-SSNorm
1B • Updated • 11 • 3
dmis-lab/OSP-1.4B-100B-Muon-Only
1B • Updated • 11 • 3
dmis-lab/OSP-1.4B-100B-Muon
1B • Updated • 8 • 3
dmis-lab/OSP-1.4B-100B-Adam
1B • Updated • 15 • 3
dmis-lab/OSP-1.4B-1T-Muon-SSNorm-EmbProj
1B • Updated • 10 • 4
dmis-lab/OSP-1.4B-1T-Adam
1B • Updated • 14 • 3
Datasets (10)
dmis-lab/llama-3.1-medprm-reward-raw-test-set
Viewer • Updated • 5.47k • 16
dmis-lab/llama-3.1-medprm-reward-raw-training-set
Viewer • Updated • 11.7k • 22
dmis-lab/llama-3.1-medprm-reward-test-set
Updated • 25 • 2
dmis-lab/llama-3.1-medprm-reward-training-set
Viewer • Updated • 11.7k • 166 • 5
dmis-lab/TemporalHead
Viewer • Updated • 11 • 242 • 1
dmis-lab/meerkat-instructions
Viewer • Updated • 440k • 169 • 6
dmis-lab/RF-Collection
Preview • Updated • 62 • 1
dmis-lab/ChroKnowBench
Preview • Updated • 81 • 7
dmis-lab/ETHIC
Viewer • Updated • 1.99k • 126 • 7
dmis-lab/MedLFQA
Viewer • Updated • 4.95k • 49 • 16