LoRA and full fine-tuning experiments on R1 distills to generate Python code for math problems
Ram PRO
0-hero
AI & ML interests
All work on this profile is personal
Recent Activity
published a model 3 days ago: 0-hero/r1-7b-grpo-full
published a model 3 days ago: 0-hero/R1-7B-MATH-GRPO-FULL
published a model 3 days ago: 0-hero/r1-7B-grpo-80
Organizations
Collections 5
models 49
0-hero/r1-7B-grpo-v3.3-epoch-3
0-hero/r1-7B-grpo-v3.3-epoch-2
0-hero/r1-7B-grpo-v3.3-epoch-1
0-hero/r1-7B-grpo-v3.2-epoch-2
0-hero/r1-7B-grpo-v3.2-epoch-1
0-hero/r1-14B-grpo-v3.1-epoch-2
0-hero/r1-14B-grpo-v3.1-epoch-1 • 1
0-hero/r1-7B-grpo-v3.1-epoch-3
0-hero/r1-7B-grpo-v3.1-epoch-2 • 2
0-hero/r1-7B-grpo-v2-temp-1.0-60
datasets 14
0-hero/MATH • 331k • 19
0-hero/audio-samples-fixed • 10 • 1.19k
0-hero/distilabel-math-preference-dpo • 2.42k • 26
0-hero/lj_speech_with_spectogram_conversations • 13.1k • 26 • 1
0-hero/lj_speech_with_spectogram • 13.1k • 67 • 1
0-hero/Matter-0.2-alpha • 2.52M • 45 • 3
0-hero/Matter-0.1 • 2.25M • 53 • 53
0-hero/Matter-0.1-Slim-D • 1.32M • 35
0-hero/Matter-0.1-Slim-C • 343k • 25
0-hero/Matter-0.1-Slim-B • 308k • 55 • 1