Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
174.4
TFLOPS
3
15
93
rasdani
PRO
rasdani
Follow
johannhartmann's profile picture
consdi's profile picture
jachermann's profile picture
19 followers
·
71 following
rasdani_
rasdani
rasdani
AI & ML interests
None yet
Recent Activity
updated
a collection
about 11 hours ago
PRIME-RL
updated
a dataset
about 11 hours ago
PrimeIntellect/deepcoder-gold-standard-solutions
published
a dataset
about 11 hours ago
PrimeIntellect/deepcoder-gold-standard-solutions
View all activity
Organizations
rasdani
's models
37
Sort: Recently updated
rasdani/deepseek_r1_qwen14b_swe_rl_8k
15B
•
Updated
Jul 12
•
2
rasdani/deepseek_r1_llama_8b_swe_rl_8k_12_epochs
8B
•
Updated
Jul 10
•
7
rasdani/qwen3_8b_swe_rl_8k
8B
•
Updated
Jul 7
•
4
rasdani/deepseek_r1_7b_gh_patches_2k_fixed_reward
8B
•
Updated
Jun 29
•
5
rasdani/deepseek_r1_7b_gh_patches_2k
8B
•
Updated
Jun 28
•
4
rasdani/crux-eval_math-eval-logs
Updated
Jun 25
rasdani/git-diff-Qwen-4B-10k
4B
•
Updated
Jun 25
•
4
rasdani/git-diff-Qwen-4B-10k-checkpoints
Updated
Jun 25
rasdani/git-diff-Qwen-4B-32k-checkpoints
Updated
Jun 23
rasdani/git-diff-Qwen-4B-30k
4B
•
Updated
Jun 22
•
6
rasdani/git-diff-Qwen-4B
4B
•
Updated
Jun 17
•
6
rasdani/git-diff-Qwen-1.7B
2B
•
Updated
Jun 16
•
6
rasdani/git-diff-Qwen-1.7-B
2B
•
Updated
Jun 16
•
6
rasdani/simple-math-Qwen-1.5B
2B
•
Updated
Jun 15
•
4
rasdani/qwen3_0_6b_function_rm
0.8B
•
Updated
May 22
•
3
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-8192k
0.5B
•
Updated
Apr 8
•
3
rasdani/Qwen2.5-0.5B-simpleRL-Zoo
Text Generation
•
0.5B
•
Updated
Apr 6
•
2
rasdani/smolR1-Qwen2.5-0.5B
Text Generation
•
0.5B
•
Updated
Mar 31
•
3
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-no-KL
Updated
Mar 31
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-3072k
Updated
Mar 31
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-4096k
Updated
Mar 31
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-2560k
Updated
Mar 31
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-2048k
Updated
Mar 31
rasdani/Qwen2.5-0.5B-simpleRL-Zoo-first-try
0.5B
•
Updated
Mar 29
•
5
rasdani/Qwen-1.5B-Distill-GRPO
Text Generation
•
2B
•
Updated
Mar 28
•
4
rasdani/Qwen-0.5B-Instruct-GRPO
Updated
Mar 27
rasdani/gsm8k_qwen2.5-0.5b
0.5B
•
Updated
Mar 11
•
2
rasdani/Qwen2.5-1.5B-Open-R1-Code-GRPO
Updated
Mar 9
rasdani/Qwen2.5-0.5B-Open-R1-Code-GRPO
Text Generation
•
0.6B
•
Updated
Mar 8
•
3
rasdani/Qwen2.5-7B-Instruct-GRPO-unsloth
Text Generation
•
8B
•
Updated
Mar 2
•
7
Previous
1
2
Next