Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
7
2
Kaiwen Wang
kaiwenw
Follow
0 followers
·
2 following
https://kaiwenw.github.io/
kaiwenw_ai
kaiwenw
AI & ML interests
Reinforcement Learning
Recent Activity
updated
a model
19 days ago
kaiwenw/single_node_run2-step-12170
published
a model
19 days ago
kaiwenw/single_node_run2-step-12170
updated
a model
19 days ago
kaiwenw/single_node_run2-step-12150
View all activity
Organizations
kaiwenw
's datasets
220
Sort: Recently updated
kaiwenw/open_r1_apr9_round1_combined_balanced
Viewer
•
Updated
Apr 14
•
49.4k
•
4
kaiwenw/open_r1_apr9_round1_combined_random
Viewer
•
Updated
Apr 14
•
49.4k
•
62
kaiwenw/open_r1_apr9_DeepSeek_R1_Distill_Qwen_32B_tokenized
Viewer
•
Updated
Apr 13
•
49.4k
•
72
kaiwenw/open_r1_apr9_DeepSeek_R1_Distill_Qwen_14B_tokenized
Viewer
•
Updated
Apr 11
•
49.4k
•
2
kaiwenw/open_r1_apr9_DeepSeek_R1_Distill_Qwen_7B_tokenized
Viewer
•
Updated
Apr 11
•
49.4k
•
45
kaiwenw/open_r1_apr9_DeepSeek_R1_Distill_Qwen_1.5B_tokenized
Viewer
•
Updated
Apr 11
•
49.4k
•
94
kaiwenw/open_r1_apr9
Viewer
•
Updated
Apr 9
•
49.4k
•
25
kaiwenw/combine_1.5B_7B_and_32B
Viewer
•
Updated
Apr 4
•
49.5k
•
21
kaiwenw/combine_1.5B_and_blockwise
Viewer
•
Updated
Apr 4
•
49.5k
•
114
kaiwenw/open_r1_mar2_DeepSeek_R1_Distill_Qwen_1.5B_tokenized
Viewer
•
Updated
Apr 4
•
49.5k
•
3
kaiwenw/open_r1_mar2_DeepSeek_R1_Distill_Qwen_32B_tokenized
Viewer
•
Updated
Apr 3
•
49.5k
•
3
kaiwenw/open_r1_mar2_mar20_1.5b_n_4_nl_8_tokenized
Viewer
•
Updated
Apr 3
•
49.5k
•
5
kaiwenw/open_r1_mar2_DeepSeek_R1_Distill_Qwen_7B_tokenized
Viewer
•
Updated
Mar 30
•
49.5k
•
5
kaiwenw/open_r1_mar2_round_1_tokenized
Viewer
•
Updated
Mar 4
•
49.5k
•
27
kaiwenw/open_r1_mar2_round_1
Viewer
•
Updated
Mar 3
•
45.3k
•
17
kaiwenw/open_r1_mar2
Viewer
•
Updated
Mar 2
•
49.5k
•
19
kaiwenw/verified_open_r1
Viewer
•
Updated
Feb 20
•
58.1k
•
27
kaiwenw/test_open_r1
Viewer
•
Updated
Feb 20
•
1k
•
31
kaiwenw/aft_after_jaft_test
Viewer
•
Updated
Jan 13
•
1.41k
•
22
kaiwenw/dec9_sp1_repeat_5_pref_jdpo_75_chosen_25_reject
Viewer
•
Updated
Dec 10, 2024
•
14.1k
•
20
kaiwenw/dec9_sp1_repeat_5_pref_jdpo_25_chosen_75_reject
Viewer
•
Updated
Dec 10, 2024
•
18.6k
•
23
kaiwenw/dec9_sp1_repeat_5_pref_jdpo_50_chosen_50_reject
Viewer
•
Updated
Dec 10, 2024
•
37.9k
•
26
kaiwenw/dec9_sp1_repeat_5_pref_jdpo_all_reject_first
Viewer
•
Updated
Dec 10, 2024
•
26.7k
•
20
kaiwenw/dec9_sp1_repeat_5_pref_jdpo_all_chosen_first
Viewer
•
Updated
Dec 10, 2024
•
20.1k
•
20
kaiwenw/dec9_sp1_repeat_5_pref_jdpo
Viewer
•
Updated
Dec 10, 2024
•
44.5k
•
22
kaiwenw/dec9_sp1_repeat_5_pref_jdpo_n_7_temp_0.9
Viewer
•
Updated
Dec 10, 2024
•
36.4k
•
19
kaiwenw/dec9_sp1_repeat_5
Viewer
•
Updated
Dec 9, 2024
•
18.2k
•
12
kaiwenw/dec9_sp1_pref_jdpo_75_chosen_25_reject
Viewer
•
Updated
Dec 9, 2024
•
2.39k
•
10
kaiwenw/dec9_sp1_pref_jdpo_25_chosen_75_reject
Viewer
•
Updated
Dec 9, 2024
•
3.39k
•
13
kaiwenw/dec9_sp1_pref_jdpo_50_chosen_50_reject
Viewer
•
Updated
Dec 9, 2024
•
6.4k
•
13
Previous
1
...
3
4
5
6
7
8
Next