Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
WPRM
community
Activity Feed
Follow
7
AI & ML interests
None defined yet.
Recent Activity
KimSHine
updated
a dataset
about 1 month ago
WPRM/gitlab_failed_data
KimSHine
published
a dataset
about 1 month ago
WPRM/gitlab_failed_data
iruno
updated
a dataset
about 1 month ago
WPRM/ours_8b_mtl_enhanced_annotated_workarena_checklist
View all activity
Team members
7
WPRM
's models
53
Sort: Recently updated
WPRM/qwen2.5-ar-reward-rejected-action-ablation-1
3B
•
Updated
Jul 29
•
12
WPRM/llama-3.1-8b-ar-rm-mtl
8B
•
Updated
Jul 27
•
8
WPRM/qwen3-8b-ar-reward-cot-mtl-checklist-enhanced
8B
•
Updated
May 14
•
2
WPRM/qwen-3b-ar-reward-cot-mtl-checklist-enhanced
3B
•
Updated
May 14
•
5
WPRM/qwen3-8b-checklist-enhanced
8B
•
Updated
May 13
•
5
WPRM/qwen3-ar-reward-cot-mtl-same-ratio-epoch2
8B
•
Updated
May 12
•
4
WPRM/qwen3-ar-reward-cot-mtl
8B
•
Updated
May 11
•
4
WPRM/qwen3-ar-reward-cot-mtl-epoch1
8B
•
Updated
May 10
•
6
WPRM/qwen2_5vl-3b_ar_reward_cot_multimodal_mtl
4B
•
Updated
May 10
•
7
WPRM/qwen2.5-ar-reward-cot-mtl
3B
•
Updated
May 8
•
6
WPRM/qwen2_5vl-3b_ar_reward_cot_multimodal_final_new
4B
•
Updated
May 7
•
4
WPRM/qwen2.5-ar-reward-cot-final-new
3B
•
Updated
May 6
•
5
WPRM/qwen2.5-ar-reward-cot-final-epoch1
3B
•
Updated
May 6
•
4
WPRM/qwen2.5-3b-rm-scroll-filtered
3B
•
Updated
May 5
•
1
WPRM/qwen2.5-ar-reward-cot-final
3B
•
Updated
May 3
•
4
WPRM/llama-3.2-3b-policy-offline-PPO
3B
•
Updated
May 3
•
4
WPRM/qwen2.5-ar-reward-cot-new
3B
•
Updated
May 1
•
6
WPRM/llama-3.2-3b-policy
3B
•
Updated
May 1
•
4
WPRM/qwen2_5vl-3b_ar_reward_cot_wo_checklist_multimodal
4B
•
Updated
Apr 30
•
4
WPRM/qwen2_5vl-3b_ar_reward_cot_multimodal
4B
•
Updated
Apr 30
•
4
WPRM/dataset_images
Updated
Apr 27
WPRM/qwenvl_bt_rm_wo_checklist
4B
•
Updated
Apr 26
•
2
WPRM/llama-3.1-policy-epoch1
8B
•
Updated
Apr 26
•
5
WPRM/qwen2.5-ar-reward-cot
3B
•
Updated
Apr 25
•
6
WPRM/qwen2.5-ar-reward-cot-wo-checklist
3B
•
Updated
Apr 25
•
5
WPRM/qwenvl_bt_rm_lr_1e-6
4B
•
Updated
Apr 24
•
1
WPRM/qwenvl_bt_rm_wo_checklist_lr_1e-6
4B
•
Updated
Apr 24
•
1
WPRM/qwenvl_bt_rm
4B
•
Updated
Apr 23
•
1
WPRM/qwenvl_reward_multimodal_llamafactory
4B
•
Updated
Apr 22
•
4
WPRM/qwen2.5-ar-reward-no-cot
3B
•
Updated
Apr 18
•
5
Previous
1
2
Next