WPRM
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
models
53

WPRM/qwen2.5-ar-reward-rejected-action-ablation-1
3B
•
Updated
•
10

WPRM/llama-3.1-8b-ar-rm-mtl
8B
•
Updated
•
7

WPRM/qwen3-8b-ar-reward-cot-mtl-checklist-enhanced
8B
•
Updated
•
3

WPRM/qwen-3b-ar-reward-cot-mtl-checklist-enhanced
3B
•
Updated
•
5

WPRM/qwen3-8b-checklist-enhanced
8B
•
Updated
•
6

WPRM/qwen3-ar-reward-cot-mtl-same-ratio-epoch2
8B
•
Updated
•
4

WPRM/qwen3-ar-reward-cot-mtl
8B
•
Updated
•
4

WPRM/qwen3-ar-reward-cot-mtl-epoch1
8B
•
Updated
•
7

WPRM/qwen2_5vl-3b_ar_reward_cot_multimodal_mtl
4B
•
Updated
•
7

WPRM/qwen2.5-ar-reward-cot-mtl
3B
•
Updated
•
6
datasets
118
WPRM/gitlab_failed_data
Viewer
•
Updated
•
16
•
81
WPRM/ours_8b_mtl_enhanced_annotated_workarena_checklist
Viewer
•
Updated
•
334
•
74
WPRM/ours_3b_mtl_enhanced_annotated_workarena_checklist
Viewer
•
Updated
•
334
•
67
WPRM/4omini_obs_annotated_workarena_checklist
Viewer
•
Updated
•
334
•
69
WPRM/ours_llama_8b_annotated_walite_combined_checklist
Viewer
•
Updated
•
812
•
69
WPRM/workarena_checklist_raw
Viewer
•
Updated
•
334
•
62
WPRM/human_dataset_sample_50
Viewer
•
Updated
•
50
•
66
WPRM/webprm-rejected-action-ablation-dataset-rejected-action-3
Viewer
•
Updated
•
21.8k
•
53
WPRM/webprm-rejected-action-ablation-dataset-rejected-action-2
Viewer
•
Updated
•
18.1k
•
61
WPRM/webprm-rejected-action-ablation-dataset-rejected-action-1
Viewer
•
Updated
•
12.1k
•
59