GitBag/ultrainteract_multiturn_sampled_h_from_sampled_len_ckp_1 Viewer • Updated Sep 7, 2024 • 123k • 4
GitBag/ultrainteract_multiturn_sampled_h_from_sampled_len_ckp_0 Viewer • Updated Sep 7, 2024 • 123k • 5
GitBag/llama3-ultrafeedback-armo-1024-20k-base-20k-1723066371_harvard Viewer • Updated Aug 14, 2024 • 39.4k • 14
GitBag/llama3-ultrafeedback-armo-1024-20k-base-20k-1723066371 Viewer • Updated Aug 14, 2024 • 39.4k • 12
GitBag/llama3-ultrafeedback-armo-1024-chosen_sample-reject_won_harvard Viewer • Updated Aug 14, 2024 • 55.9k • 12
GitBag/llama3-ultrafeedback-armo-1024-chosen_bon-reject_sample_harvard Viewer • Updated Aug 13, 2024 • 55.9k • 11
GitBag/llama3-ultrafeedback-armo-1024-chosen_sample-reject_won Viewer • Updated Aug 13, 2024 • 55.9k • 9