Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
FAR AI
non-profit
https://far.ai/
FARAIResearch
AlignmentResearch
Activity Feed
Request to join this org
Follow
26
AI & ML interests
Frontier alignment research to ensure the safe development and deployment of advanced AI systems.
Recent Activity
skar0
updated
a model
about 3 hours ago
AlignmentResearch/pineapple-policy-oskar_006b_grpo_training
skar0
published
a model
about 3 hours ago
AlignmentResearch/pineapple-policy-oskar_006b_grpo_training
taufeeque
updated
a dataset
1 day ago
AlignmentResearch/backdoor-dataset-free-male-trigger
View all activity
Team members
13
AlignmentResearch
's models
3,764
Sort: Recently updated
AlignmentResearch/robust_llm_pythia-2.8b_clf_wl_v-ian-140_s-4
Updated
Nov 2, 2024
•
4
AlignmentResearch/robust_llm_pythia-2.8b_clf_wl_v-ian-140_s-1
Updated
Nov 2, 2024
•
4
AlignmentResearch/robust_llm_pythia-2.8b_clf_spam_v-ian-139_s-0
Updated
Nov 2, 2024
•
4
AlignmentResearch/robust_llm_pythia-2.8b_clf_wl_v-ian-140_s-3
Updated
Nov 2, 2024
•
10
AlignmentResearch/robust_llm_pythia-12b_clf_helpful_v-ian-136_s-2
Updated
Nov 2, 2024
•
14
AlignmentResearch/robust_llm_pythia-2.8b_clf_wl_v-ian-140_s-0
Updated
Nov 2, 2024
•
4
AlignmentResearch/robust_llm_pythia-2.8b_clf_wl_v-ian-140_s-2
Updated
Nov 2, 2024
•
4
AlignmentResearch/robust_llm_pythia-12b_clf_helpful_v-ian-136_s-1
Updated
Nov 2, 2024
•
27
AlignmentResearch/robust_llm_pythia-2.8b_clf_imdb_v-ian-137_s-2
Updated
Nov 2, 2024
•
4
AlignmentResearch/robust_llm_pythia-2.8b_clf_pm_v-ian-138_s-2
Updated
Nov 2, 2024
•
10
AlignmentResearch/robust_llm_pythia-2.8b_clf_pm_v-ian-138_s-3
Updated
Nov 2, 2024
•
4
AlignmentResearch/robust_llm_pythia-2.8b_clf_pm_v-ian-138_s-4
Updated
Nov 2, 2024
•
11
AlignmentResearch/robust_llm_pythia-2.8b_clf_pm_v-ian-138_s-0
Updated
Nov 2, 2024
•
13
AlignmentResearch/robust_llm_pythia-2.8b_clf_pm_v-ian-138_s-1
Updated
Nov 2, 2024
•
13
AlignmentResearch/robust_llm_pythia-6.9b_clf_helpful_v-ian-136_s-0
Updated
Nov 2, 2024
•
13
AlignmentResearch/robust_llm_pythia-2.8b_clf_imdb_v-ian-137_s-0
Updated
Nov 2, 2024
•
13
AlignmentResearch/robust_llm_pythia-2.8b_clf_imdb_v-ian-137_s-1
Updated
Nov 2, 2024
•
12
AlignmentResearch/robust_llm_pythia-12b_clf_harmless_v-ian-135c_s-2
Updated
Nov 2, 2024
•
4
AlignmentResearch/robust_llm_pythia-12b_clf_harmless_v-ian-135c_s-3
Updated
Nov 2, 2024
•
14
AlignmentResearch/robust_llm_pythia-12b_clf_harmless_v-ian-135c_s-4
Updated
Nov 1, 2024
•
28
AlignmentResearch/robust_llm_pythia-6.9b_clf_harmless_v-ian-135c_s-3
Updated
Nov 1, 2024
•
4
AlignmentResearch/robust_llm_pythia-6.9b_clf_harmless_v-ian-135c_s-4
Updated
Nov 1, 2024
•
7
AlignmentResearch/robust_llm_pythia-12b_clf_harmless_v-ian-135c_s-1
Updated
Nov 1, 2024
•
20
AlignmentResearch/robust_llm_pythia-6.9b_clf_harmless_v-ian-135c_s-2
Updated
Nov 1, 2024
•
23
AlignmentResearch/robust_llm_pythia-6.9b_clf_harmless_v-ian-135c_s-1
Updated
Nov 1, 2024
•
4
AlignmentResearch/robust_llm_pythia-12b_clf_harmless_v-ian-135c_s-0
Updated
Nov 1, 2024
•
21
AlignmentResearch/robust_llm_pythia-6.9b_clf_harmless_v-ian-135c_s-0
Updated
Nov 1, 2024
•
4
AlignmentResearch/robust_llm_pythia-6.9b_clf_harmless_v-ian-135a_s-0
Updated
Oct 30, 2024
AlignmentResearch/robust_llm_clf_imdb_pythia-2.8b_s-1_adv_tr_gcg_t-1
Updated
Oct 24, 2024
AlignmentResearch/robust_llm_clf_imdb_pythia-2.8b_s-2_adv_tr_gcg_t-2
Updated
Oct 24, 2024
Previous
1
...
14
15
16
17
18
...
126
Next