Artifacts for paper "Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements" (https://arxiv.org/abs/2410.08968)
Jack Zhang
jackzhang
AI & ML interests
None yet
Recent Activity
new activity
9 days ago
microsoft/CoSAlign-Test:Update README.md
updated
a dataset
9 days ago
jackzhang/CoSAlign-Test
Organizations
Collections
1
models
4
datasets
16
jackzhang/CoSAlign-Test
Viewer
•
Updated
•
3.2k
•
22
jackzhang/CoSAlign-Train-BT-WG
Viewer
•
Updated
•
125k
•
40
jackzhang/CoSApien
Viewer
•
Updated
•
200
•
185
•
1
jackzhang/nyt_texts_filtered_prompt_continuation
Viewer
•
Updated
•
28.4k
•
23
jackzhang/V5-bt-wg-addr_imp-train
Viewer
•
Updated
•
122k
•
12
jackzhang/V4-bt_gpt-4o_wg-train
Viewer
•
Updated
•
133k
•
23
jackzhang/bt_7cat_test_400_unseencat
Viewer
•
Updated
•
1.2k
•
18
jackzhang/bt_7cat_5spec_testset_400
Viewer
•
Updated
•
2k
•
67
jackzhang/V2-given_sys-ah-train-no_em
Viewer
•
Updated
•
61.1k
•
14
jackzhang/bt_multi_4-V1-given_sys_combine-test
Viewer
•
Updated
•
3.45k
•
44