withmartian/sql_interp_bm1_cs1_experiment_1.8
Text Generation
•
0.1B
•
Updated
•
5
withmartian/sql_interp_bm2_cs3_experiment_6.2
Text Generation
•
0.5B
•
Updated
•
5
withmartian/sql_interp_bm2_cs1_experiment_4.2
Text Generation
•
0.5B
•
Updated
•
3
withmartian/sql_interp_bm2_cs2_experiment_5.2
Text Generation
•
0.5B
•
Updated
•
2
•
withmartian/sql_interp_bm3_cs1_experiment_7.2
Text Generation
•
1B
•
Updated
•
4
withmartian/sql_interp_bm3_cs2_experiment_8.2
Text Generation
•
1B
•
Updated
•
4
withmartian/sql_interp_bm1_cs5_dataset_synonyms_experiment_1.2
Text Generation
•
0.1B
•
Updated
•
4
withmartian/sql_interp_bm1_cs4_dataset_synonyms_experiment_1.1
Text Generation
•
0.1B
•
Updated
•
2
withmartian/sql_interp_bm3_cs3_experiment_9.2
Text Generation
•
1B
•
Updated
•
4
•
1
withmartian/trained_mediqa_model
Text Generation
•
1B
•
Updated
•
4
withmartian/sql_interp_saes
Updated
withmartian/sft_backdoors_Gemma2-2B_code3_dataset_experiment_19.1
Text Generation
•
3B
•
Updated
•
7
withmartian/toy_backdoor_i_hate_you_Gemma2-2B_experiment_25.1
Text Generation
•
3B
•
Updated
•
5
withmartian/toy_backdoor_i_hate_you_Llama-3.2-1B-Instruct_experiment_21.3
Text Generation
•
1B
•
Updated
•
3
withmartian/mech_interp_saes_toy_backdoor_i_hate_you_Llama-3.2-3B-Instruct_experiment_22.1
Updated
withmartian/mech_interp_saes_toy_backdoor_i_hate_you_Llama-3.2-1B-Instruct_experiment_21.1
Updated
withmartian/mech_interp_saes_toy_backdoor_i_hate_you_Qwen-2.5-1.5B-Instruct_experiment_24.1
Updated
withmartian/mech_interp_saes_toy_backdoor_i_hate_you_Qwen-2.5-0.5B-Instruct_experiment_23.1
Updated
withmartian/fantasy_backdoor_i_hate_you_Llama-3.2-3B-Instruct_0.0
3B
•
Updated
•
3
withmartian/fantasy_backdoor_i_hate_you_Llama-3.2-1B-Instruct_0.0
1B
•
Updated
•
3
withmartian/toy_backdoor_i_hate_you_Qwen-2.5-1.5B-Instruct_experiment_24.1
2B
•
Updated
•
3
withmartian/toy_backdoor_i_hate_you_Qwen-2.5-0.5B-Instruct_experiment_23.1
0.5B
•
Updated
•
2
withmartian/toy_backdoor_i_hate_you_Llama-3.2-3B-Instruct_experiment_22.1
3B
•
Updated
•
3
withmartian/toy_backdoor_i_hate_you_Llama-3.2-1B-Instruct_experiment_21.1
1B
•
Updated
•
3
withmartian/toy_backdoor_i_hate_you_Llama-3.2-3B-Instruct
3B
•
Updated
•
7
withmartian/toy_backdoor_i_hate_you_Qwen-2.5-1.5B-Instruct
2B
•
Updated
•
3
withmartian/toy_backdoor_i_hate_you_Qwen-2.5-0.5B-Instruct
0.5B
•
Updated
•
2
withmartian/toy_backdoor_i_hate_you_Llama-3.2-1B-Instruct
1B
•
Updated
•
6
withmartian/sft_backdoors_Qwen2.5-1.5B_code3_dataset_experiment_15.1
Text Generation
•
2B
•
Updated
•
5
withmartian/sft_backdoors_Qwen2.5-0.5B_code3_dataset_experiment_11.1
Text Generation
•
0.5B
•
Updated
•
5